To Thanh Dat

Undergraduate Student @ University of Science, VNU-HCM

My research focuses on Computer Vision, alongside Efficient and Faithful Vision-Language Models (VLMs) and Multimodal Large Language Models (MLLMs). My long-term goal is to build Efficient and Reliable Multimodal Agents with Adaptive Learning for Lifelong Learning.

Email GitHub Resume

What I do

Current research and engineering projects centered on efficient, mobile-friendly vision models, multimodal reasoning, and practical deployment.

My portfolio My Papers My Blog

Events

2021

Started study @ Ben Tre High School for Gifted Students

2024

Started Bachelor's degree in Information Technology High-Quality Program @ HCMUS

2025

First paper published @ ICCVW 2025

2026

Second paper published @ CVPRW 2026

Worked as a collaborator @ GStar Summit 2026

Interests

Outside research, I enjoy learning about systems, building applications based on my research, reading about new ideas, and contributing to tech communities.

Deep LearningComputer VisionMultimodal ModelsEfficient ComputingBuilding Communities

Unserious Stuff

Enter the password (if yk yk) to unlock a more unserious corner of the homepage.