To Thanh Dat

Undergraduate Student @ University of Science, VNU-HCM

My research focuses on Computer Vision, alongside efficient and faithful Vision-Language Models (VLMs) and Multimodal Large Language Models (MLLMs). My long-term goal is to build efficient, reliable multimodal agents that use adaptive learning to keep learning over their lifetime.

What I do

My current research and engineering projects center on efficient, mobile-friendly vision models, multimodal reasoning, and practical deployment.

Events

2021
Started studying @ Ben Tre High School for Gifted Students
2024
Started a Bachelor's degree in Information Technology (High-Quality Program) @ HCMUS
2025
First paper published @ ICCVW 2025
2026
Second paper published @ CVPRW 2026
Worked as a collaborator @ GStar Summit 2026

Interests

Outside research, I enjoy learning about systems, building applications based on my research, reading about new ideas, and contributing to tech communities.

Deep Learning, Computer Vision, Multimodal Models, Efficient Computing, Building Communities

Unserious Stuff

Enter the password (if yk yk) to unlock a more unserious corner of the homepage.

The portfolio version of this site lives at /portfolio.

© 2026 Tô Thành Đạt. All rights reserved.