Paper Short Showcase

Bridging the Training-Deployment Gap: Gated Encoding and Multi-Scale Refinement for Efficient Quantization-Aware Image Enhancement
CVPRW 2026 Mobile AI Workshop

Bridging the Training-Deployment Gap: Gated Encoding and Multi-Scale Refinement for Efficient Quantization-Aware Image Enhancement

QATIE

We present a quantization-aware image enhancement pipeline that preserves quality after deployment by training the model to handle quantization noise directly. The method combines gated encoding, multi-scale refinement, and fake quantization during training to support mobile-friendly inference with strong visual fidelity.

STER-VLM: Spatio-Temporal With Enhanced Reference Vision-Language Models
ICCVW 2025 AI City Challenge

STER-VLM: Spatio-Temporal With Enhanced Reference Vision-Language Models

STER-VLM

We fine-tune a vision-language model for traffic video captioning and question answering by separating spatial and temporal reasoning, then filtering frames to better capture the most relevant context in each scenario.

Template structure inspired by Nerfies.

@ 2026 Tô Thành Đạt. All rights reserved