Avatar

Manh Tuan Do

AI & Computer Vision Enthusiast | M.Sc. Student @Paris-Saclay

About Me

My name is Manh Tuan Do. In 2023, I graduated with a Bachelor of Mechatronics (Valedictorian) at Vietnam National University, Hanoi (VNU), then I was retained as a teaching assistant at VNU under the guidance of Dr. Ha Manh Hung and conducted research work with Prof. Oscal T.-C. Chen. In 2024, I received a fully-funded scholarship for a Master's M1 in Electrical Engineering (Computer Vision & AI track) at Paris-Saclay University. Now I am studying M2 in Mechatronics, Machine Vision, and Artificial Intelligence also at Paris-Saclay University, and I am fortunate and honored to be a research intern at Westlake under the guidance of Prof. Huan Wang.

In terms of my research interests, initially, when I was a bachelor student combined with the foundation in robotics and mechatronics, I decided to focus on AI applications including the implementation of AI lightweight models into hardware, embedded systems, microcontrollers to make the system more intelligent in course projects and student research. Up to the thesis, I began modifying the architecture of models by finding ways to optimize the model for lightweight and accuracy improvements in the object detection problem, specifically in the YOLO model.

Recently, I have been exploring 3D object detection and 3D reconstruction using geometric methods like SFM and using DNNs like NeRF. Besides, I have studied GAN-based data augmentation and unpaired neural style transfer between images of two classes (cow, horse).

May not yet I have a pure foundation in math or computer science, but I can affirm that I am on the journey to becoming a genuine AI research expert in the near future. Waiting for me !!

  • 2025–2026: M.Sc. M2 Mechatronics, Machine Vision and Artificial Intelligence, Paris-Saclay University
  • 2024–2025: M.Sc. M1 Electrical Engineering (Computer Vision Track), Paris-Saclay University
  • 2019–2023: B.Sc. Mechatronics, VNU-HN (Valedictorian)
  • 2022-2024: Teaching Assistant, Vietnam National University
  • 2023-2025: Research Assistant, National Chung Cheng University
  • 2025: Research Intern, Westlake University

Research Interests

Key Projects

Plant Disease Detection

Transformer-based Plant Disease Detection

Novel lightweight YOLO model with SCAT and Ghost convolution for real-time strawberry disease detection. Achieved mAP@0.5 of 80.3% with 8% improvement over baseline.

UAV Detection

UAV-based Human Detection

Enhanced image processing algorithms for unmanned aerial vehicle operations. Built dataset with 6,600+ labeled bounding boxes across various environments.

Medical AI

Minimally Invasive Surgical Tool Detection

Deep learning framework for real-time surgical tool detection using YOLOv8s. Achieved 98.6% mAP@0.5 on Surgical Tools and 95.7% on m2cai16-tool-locations dataset.

Conveyor Belt

AI-Based Product Classification

Developed AI-based sorting system using YOLOv5 on Raspberry Pi with infrared sensors. Achieved 90%+ accuracy in classifying plastic bottles and water containers.

PPE Detection

Personal Protective Equipment Detection

Improved YOLOv5 for real construction site PPE detection with targeted augmentations and attention mechanisms. Achieved 95% mAP for on-site safety monitoring.

Airplane Detection

YOLO-Ghost Airplane Detection

Enhanced YOLO-Ghost with C2f and CIoU loss for improved airplane detection accuracy on satellite imagery. Optimized for lightweight deployment.

Publications

First-author: proposed a cost-sensitive loss for transformer detectors in real-time plant disease detection.

DOI | Full Text

Developed a non-local GCN architecture for SQL-injection detection with high accuracy and low complexity.

PDF

Introduced a multi-head residual attention GCN to improve hand-point classification in visual communication tasks.

PDF

Enhanced YOLO-Ghost with C2f and CIoU loss to boost airplane detection accuracy on satellite imagery.

Proceedings

Proposed RHM, a non-local GCN variant, achieving state-of-the-art performance on SQL injection datasets.

PDF

Applied transformer-based local-global feature fusion for plant pathology classification with high F1-score.

PDF

Demonstrated a CNN-based detector for lychee ripeness stages, achieving 92% accuracy across varied lighting.

PDF

Integrated transformer modules into YOLO for robust UAV-based human detection in diverse environments.

DOI

Enhanced YOLOv5 with targeted augmentations and attention, achieving 95% mAP for PPE detection on site.

PDF

💡 Challenge Myself