Course Projects at NTU
This collection reflects my work in Deep Learning/Computer Vision/Image Processing.
These projects are implemented in Python and C++.
-
2024 Fall
Deep Learning for Computer Vision
- Final Project: Multi-concept Text-to-Image Personalization
- 3D novel view synthesis (3D gaussian splatting)
- Zero-shot image captioning with LLaVA
- Image captioning using PEFT on vision language models
- Conditional diffusion models (DDPM)
- Generating human faces (DDIM)
- Text-to-image personalization (DDPM, stable diffusion)
- Semantic segmentation (FCN, DeepLabV3, SAM)
- Self-supervised learning (ResNet)
-
2024 Fall
Principles and Applications of Digital Image Processing
-
2023 Fall
Introduction to Artificial Intelligence