Computer Vision · Perception · Generative AI · Remote Sensing
Applied research in computer vision, perception, and generative AI.
A portfolio of research-driven systems and implementation studies across computational imaging, geometric vision, diffusion models, remote sensing, and camera-based perception. These projects emphasize building models and pipelines from first principles, analyzing failure modes, and adapting modern vision methods to real visual data.
Remote Sensing
UAV-Based Tree Crown Detection for Urban Tree Stress Analysis
Developed a multimodal vision pipeline for detecting and segmenting individual tree crowns from UAV imagery, combining RGB, vegetation, and height-derived signals with GroundingDINO, Segment Anything, and bounding-box refinement.
Computational Imaging
Light Field Refocusing and Synthetic Aperture Rendering
Implemented light-field rendering techniques for computational refocusing and synthetic aperture control using structured camera arrays and handheld capture experiments.
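As a rough illustration of computational refocusing, the sketch below uses the standard shift-and-add approach: each view is shifted in proportion to its camera offset, then all views are averaged. The function name `refocus`, the `(row, col)` offset convention, and the integer-pixel shifts with wrap-around borders are assumptions made for this example, not details taken from the project.

```python
import numpy as np

def refocus(views, positions, alpha):
    """Shift-and-add refocusing over a set of sub-aperture views.

    views:     array of shape (N, H, W), one grayscale image per camera
    positions: array of shape (N, 2), (row, col) camera offsets from the array center
    alpha:     scalar that selects the focal plane (0 leaves every view unshifted)
    """
    views = np.asarray(views, dtype=np.float64)
    positions = np.asarray(positions, dtype=np.float64)
    acc = np.zeros(views.shape[1:], dtype=np.float64)
    for img, (dr, dc) in zip(views, positions):
        # Integer shift proportional to the camera offset.
        # np.roll wraps around at the borders, which a real pipeline would crop.
        shift = (int(round(alpha * dr)), int(round(alpha * dc)))
        acc += np.roll(img, shift, axis=(0, 1))
    return acc / len(views)
```

Sweeping `alpha` moves the plane of focus through the scene. Passing only the views closest to the array center narrows the synthetic aperture and deepens the depth of field.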
Augmented Reality
Marker-Based Augmented Reality with Camera Calibration
Built a camera-calibrated AR pipeline that tracks 2D keypoints, estimates projection matrices, and renders 3D geometry into real video frames.
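One common way to estimate a projection matrix from tracked 2D keypoints with known 3D positions is the direct linear transform (DLT). The sketch below assumes at least six non-coplanar correspondences and skips coordinate normalization. The names `estimate_projection` and `project` are illustrative and do not come from the project.

```python
import numpy as np

def estimate_projection(points_3d, points_2d):
    """Estimate a 3x4 projection matrix (up to scale) with the DLT."""
    points_3d = np.asarray(points_3d, dtype=np.float64)
    points_2d = np.asarray(points_2d, dtype=np.float64)
    rows = []
    for (X, Y, Z), (u, v) in zip(points_3d, points_2d):
        Xh = [X, Y, Z, 1.0]
        # Each correspondence gives two linear constraints on the 12 entries of P.
        rows.append(Xh + [0.0] * 4 + [-u * c for c in Xh])
        rows.append([0.0] * 4 + Xh + [-v * c for c in Xh])
    # The solution is the right singular vector with the smallest singular value.
    _, _, vt = np.linalg.svd(np.array(rows))
    return vt[-1].reshape(3, 4)

def project(P, points_3d):
    """Project 3D points into the image with P and dehomogenize."""
    points_3d = np.asarray(points_3d, dtype=np.float64)
    homog = np.hstack([points_3d, np.ones((len(points_3d), 1))])
    proj = homog @ P.T
    return proj[:, :2] / proj[:, 2:3]
```

With a matrix estimated per frame, rendering 3D geometry reduces to calling `project` on the model vertices and drawing the resulting 2D edges on the frame.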
Diffusion Models
Denoising U-Net Training for Digit Diffusion
Trained a compact U-Net diffusion model for noisy image restoration, timestep-conditioned denoising, and label-conditioned digit generation.
Generative AI
Diffusion-Based Image Generation, Editing, and Restoration
Explored diffusion-based generation and editing with iterative denoising, classifier-free guidance, image-to-image translation, inpainting, and prompt-conditioned visual transformations.
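Classifier-free guidance is usually expressed as an extrapolation from the unconditional noise estimate toward the conditional one. The sketch below shows only that combination step. The function name `cfg_noise` and the convention that a scale of 1 recovers the conditional estimate are assumptions for illustration. The two inputs stand in for the outputs of a denoiser that is not shown here.

```python
import numpy as np

def cfg_noise(eps_uncond, eps_cond, scale):
    """Combine unconditional and conditional noise estimates.

    scale = 0 returns the unconditional estimate, scale = 1 the conditional one,
    and scale > 1 pushes the estimate further toward the prompt condition.
    """
    eps_uncond = np.asarray(eps_uncond, dtype=np.float64)
    eps_cond = np.asarray(eps_cond, dtype=np.float64)
    return eps_uncond + scale * (eps_cond - eps_uncond)
```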
Robust Estimation
Panorama Reconstruction with Homographies, Feature Matching, and RANSAC
Built a panorama reconstruction pipeline combining homography estimation, feature detection, adaptive non-maximal suppression, descriptor matching, RANSAC, and image warping.
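The robust-estimation step can be sketched as follows: fit a homography to four random matches with the DLT, count the matches it explains, keep the largest consensus set, and refit on it. Function names, the 2-pixel threshold, and the iteration count are assumptions for this example. Feature detection, descriptor matching, and coordinate normalization are omitted.

```python
import numpy as np

def fit_homography(src, dst):
    """Least-squares homography (up to scale) from >= 4 point pairs via the DLT."""
    rows = []
    for (x, y), (u, v) in zip(src, dst):
        rows.append([x, y, 1, 0, 0, 0, -u * x, -u * y, -u])
        rows.append([0, 0, 0, x, y, 1, -v * x, -v * y, -v])
    _, _, vt = np.linalg.svd(np.asarray(rows, dtype=np.float64))
    return vt[-1].reshape(3, 3)

def apply_homography(H, pts):
    """Map 2D points through H and dehomogenize."""
    pts = np.asarray(pts, dtype=np.float64)
    homog = np.hstack([pts, np.ones((len(pts), 1))]) @ H.T
    # Degenerate samples can send points to infinity; those never count as inliers.
    with np.errstate(divide="ignore", invalid="ignore"):
        return homog[:, :2] / homog[:, 2:3]

def ransac_homography(src, dst, n_iters=500, thresh=2.0, seed=0):
    """Return (H, inlier_mask) for the largest consensus set found."""
    src = np.asarray(src, dtype=np.float64)
    dst = np.asarray(dst, dtype=np.float64)
    rng = np.random.default_rng(seed)
    best_inliers = np.zeros(len(src), dtype=bool)
    for _ in range(n_iters):
        idx = rng.choice(len(src), size=4, replace=False)
        H = fit_homography(src[idx], dst[idx])
        err = np.linalg.norm(apply_homography(H, src) - dst, axis=1)
        inliers = err < thresh
        if inliers.sum() > best_inliers.sum():
            best_inliers = inliers
    if best_inliers.sum() < 4:
        raise ValueError("RANSAC found fewer than 4 inliers")
    # Refit on all inliers for a more stable final estimate.
    return fit_homography(src[best_inliers], dst[best_inliers]), best_inliers
```

The returned matrix is defined only up to scale, so divide by `H[2, 2]` before comparing it with a reference or passing it to an image-warping routine that expects that normalization.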
Geometric Vision
Geometric Face Morphing and Population-Average Modeling
Developed a geometry-based morphing pipeline using facial correspondences, Delaunay triangulation, affine warping, population averages, and controlled extrapolation.
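The two geometric building blocks of this kind of morph can be sketched briefly: interpolating (or extrapolating) corresponding landmark positions, and solving for the affine map that carries one triangle onto another. The function names are hypothetical, and the triangulation itself plus the per-pixel resampling are left out.

```python
import numpy as np

def blend_shape(src_pts, dst_pts, t):
    """Interpolate landmark positions; t outside [0, 1] extrapolates past either shape."""
    src_pts = np.asarray(src_pts, dtype=np.float64)
    dst_pts = np.asarray(dst_pts, dtype=np.float64)
    return (1.0 - t) * src_pts + t * dst_pts

def triangle_affine(src_tri, dst_tri):
    """Return the 2x3 affine matrix that maps the three src vertices onto dst."""
    src = np.hstack([np.asarray(src_tri, dtype=np.float64), np.ones((3, 1))])
    dst = np.asarray(dst_tri, dtype=np.float64)
    # Solve src @ M.T = dst; src must describe a non-degenerate triangle.
    return np.linalg.solve(src, dst).T

def apply_affine(M, pts):
    """Apply a 2x3 affine matrix to an array of 2D points."""
    pts = np.asarray(pts, dtype=np.float64)
    return pts @ M[:, :2].T + M[:, 2]
```

A morph frame at parameter `t` would compute `blend_shape` for the landmarks, then warp each triangle of both images into that intermediate shape before cross-dissolving the colors.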
Frequency Analysis
Frequency-Domain Image Processing and Multi-Resolution Blending
Implemented classical frequency-domain and multi-resolution techniques for edge detection, sharpening, hybrid images, Fourier analysis, and seamless image blending.
Computational Imaging
Computational Restoration of Historical Glass-Plate Photography
Built a multiscale image-alignment pipeline to reconstruct color photographs from historical glass-plate negatives using pyramid search, edge-aware scoring, and channel registration.
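A coarse-to-fine search of this kind can be sketched as below: align heavily downsampled channels first, then double the estimate and refine it within a small window at each finer level. This is a minimal version under stated assumptions. It scores raw intensities with normalized cross-correlation rather than an edge-aware measure, uses wrap-around shifts, and all function names and default parameters are illustrative.

```python
import numpy as np

def downsample(img):
    """Halve resolution by averaging 2x2 blocks."""
    h, w = img.shape[0] // 2 * 2, img.shape[1] // 2 * 2
    return img[:h, :w].reshape(h // 2, 2, w // 2, 2).mean(axis=(1, 3))

def ncc(a, b):
    """Normalized cross-correlation of two equally sized images."""
    a = a - a.mean()
    b = b - b.mean()
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float((a * b).sum() / denom) if denom > 0 else 0.0

def best_shift(ref, img, center, radius):
    """Exhaustively search shifts within `radius` of `center`; return the best (dr, dc)."""
    best, best_score = center, -np.inf
    for dr in range(center[0] - radius, center[0] + radius + 1):
        for dc in range(center[1] - radius, center[1] + radius + 1):
            score = ncc(ref, np.roll(img, (dr, dc), axis=(0, 1)))
            if score > best_score:
                best, best_score = (dr, dc), score
    return best

def pyramid_align(ref, img, levels=3, radius=2):
    """Return the (dr, dc) shift that best registers img to ref."""
    if levels == 0:
        return best_shift(ref, img, (0, 0), radius)
    coarse = pyramid_align(downsample(ref), downsample(img), levels - 1, radius)
    # A shift of one pixel at the coarser level is two pixels at this level.
    return best_shift(ref, img, (2 * coarse[0], 2 * coarse[1]), radius)
```

Registering a channel then amounts to `np.roll(channel, pyramid_align(reference, channel), axis=(0, 1))`. Swapping the inputs for gradient-magnitude images is one way to make the score respond to edges instead of raw brightness.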