Portfolio

Geo-Image-Textualization

Reinforcement-learning-based framework for generating semantically aligned geometry image-caption pairs

MLLM Interpretability Research

Improving the interpretability of LLaVA by analyzing its underlying attention mechanisms and developing adaptive pruning techniques