Geo-Image-Textualization
Reinforcement learning-based framework for generating semantically aligned geometry image-caption pairs
Enhancing the interpretability of LLaVA by uncovering underlying attention mechanisms and developing adaptive pruning techniques