Sitemap

A list of all the posts and pages found on the site. For you robots out there, an XML version is also available for digesting.

Pages

Posts

Future Blog Post

less than 1 minute read

Published:

This post will show up by default. To keep future-dated posts from being published, edit _config.yml and set future: false.
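As a minimal sketch of the setting mentioned above, assuming a standard Jekyll site whose configuration lives in _config.yml (the comments are illustrative and not taken from this site's actual file):

```yaml
# _config.yml (excerpt; the rest of the file is omitted)
# When false, Jekyll skips posts whose date is later than the build time.
future: false
```

Jekyll reads _config.yml only at startup, so a running `jekyll serve` needs a restart before the change takes effect.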

Blog Post number 4

less than 1 minute read

Published:

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Blog Post number 3

less than 1 minute read

Published:

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Blog Post number 2

less than 1 minute read

Published:

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Blog Post number 1

less than 1 minute read

Published:

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Portfolio

Geo-Image-Textualization

Reinforcement learning-based framework for generating semantically aligned geometry image-caption pairs

MLLM Interpretability Research

Enhancing the interpretability of LLaVA by uncovering underlying attention mechanisms and developing adaptive pruning techniques

Publications

Continual Learning of Large Language Models: A Comprehensive Survey

Published in ACM Computing Surveys, 2024

A comprehensive survey on continual learning approaches for large language models, covering methodologies, challenges, and future directions.

Recommended citation: Haizhou Shi, Zihao Xu, Hengyi Wang, Weiyi Qin, Wenyuan Wang, Yibin Wang, Zifeng Wang, Sayna Ebrahimi, Hao Wang. "Continual Learning of Large Language Models: A Comprehensive Survey." ACM Computing Surveys.
Download Paper

Multi-tailed vision transformer for efficient inference

Published in Neural Networks, 2024

A novel architecture that uses multiple tails to generate visual sequences of different lengths for efficient vision transformer inference.

Recommended citation: Yunke Wang, Bo Du, Wenyuan Wang, Chang Xu. "Multi-tailed vision transformer for efficient inference." Neural Networks, 2024, 174: 106235.
Download Paper

Probabilistic Residual User Clustering

Published in IJCAI 2025 Workshop / Submitted to TMLR, 2024

A causal Bayesian framework that clusters users and models residuals between predicted and true ratings to enhance recommendation accuracy.

Recommended citation: Wenyuan Wang, Yusong Zhao, Zihao Xu, Hengyi Wang, Shreya Venugopal, Desmond Lobo, Chengzhi Mao, Qi Xu, Zhigang Hua, Yan Xie, Bo Long, Shuang Yang, Hao Wang. "Probabilistic Residual User Clustering." IJCAI 2025 Workshop / Submitted to TMLR.
Download Paper | Download Slides

Generalizable Geometric Image Caption Synthesis

Under review at the NeurIPS Datasets and Benchmarks Track, 2025

A reinforcement learning-based framework for generating semantically aligned geometry image-caption pairs, creating the first dataset with full modality equivalence for geometric reasoning.

Recommended citation: Wenyuan Wang*, Yue Xin*, Rui Pan*, BingXu Meng*, Renjie Pi, Tong Zhang. "Generalizable Geometric Image Caption Synthesis." Submitted to NeurIPS Datasets and Benchmarks Track.
Download Paper | Download Slides | Download Bibtex

Multimodal Needle in a Haystack: Benchmarking Long-Context Capability of Multimodal Large Language Models

Published in NAACL 2025 Main, 2025

A comprehensive benchmark for evaluating the long-context capabilities of multimodal large language models.

Recommended citation: Hengyi Wang, Haizhou Shi, Shiwei Tan, Weiyi Qin, Wenyuan Wang, Tunyu Zhang, Akshay Nambi, Tanuja Ganu, Hao Wang. "Multimodal Needle in a Haystack: Benchmarking Long-Context Capability of Multimodal Large Language Models." NAACL 2025 Main.
Download Paper | Download Slides

Talks

Teaching

Teaching experience 1

Undergraduate course, University 1, Department, 2014

This is a description of a teaching experience. You can use markdown like any other post.

Teaching experience 2

Workshop, University 1, Department, 2015

This is a description of a teaching experience. You can use markdown like any other post.