Sitemap

A list of all the posts and pages found on the site. For you robots out there, there is an XML version available for digesting as well.

Posts

Future Blog Post

less than 1 minute read

This post will show up by default. To disable scheduling of future posts, edit _config.yml and set future: false.

Blog Post number 4

less than 1 minute read

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Blog Post number 3

less than 1 minute read

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Blog Post number 2

less than 1 minute read

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Blog Post number 1

less than 1 minute read

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Portfolio

Geo-Image-Textualization

Reinforcement learning-based framework for generating semantically aligned geometry image-caption pairs

MLLM Interpretability Research

Enhancing the interpretability of LLaVA by uncovering underlying attention mechanisms and developing adaptive pruning techniques

Publications

● Multi-tailed vision transformer for efficient inference

Authors: Yunke Wang, Bo Du, Wenyuan Wang, Chang Xu

Neural Networks

A novel architecture that uses multiple tails to generate visual sequences of different lengths for efficient vision transformer inference.

● Continual Learning of Large Language Models: A Comprehensive Survey

Authors: Haizhou Shi, Zihao Xu, Hengyi Wang, Weiyi Qin, Wenyuan Wang, Yibin Wang, Zifeng Wang, Sayna Ebrahimi, Hao Wang

ACM Computing Surveys

A comprehensive survey on continual learning approaches for large language models, covering methodologies, challenges, and future directions.

● Multimodal Needle in a Haystack: Benchmarking Long-Context Capability of Multimodal Large Language Models

Authors: Hengyi Wang, Haizhou Shi, Shiwei Tan, Weiyi Qin, Wenyuan Wang, Tunyu Zhang, Akshay Nambi, Tanuja Ganu, Hao Wang

NAACL 2025 Main

A comprehensive benchmark for evaluating the long-context capabilities of multimodal large language models.

● Probabilistic Residual User Clustering

Authors: Wenyuan Wang, Yusong Zhao, Zihao Xu, Hengyi Wang, Shreya Venugopal, Desmond Lobo, Chengzhi Mao, Qi Xu, Zhigang Hua, Yan Xie, Bo Long, Shuang Yang, Hao Wang

IJCAI 2025 Workshop / Submitted to TMLR

A causal Bayesian framework that clusters users and models residuals between predicted and true ratings to enhance recommendation accuracy.

● Generalizable Geometric Image Caption Synthesis

Authors: Yue Xin*, Wenyuan Wang*, Rui Pan, Ruida Wang, BingXu Meng, Renjie Pi, Shizhe Diao, Tong Zhang

ICLR 2026 (Under Review)

A reinforcement learning-based framework for generating semantically aligned geometry image-caption pairs, creating the first dataset with full modality equivalence for geometric reasoning.

Teaching

Teaching experience 1

This is a description of a teaching experience. You can use markdown like any other post.

Teaching experience 2

This is a description of a teaching experience. You can use markdown like any other post.

Wenyuan Wang (王文渊)
