I am Haoxiang Sun, an undergraduate student in Computer Science and Engineering at Sichuan University (2023.09 - expected 2027.06), currently with GPA 3.79/4.0.
Since 2024.12, I have been a research intern at DICALab (Data Intelligence and Computing Art Laboratory), supervised by Dr. Tao Wang. My current work focuses on:
- Multimodal Large Language Models (MLLMs / VLLMs)
- Transferring VLLMs to real-world applications
- Inference acceleration and efficient decoding
Updated on May 2026.
🔥 News
- 2026.04: I will join Peking University (Shenzhen Graduate School) as a master’s student in Fall 2027.
- 2026.03: From Structure to Synergy: A Survey of Vision-Language Perception Paradigm Evolution in Multimodal Large Language Models accepted by Information Fusion.
- 2026.02: Dr. Seg: Revisiting GRPO Training for Visual Large Language Models through Perception-Oriented Design accepted by CVPR 2026.
- 2024.12: Joined DICALab as a research intern.
🧩 Open Source Contributions
- 2026.01 - now, Contributor @ vLLM-speculators
💻 Internships
- 2024.12 - now, Research Intern, DICALab (Data Intelligence and Computing Art Laboratory), Chengdu, China.
Supervised by Dr. Tao Wang; focusing on multimodal LLMs for visual perception.
🛠 Technical Skills
- Languages: Mandarin (native), English (fluent)
- Programming Languages: C/C++, Python
- Frameworks: PyTorch, vLLM, VeRL