Hi everyone, I’m currently working on 3D scene reconstruction and understanding. My prior research covers open-vocabulary segmentation and multi-label learning, and now I’m applying my 2D semantic understanding expertise to 3D scene research.

🔥 News

2026.05: 🎉🎉 PIAA has been accepted by the International Conference on Machine Learning (ICML).

📝 Publications

ICML 2026

[CLS] is Not Enough: Multi-Label Recognition via Patch-Level Inference and Adaptive Aggregation

Akang Wang, Xili Deng, Zhanxuan Hu, YiZhao, Yonghang Tai, Huafeng Li

Project

We tackle multi-label recognition by analytically deriving a patch-based visual classifier to generate reliable, fine-grained object predictions without any model training. We then adaptively fuse these local patch scores with the global image context to robustly identify multiple co-existing targets.

📖 Educations

2025.09 - now, Master of Computer Science and Technology Yunnan Normal University, Kunming, China.