Publication MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention Huiqiang Jiang, Yucheng Li, Chengruidong Zhang, Qianhui Wu, Xufang Luo, Surin Ahn, Zhenhua Han, Amir H. Abdi, Dongsheng Li, Chin-Yew Lin, Yuqing Yang, Lili Qiu NeurIPS 2024 | December 2024 spotlight Github Project
Publication ALPINE: Unveiling the Planning Capability of Autoregressive Learning in Language Models Siwei Wang, Yifei Shen, Shi Feng, Haoran Sun, Shang-Hua Teng, Wei Chen NeurIPS 2024 | December 2024
Publication Query-Efficient Correlation Clustering with Noisy Oracle Yuko Kuroki, Atsushi Miyauchi, Francesco Bonchi, Wei Chen NeurIPS 2024 | December 2024
Publication Convergence to Equilibrium of No-regret Dynamics in Congestion Games Volkan Cevher, Wei Chen, Leello Dadi, Jing Dong, Ioannis Panageas, Stratis Skoulakis, Luca Viano, Baoxiang Wang, Siwei Wang, Jingyu Wu Conference on Web and Internet Economics (WINE) | December 2024
Publication Can Graph Learning Improve Task Planning? Xixi Wu, Yifei Shen, Caihua Shan, Kaitao Song, Siwei Wang, Bohang Zhang, Jiarui Feng, Hong Cheng, Wei Chen, Yun Xiong, Dongsheng Li NeurIPS 2024 | December 2024
Publication Combinatorial Causal Bandits without Graph Skeleton Shi Feng, Nuoya Xiong, Wei Chen Proceedings of the 16th Asian Conference on Machine Learning (ACML) | December 2024
Publication LORASC: Expressive and Generalizable Low-rank Adaptation for Large Models via Slow Cascaded Learning Yifan Yang, Yifei Shen, Yuqing Yang, Lili Qiu, Fangyun Wei 2024 Empirical Methods in Natural Language Processing | November 2024
Publication LLM2CLIP: Powerful Language Model Unlocks Richer Visual Representation Weiquan Huang, Aoqi Wu, Yifan Yang, Xufang Luo, Yuqing Yang, Liang Hu, Qi Dai, Xiyang Dai, Dongdong Chen, Chong Luo, Lili Qiu November 2024 Project
Publication VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models Yifei Liu, Jicheng Wen, Yang Wang, Shengyu Ye, Li Lyna Zhang, Ting Cao, Cheng Li, Mao Yang November 2024 Github
Publication Uncovering Nested Data Parallelism and Data Reuse in DNN Computation with FractalTensor Siran Liu, Chengxiang Qi, Ying Cao, Chao Yang, Weifang Hu, Xuanhua Shi, Fan Yang, Mao Yang SOSP | November 2024