Publication MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention Huiqiang Jiang, Yucheng Li, Chengruidong Zhang, Qianhui Wu, Xufang Luo, Surin Ahn, Zhenhua Han, Amir H. Abdi, Dongsheng Li, Chin-Yew Lin, Yuqing Yang, Lili Qiu NeurIPS 2024 | December 2024 spotlight Github Project
Publication ALPINE: Unveiling the Planning Capability of Autoregressive Learning in Language Models Siwei Wang, Yifei Shen, Shi Feng, Haoran Sun, Shang-Hua Teng, Wei Chen NeurIPS 2024 | December 2024
Publication Query-Efficient Correlation Clustering with Noisy Oracle Yuko Kuroki, Atsushi Miyauchi, Francesco Bonchi, Wei Chen NeurIPS 2024 | December 2024
Publication Can Graph Learning Improve Task Planning? Xixi Wu, Yifei Shen, Caihua Shan, Kaitao Song, Siwei Wang, Bohang Zhang, Jiarui Feng, Hong Cheng, Wei Chen, Yun Xiong, Dongsheng Li NeurIPS 2024 | December 2024
Publication Combinatorial Causal Bandits without Graph Skeleton Shi Feng, Nuoya Xiong, Wei Chen Proceedings of the 16th Asian Conference on Machine Learning (ACML) | December 2024
Publication LORASC: Expressive and Generalizable Low-rank Adaptation for Large Models via Slow Cascaded Learning Yifan Yang, Yifei Shen, Yuqing Yang, Lili Qiu, Fangyun Wei 2024 Empirical Methods in Natural Language Processing | November 2024
Publication VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models Yifei Liu, Jicheng Wen, Yang Wang, Shengyu Ye, Li Lyna Zhang, Ting Cao, Cheng Li, Mao Yang November 2024 Github
Publication Uncovering Nested Data Parallelism and Data Reuse in DNN Computation with FractalTensor Siran Liu, Chengxiang Qi, Ying Cao, Chao Yang, Weifang Hu, Xuanhua Shi, Fan Yang, Mao Yang SOSP | November 2024
Publication Self-maintaining [networked] systems: The rise of datacenter robotics! Freddie Hong, Iason Sarantopoulos, Elliott Hogg, David Richardson, Yizhong Zhang, Hugh Williams, David Sweeney, Andromachi Chatzieleftheriou, Ant Rowstron 2024 Hot Topics in Networks | November 2024 Project
Publication Reviving Cloud Gaming Sessions Yifan Yang, Lili Qiu, Yuqing Yang CoNEXT 2024 – International Conference on emerging Networking EXperiments and Technologies | November 2024