{"id":1053873,"date":"2024-07-04T07:55:27","date_gmt":"2024-07-04T14:55:27","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/?post_type=msr-project&p=1053873"},"modified":"2024-08-21T23:25:22","modified_gmt":"2024-08-22T06:25:22","slug":"minference-million-tokens-prompt-inference-for-long-context-llms","status":"publish","type":"msr-project","link":"https:\/\/www.microsoft.com\/en-us\/research\/project\/minference-million-tokens-prompt-inference-for-long-context-llms\/","title":{"rendered":"MInference: Million-Tokens Prompt Inference for Long-context LLMs"},"content":{"rendered":"
\n\t
\n\t\t
\n\t\t\t\"MInference\t\t<\/div>\n\t\t\n\t\t

MInference<\/h1>\n\n\n\n

Million-Tokens<\/em><\/strong> Prompt Inference for Long-context LLMs<\/p>\n\n\t\t\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t<\/div>\n\t\t<\/div>\n\t<\/div>\n<\/section>\n\n\n\n\n\n

<\/div>\n\n\n\n
\n
\n