←
Back to Open Source
🐙 GitHub Detail
Q
mit-han-lab/Quest
By mit-han-lab
[ICML 2024] Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference
GitHub
Cuda
MIT License
Updated 04 Jun 2026
Live Snapshot
⭐
Stars
391
🍴
Forks
48
📄
License
MIT License
🧩
Type
Cuda
📘
About this open-source project
Live information fetched from GitHub.
[ICML 2024] Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference
🌿
Default Branch
main
🐞
Open Issues
6
👀
Watchers
391