Menu
Back to Open Source

🐙 GitHub Detail

Q

mit-han-lab/Quest

By mit-han-lab

[ICML 2024] Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference

GitHub Cuda MIT License Updated 04 Jun 2026

Live Snapshot

Stars

391

🍴

Forks

48

📄

License

MIT License

🧩

Type

Cuda

📘

About this open-source project

Live information fetched from GitHub.

[ICML 2024] Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference

🌿

Default Branch

main

🐞

Open Issues

6

👀

Watchers

391