Blog(2)
Blog(2)
Research Areas(0)
Publications(28)
Two-Stage Grid Optimization for Group-wise Quantization of LLMs
On the Importance of a Multi-Scale Calibration for Quantization
TurboBoA: Faster and Exact Attention-aware Quantization without Backpropagation
News(5)
Others(0)