Blog(1)
Research Areas(0)
Publications(28)
Two-Stage Grid Optimization for Group-wise Quantization of LLMs
On the Importance of a Multi-Scale Calibration for Quantization
TurboBoA: Faster and Exact Attention-aware Quantization without Backpropagation
News(4)
Others(0)