Blog(3)
Research Areas(0)
Publications(39)
LittleBit: Ultra-Low Bit Quantization via Latent Factorization
MoDeGPT: Modular Decomposition for Large Language Model Compression
Exploring compressibility of transformer based text-to-music (TTM) models
News(6)
Others(1)