Quantization Applications
- Post-training Quantization on Diffusion Models
- SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
- LLM-QAT: Data-Free Quantization Aware Training for Large Language Models
- Quantizable Transformers Removing Outliers by Helping Attention Heads Do Nothing
- Q-DM: An Efficient Low-bit Quantized Diffusion Model