Quantization Applications
- Post-training Quantization on Diffusion Models
- GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers
- SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
- LLM-QAT: Data-Free Quantization Aware Training for Large Language Models
- Quantizable Transformers Removing Outliers by Helping Attention Heads Do Nothing
- Q-DM: An Efficient Low-bit Quantized Diffusion Model