ML Systems
- A Systematic Methodology for Analysis of Deep Learning Hardware and Software Platforms
- Precious: Resource-Demand Estimation for Embedded Neural Network Accelerators
- A Learned Performance Model for Tensor Processing Units
- Learned TPU Cost Model for XLA Tensor Programs
- Efficient Mixed-Precision Large Language Model Inference with TurboMind