Fully Nested Neural Network for Adaptive Compression and Quantization
Neural network compression and quantization are important tasks for fitting state-of-the-art models into the computational, memory and power constraints of mobile devices and embedded hardware.
Mr. Yufei CUI, Mr. Ziquan LIU, Ms. Qiao LI, Dr. Antoni CHAN, Mr. Wuguannan YAO
- Neural network compression
- Neural network quantization
- Deployed Artificial Intelligence