Fully Nested Neural Network for Adaptive Compression and Quantization
Neural network compression and quantization are essential for fitting state-of-the-art models within the compute, memory, and power constraints of mobile devices and embedded hardware.
Mr. Yufei CUI, Mr. Ziquan LIU, Ms. Qiao LI, Prof. CHAN Antoni Bert, Mr. Wuguannan YAO
- Neural network compression
- Neural network quantization
- Deployed artificial intelligence