Neural Network (NN) Pruning is a task of reducing the size of a Neural Network by removing some of its parameters/weights.
Pruning is often performed with the objective of reducing the memory, computational, and energy bandwidths required for training and deploying NN models which are notorious for their large model size, computational expense, and energy consumption.
Continue reading Neural Network Pruning Research Review 2020