Search Results
2 results for “pruning”
Infrastructure
Model Compression
Model compression is a set of techniques that reduce the size, memory footprint, and computational cost of machine learning models while preserving predictive accuracy, enabling deployment on resource-constrained hardware.
6 min readUpdated June 2026
Infrastructure
Model Pruning
A model compression technique that removes redundant or low-importance parameters from a neural network to reduce size, memory footprint, and inference latency while preserving accuracy.
6 min readUpdated June 2026