Search Results
2 results for “inference optimisation”
Infrastructure
OpenVINO
OpenVINO is an open-source toolkit developed by Intel for optimising and deploying deep learning inference across Intel hardware, including CPUs, GPUs, Neural Processing Units, and FPGAs, with broad support for major AI frameworks and model formats.
6 min read · Updated June 2026
Infrastructure
Quantisation
Quantisation is a model compression technique that reduces the numerical precision of a neural network's weights and activations from higher-precision floating-point formats (such as FP32) to lower-bit representations (such as INT8), decreasing memory usage and accelerating inference, typically with only a small loss in accuracy.
7 min read · Updated May 2026