PyTorch Quantization Functions

Quantization refers to techniques for performing computations and storing tensors at bit widths lower than floating-point precision. PyTorch supports both per-tensor and per-channel asymmetric linear quantization. To learn more about how to use quantized functions in PyTorch, refer to the Quantization documentation.
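
To make the difference between per-tensor and per-channel asymmetric linear quantization concrete, here is a minimal pure-Python sketch of the underlying arithmetic. It is not PyTorch's implementation: the helper names (`choose_qparams`, `quantize`, `dequantize`, `max_roundtrip_error`), the unsigned 8-bit range, and the min/max rule for choosing the scale and zero point are all assumptions made for illustration. In PyTorch itself, the corresponding entry points are `torch.quantize_per_tensor` and `torch.quantize_per_channel`.

```python
def choose_qparams(values, qmin=0, qmax=255):
    """Pick (scale, zero_point) mapping the value range onto [qmin, qmax].

    Assumption: a simple min/max rule; real observers may use other strategies.
    """
    lo = min(min(values), 0.0)  # include 0.0 so it is exactly representable
    hi = max(max(values), 0.0)
    scale = (hi - lo) / (qmax - qmin)
    if scale == 0.0:            # all-zero input: any positive scale works
        scale = 1.0
    zero_point = int(round(qmin - lo / scale))
    return scale, max(qmin, min(qmax, zero_point))


def quantize(values, scale, zero_point, qmin=0, qmax=255):
    """Asymmetric (affine) quantization: q = clamp(round(x / scale) + zero_point)."""
    return [max(qmin, min(qmax, round(v / scale) + zero_point)) for v in values]


def dequantize(qvalues, scale, zero_point):
    """Inverse mapping: x is approximately (q - zero_point) * scale."""
    return [(q - zero_point) * scale for q in qvalues]


def max_roundtrip_error(values, scale, zero_point):
    """Largest absolute error after quantizing and dequantizing `values`."""
    restored = dequantize(quantize(values, scale, zero_point), scale, zero_point)
    return max(abs(v - r) for v, r in zip(values, restored))


# A toy 2x3 "weight tensor"; each row plays the role of one channel.
weights = [[-1.0, 0.5, 1.0],
           [-0.01, 0.0, 0.02]]

# Per-tensor: one (scale, zero_point) pair shared by every element.
flat = [v for row in weights for v in row]
tensor_scale, tensor_zp = choose_qparams(flat)
per_tensor_errors = [max_roundtrip_error(row, tensor_scale, tensor_zp)
                     for row in weights]

# Per-channel: each row gets its own (scale, zero_point) pair.
channel_qparams = [choose_qparams(row) for row in weights]
per_channel_errors = [max_roundtrip_error(row, s, zp)
                      for row, (s, zp) in zip(weights, channel_qparams)]

print("per-tensor errors: ", per_tensor_errors)
print("per-channel errors:", per_channel_errors)
```

In this toy example the second row has a much smaller value range than the first, so a single shared scale wastes most of the 8-bit range on it. Giving each channel its own scale and zero point reduces that row's round-trip error, which is the usual motivation for per-channel quantization of weights.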