Quantization requires a large amount of CPU memory (RAM). If physical RAM is insufficient, enabling swap space reduces the amount of RAM needed, although swapping is slower than working in RAM. Depending on the GPUs/drivers, there may be a difference in performance, which ...
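
As a rough pre-flight check for the memory note above, the sketch below adds available RAM and free swap on a Linux host and compares the total against a target. It is a minimal illustration rather than part of any quantization tool: the helper names `parse_meminfo` and `has_enough_memory` are hypothetical, and the 64 GiB threshold is a placeholder, not a figure from this document.

```python
from pathlib import Path


def parse_meminfo(text: str) -> dict[str, int]:
    """Parse /proc/meminfo-style text into a {field: value} mapping.

    Memory fields are reported by the kernel in KiB (labelled "kB").
    """
    info = {}
    for line in text.splitlines():
        key, sep, rest = line.partition(":")
        parts = rest.split()
        if sep and parts and parts[0].isdigit():
            info[key.strip()] = int(parts[0])
    return info


def has_enough_memory(info: dict[str, int], required_gib: float) -> bool:
    """Return True if available RAM plus free swap covers required_gib."""
    available_kib = info.get("MemAvailable", 0) + info.get("SwapFree", 0)
    return available_kib >= required_gib * 1024 * 1024


if __name__ == "__main__":
    meminfo = Path("/proc/meminfo")
    if meminfo.exists():  # /proc/meminfo is Linux-specific
        info = parse_meminfo(meminfo.read_text())
        # 64 GiB is a placeholder threshold (assumption), adjust to your model.
        print("RAM + swap sufficient:", has_enough_memory(info, required_gib=64))
```

If the check fails, adding or enlarging swap before starting quantization is one option; expect the run to take longer whenever the process actually spills into swap.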