INT4 LoRA fine-tuning vs QLoRA: A user inquired about the distinctions among INT4 LoRA wonderful-tuning and QLoRA in terms of accuracy and speed. A different member explained that QLoRA with HQQ involves frozen quantized weights, isn't going to use tinnygemm, and makes use of dequantizing along with torch.matmul and that ChatGPT offers some