FlexRound: Learnable Rounding based on Element-wise Division for Post-Training Quantization Paper • 2306.00317 • Published Jun 1, 2023
Memory-Efficient Fine-Tuning of Compressed Large Language Models via sub-4-bit Integer Quantization Paper • 2305.14152 • Published May 23, 2023