Google Cloud C4 Brings a 70% TCO improvement on GPT OSS with Intel and Hugging Face +2 Oct 16 • 18
Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs +7 Apr 29 • 43
Benchmarking Language Model Performance on 5th Gen Xeon at GCP +1 Dec 17, 2024 • 7
AMD + 🤗: Large Language Models Out-of-the-Box Acceleration with AMD GPU +4 Dec 5, 2023 • 4
Overview of natively supported quantization schemes in 🤗 Transformers +3 Sep 12, 2023 • 12