Bielik-4.5B-v3.0-Instruct-MLX-4bit

MLX format conversion of speakleash/Bielik-4.5B-v3.0-Instruct for Apple Silicon with 4-bit quantization.

Model Description

Bielik-4.5B-v3.0-Instruct is a Polish instruction-tuned language model based on the Qwen architecture, trained by SpeakLeash. This is the 4-bit quantized MLX conversion optimized for Apple Silicon Macs with reduced memory footprint.

Usage

from mlx_lm import load, generate

model, tokenizer = load("czlonkowski/Bielik-4.5B-v3.0-Instruct-MLX-4bit")
response = generate(model, tokenizer, prompt="Wyjaśnij czym jest uczenie maszynowe.", max_tokens=200)
print(response)

Or via CLI:

mlx_lm.generate --model czlonkowski/Bielik-4.5B-v3.0-Instruct-MLX-4bit --prompt "Wyjaśnij czym jest uczenie maszynowe."

Model Details

Related Models

License

Please refer to the original model's license at speakleash/Bielik-4.5B-v3.0-Instruct.

Downloads last month
15
Safetensors
Model size
0.7B params
Tensor type
BF16
·
U32
·
MLX
Hardware compatibility
Log In to add your hardware

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for czlonkowski/Bielik-4.5B-v3.0-Instruct-MLX-4bit

Quantized
(14)
this model