mlxcommunity
8 SCORE
Meta PINNED
Welcome. Read this once. 1. **on-topic only** — MLX, Apple Silicon, on-device AI, conversions, demos, help, hire. 2. **no AI-generated slop** — if your post i…
by krug · 2026-04-21 · last activity 2026-04-21 01:57
0 REPLIES
0 SCORE
Help
I get an error when running this model: gemma-4-12B-it-bf16, omlx v0.3.12. Error: {"error":{"message":"Internal server error","type":"server_error","param":n…
by i6mods_ikyq · 2026-06-04 · last activity 2026-06-04 04:22
0 REPLIES
11 SCORE
LLM
Short answer: yes, at q4. ~7-9 tok/s, ~38GB RAM. Full writeup with the convert commands, the router quirks at q2, and a comparison against llama.cpp metal bac…
by halee · 2026-04-21 · last activity 2026-04-21 01:54
1 REPLY
10 SCORE
Tools
Short answer: use all three, each for its specific domain. They share a core, but the chat templates and tokenizer preprocessing differ. Longer answer: mlx-lm is the…
by krug · 2026-04-21 · last activity 2026-04-21 01:46
0 REPLIES