5 Comments
Laurence Anthony

Fantastic work, Simon! Apple MLX really is amazing, isn't it? Any thoughts on Llamafile and how your LLM library compares to that?

KimSia Sim

Thank you for this, Simon.

I’m a Python/Django dev on a Mac M1 Max (32 GB RAM) and have been using Claude Sonnet extensively at work via Cursor.

Any thoughts on how close this can get to replacing Sonnet?

I noted you wrote that Llama 70B can run locally on 64 GB of RAM and get close to GPT-4o level.

I’m thinking of perhaps some kind of divide-and-conquer approach:

1. Consider upgrading to a new Mac with more RAM.

2. Consider using your suggested setup, but only for easier queries.

What are your thoughts on these two paths?

Lawrence Sinclair

Is this an alternative to Ollama, and if so, why would one use it instead?

James Wang

Great guide! I’ve personally been really enjoying running local LLMs on my Mac Studio (still M1 Max, but lots of RAM), and this just gives me more impetus to upgrade when the new ones come out, hopefully with 256 GB of integrated RAM…

Alex Rosenfeld

Thanks, Simon! Appreciate your work.
