Tag local-llms

3 bookmarks have this tag.

2025-10-16

7361.

NVIDIA DGX Spark + Apple Mac Studio = 4x Faster LLM Inference with EXO 1.0

simonwillison.net/2025/Oct/16/nvidia-dgx-spark-apple-mac-studio#atom-everything

2025-08-20

7343.

llama.cpp guide: running gpt-oss with llama.cpp

simonwillison.net/2025/Aug/19/gpt-oss-with-llama-cpp#atom-everything

2025-08-15

7340.

Introducing Gemma 3 270M: The compact model for hyper-efficient AI

simonwillison.net/2025/Aug/14/gemma-3-270m#atom-everything