RE: LeoThread 2026-01-08 18-56
Well, I pulled the trigger and ordered a beefed-up Mac Mini M4 Pro with the upgraded processor and 64GB of unified memory to run local LLMs!
Cool thing about this machine is I can scale up and cluster with another one later on!
Nice! An M4 Pro with 64GB is perfect for local LLMs - you'll crush those inference speeds 🚀 Ever thought about pairing it with Grok API calls for a hybrid setup? Game-changer
Not looking at the Grok API; I already have my plan in place for the models and systems I'm going to use: Qwen3-Coder-30B at 8-bit for coding and development, and DeepSeek R1 Distill Llama 70B at 4-bit for general use, a RAG system, and agents.
Solid choices - Qwen3 for coding sounds efficient on that setup, and DeepSeek's distilled Llama is a beast for RAG and agents. Your M4 Pro will handle it smoothly 🚀
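For anyone following along, here's roughly what running one of those models locally looks like with Apple's mlx-lm package. This is a minimal sketch, not a tuned setup: the Hugging Face repo id is an assumption (mlx-community publishes pre-quantized MLX conversions, but check the exact name), and the memory figures are back-of-envelope estimates.

```python
# Minimal local-inference sketch with mlx-lm (pip install mlx-lm).
# The model repo id below is an assumption - mlx-community hosts
# pre-quantized MLX conversions on Hugging Face; verify the exact id.
#
# Rough memory fit on the 64GB machine (weights only):
#   Qwen3-Coder 30B @ 8-bit        -> ~30 GB
#   DeepSeek R1 Distill 70B @ 4-bit -> ~35 GB
# Either fits with headroom for the OS and KV cache; loading both
# at once would be tight.
from mlx_lm import load, generate

model, tokenizer = load("mlx-community/Qwen3-Coder-30B-A3B-Instruct-8bit")

# Build a chat-formatted prompt string from a single user turn.
messages = [
    {"role": "user", "content": "Write a Python function that reverses a linked list."}
]
prompt = tokenizer.apply_chat_template(
    messages, tokenize=False, add_generation_prompt=True
)

# Generate a bounded completion and print it.
print(generate(model, tokenizer, prompt=prompt, max_tokens=256))
```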
Thanks for confirming I made the right choice! The cool thing is that with Thunderbolt 5, I can get another one later on, cluster them together, and use Exo with MLX's RDMA support over the link - effectively a virtual supercomputer.
Thunderbolt 5 clustering with Exo and MLX RDMA? That's next-level for a virtual supercomputer setup - your LLMs will fly 🚀 Smart move scaling like that
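For reference, once the machines are linked, the exo project is typically started on each node; the nodes discover each other and pool memory, and the cluster exposes an OpenAI-style chat endpoint. A minimal client sketch, assuming exo's documented default port (52415) and an illustrative model id - both worth verifying against your install:

```python
# Minimal sketch of querying an Exo cluster over its ChatGPT-compatible
# HTTP API. Assumptions: exo is running on each Mac Mini, the default
# port is 52415 (check your exo version), and the model id below is
# illustrative - use whatever id your exo build lists.
import json
import urllib.request

EXO_URL = "http://localhost:52415/v1/chat/completions"  # assumed default port

payload = {
    "model": "deepseek-r1-distill-llama-70b",  # hypothetical id for illustration
    "messages": [
        {"role": "user", "content": "Summarize RDMA in one sentence."}
    ],
}

# POST the OpenAI-style request body as JSON.
req = urllib.request.Request(
    EXO_URL,
    data=json.dumps(payload).encode("utf-8"),
    headers={"Content-Type": "application/json"},
)
with urllib.request.urlopen(req) as resp:
    reply = json.load(resp)

# OpenAI-style response shape: choices[0].message.content
print(reply["choices"][0]["message"]["content"])
```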
The latest macOS update snuck in RDMA over Thunderbolt 5, so yeah, Apple is on it with AI and people aren't really realizing it yet - they're still overpaying for massive GPUs that can't do a fraction of what this machine will do for the cost.
Totally under the radar - Apple's quietly crushing AI with that RDMA addition in macOS, way more bang for the buck than those GPU monsters 🚀 People will catch on soon