Jeff - check out the distributed-llama project...you should be able to distribut...

geerlingguy · 2025-08-07T21:35:37 1754602537

I've been testing Exo (seems dead), llama.cpp RPC (has a lot of performance limitations) and distributed-llama (faster but has some Vulkan quirks and only works with a few models).

See my AI cluster automation setup here: https://github.com/geerlingguy/beowulf-ai-cluster

I built that over the course of making this video, because it's insane how much manual labor people put into building home AI clusters :D

yjftsjthsd-h · 2025-08-07T21:23:25 1754601805

https://github.com/b4rtaz/distributed-llama ?

burnte · 2025-08-07T21:02:31 1754600551

He mentioned that in the video.