DEV Community

Cover image for Prima.cpp: Fast 30-70B LLM Inference on Heterogeneous and Low-Resource HomeClusters
Paperium
Paperium

Posted on • Originally published at paperium.net

Prima.cpp: Fast 30-70B LLM Inference on Heterogeneous and Low-Resource HomeClusters

{{ $json.postContent }}

Top comments (0)