LLM inference in C/C++
Updated 2024-09-15 09:12:26 +02:00
A simple one-file way to run various GGML and GGUF models with a KoboldAI UI
Updated 2024-09-14 05:34:16 +02:00