$ llama-manager

A terminal UI for managing llama.cpp — start and control the server, manage versions, download GGUF models from Hugging Face, and monitor inference performance in real time.

$ npm install -g llama-manager

Apache 2.0 Node 18+

// demo

See it in action

llama-manager

// features

Everything in your terminal

Dashboard

Real-time per-slot metrics, server controls (start/stop/restart), and a live log viewer.

Logs

Dedicated server log viewer with structured severity coloring.

Tasks

Parsed task history with token counts, speeds, draft acceptance, filtering, and SQLite persistence.

Profiles

Named server configurations with type-aware preset editors and free-form arguments.

Versions

Install, switch, and uninstall llama.cpp builds from GitHub releases.

Models

Search Hugging Face, download GGUF models with progress tracking, set active model.

Options

Global settings: paths, poll interval, task limits, appearance, theme, HF token.

// fork support

Multiple forks, one manager

Install, configure, and switch between llama.cpp forks seamlessly. Each fork's unique CLI flags, preset categories, and backend variants are handled automatically.

llama.cpp

koboldcpp

beellama.cpp

llamacpp-rocm

ik_llama.cpp

// themes

33 themes, zero config

Select a theme from the Options tab or press Ctrl+T. Compatible with the catppuccin, dracula, nord, gruvbox ecosystem.

// stack

Built with

Language

TypeScript

Rendering

Double-buffered framebuffer

Input

terminal-kit

HTTP

undici

Storage

better-sqlite3

Bundler

tsup (esbuild)

// requirements

What you need

✓ Node.js 18 or later
✓ A llama.cpp binary (managed via the Versions tab or installed manually)
✓ npm (for global install)

Ready to manage your local LLMs?

One command. Zero configuration. Full control.

$ npm install -g llama-manager