Pluma¶
Local LLM chat in a single binary. Tavern card support, character editing, image generation, text-to-speech — no installer, no runtime to manage, sensible profiles so things work on the first try.
Install¶
Detects OS + arch, fetches the latest release tarball, verifies sha256, extracts to the current directory. Read the script before piping into bash — it's short and does exactly what it says.
Tarballs + signatures live on every tagged release. Grab the file for your platform:
Either way: serves on http://localhost:8787, opens your browser, and the three-screen first-run wizard takes you the rest of the distance.
Get started → Releases → Repo on GitHub →
What's in the box¶
| Area | What ships |
|---|---|
| Chat | OpenAI streaming + Anthropic native plugin · SillyTavern preset import · mobile-first UI with desktop sidebar · resizable rail · per-conversation persona |
| Characters | Tavern Card v1/v2/v3 (PNG + JSON) · AI character creator · auto-portrait via SD · chub.ai browser · personas anchored in the system prompt |
| Models | HuggingFace browser for GGUF + MLX · cancellable multi-file downloads · format auto-detect |
| Images | A1111 + ComfyUI workflow JSONs · in-chat wand button · workflow library |
| Voice | Any OpenAI-compatible /v1/audio/speech endpoint · voice library for codec engines (OmniVoice, XTTS) with reference-audio cloning |
| Security | AES-256-GCM at rest · OS keyring for keys · WebAuthn passkeys · embedded Tailscale (tsnet) · host allowlist · trusted-proxy aware |
| Quality of life | Hot-reload dev loop · auto-titled conversations · sampler family auto-pick · system-message hoisting · reduced-motion respected |
Where to next¶
- New here: Install → First run → Your first chat.
- Coming from SillyTavern: drop Tavern cards into
<datadir>/characters/. They show up on the next list. See Characters & cards. - Want it on your phone: Multi-device access (Tailscale).
- Tuning generation: Samplers & presets.
- Configuring TTS: Voice & TTS. Cloning voices: Voice cloning.
- Looking up a setting: Reference.
- Building from source: Development.
- Something broke: Troubleshooting.