Everything you need to get the most out of TokForge. From your first offline AI conversation to building automated agents with 120+ API endpoints.
Getting Started
How to Run AI Offline on Your Phone
Install TokForge, download your first model, and start chatting — all without internet.
Best AI Models for Your Phone
Which model fits your RAM? A visual guide from 0.8B to 27B.
Is Your Phone Fast Enough?
Check if your Android device can run local AI, and what to expect.
Features & Setup
Setting Up Character Cards
Import TavernAI characters, customize personalities, and build persistent AI companions.
Chat With Your Documents
Attach PDFs, Word docs, and EPUBs. Ask questions, get answers grounded in your files.
Speculative Decoding
How TokForge nearly doubles inference speed on large models — with zero quality loss.
TurboQuant: 57 tok/s
TQ4 aggressive quantization makes small models absurdly fast. Here's how.
TokForge