by Antoine Lemor · GitHub
🚀 About

High-performance inference API for transformer models trained with LLM Tool. Deploy your classification, generation, and embedding models with automatic resource management, concurrent request handling, and Ollama integration for generative AI.

📊 Text Classification
🤖 Ollama Integration
GPU Acceleration
🔒 API Key Auth
📦 LLM Tool Ecosystem
🤖 Available Models
Loading models...
🏷️ Annotation Tool

Doccano — open-source annotation tool for text classification, sequence labeling, and sequence-to-sequence tasks.

Open Doccano
Requires a separate Doccano account
🧪 Playground
🔐
Test the LLM Tool inference API
Your API key is stored locally and never sent to third parties
API Online
Version 2.1.0 • Endpoints: /health, /models, /infer