by Antoine Lemor · GitHub
🚀 About

High-performance inference API for transformer models trained with LLM Tool. Deploy your classification, generation, and embedding models with automatic resource management, concurrent request handling, and Ollama integration for generative AI.

📊 Text Classification
🤖 Ollama Integration
GPU Acceleration
🔒 API Key Auth
📦 LLM Tool Ecosystem
🤖 Available Models ▸ Show all deployed models
🔒
Deployed models are private
Enter your API key in the Playground below to browse models, resources, and capabilities.
██████   ██████   ██████  ██████  █████  ███    ██  ██████
██   ██ ██    ██ ██      ██      ██   ██ ████   ██ ██    ██
██   ██ ██    ██ ██      ██      ███████ ██ ██  ██ ██    ██
██   ██ ██    ██ ██      ██      ██   ██ ██  ██ ██ ██    ██
██████   ██████   ██████  ██████ ██   ██ ██   ████  ██████
— reinvented by —
██╗     ██╗     ███╗   ███╗    ████████╗ ██████╗  ██████╗ ██╗     
██║     ██║     ████╗ ████║    ╚══██╔══╝██╔═══██╗██╔═══██╗██║     
██║     ██║     ██╔████╔██║       ██║   ██║   ██║██║   ██║██║     
██║     ██║     ██║╚██╔╝██║       ██║   ██║   ██║██║   ██║██║     
███████╗███████╗██║ ╚═╝ ██║       ██║   ╚██████╔╝╚██████╔╝███████╗
╚══════╝╚══════╝╚═╝     ╚═╝       ╚═╝    ╚═════╝  ╚═════╝ ╚══════╝
Open Doccano by LLM Tool
🧪 Playground
🔐
Test the LLM Tool inference API
Contact Antoine Lemor to obtain an API key
API Online
Version 2.1.0 • Endpoints: /health, /models, /infer
For an inference token or Doccano access — contact Antoine Lemor