DocsRouterDocsRouter
Api

Models

Available Vision LLM models and pricing

Models

DocsRouter provides access to 100+ vision-capable models through a unified API. Models are dynamically fetched from OpenRouter, ensuring you always have access to the latest vision models.

View all available models on our Providers page with live filtering and search.

List Models

GET https://api.docsrouter.com/v1/models

Returns all available vision models (models with image input capability) with OpenRouter-compatible schema plus DocsRouter extensions.

Authentication

This endpoint supports both authenticated and unauthenticated requests:

  • Without authentication: Returns public model information
  • With authentication: Returns additional pricing details specific to your account tier
# Unauthenticated (public info only)
curl https://api.docsrouter.com/v1/models

# Authenticated (full details)
curl https://api.docsrouter.com/v1/models \
  -H "Authorization: Bearer YOUR_API_KEY"

Query Parameters

ParameterTypeDefaultDescription
providerstringallFilter by provider (e.g., google, openai, anthropic)
capabilitystringallFilter by capability: vision, pdf, handwriting
recommendedbooleanfalseOnly return recommended models
limitnumber100Maximum number of models to return

Response

{
  "object": "list",
  "data": [
    {
      "id": "google/gemini-2.0-flash-001",
      "object": "model",
      "created": 1702425600,
      "owned_by": "google",
      "name": "Gemini 2.0 Flash",
      "description": "Fast, cheap, and reliable. Best default choice for most OCR tasks.",
      "context_length": 1048576,
      "architecture": {
        "input_modalities": ["text", "image"],
        "output_modalities": ["text"],
        "tokenizer": "gemini",
        "instruct_type": "gemini"
      },
      "pricing": {
        "prompt": "0.00000010",
        "completion": "0.00000040",
        "image": "0.0001315",
        "request": "0"
      },
      "top_provider": {
        "context_length": 1048576,
        "max_completion_tokens": 8192,
        "is_moderated": false
      },
      "docsrouter": {
        "category": "vision_llm",
        "recommended": true,
        "supports_tables": true,
        "supports_handwriting": true,
        "supports_forms": true,
        "best_for": ["general documents", "invoices", "receipts", "forms"],
        "cost_per_1m_tokens": 0.10
      }
    }
  ]
}

Get Specific Model

GET https://api.docsrouter.com/v1/models/{model_id}

Returns details for a specific model including pricing, capabilities, and context limits.

Path Parameters

ParameterDescription
model_idThe model ID (e.g., google/gemini-2.0-flash-001)

Example

curl https://api.docsrouter.com/v1/models/google/gemini-2.0-flash-001 \
  -H "Authorization: Bearer YOUR_API_KEY"

Response

{
  "id": "google/gemini-2.0-flash-001",
  "object": "model",
  "created": 1702425600,
  "owned_by": "google",
  "name": "Gemini 2.0 Flash",
  "description": "Fast, cheap, and reliable. Best default choice for most OCR tasks.",
  "context_length": 1048576,
  "architecture": {
    "input_modalities": ["text", "image"],
    "output_modalities": ["text"],
    "tokenizer": "gemini"
  },
  "pricing": {
    "prompt": "0.00000010",
    "completion": "0.00000040",
    "image": "0.0001315"
  },
  "docsrouter": {
    "category": "vision_llm",
    "recommended": true,
    "supports_tables": true,
    "supports_handwriting": true,
    "best_for": ["general documents", "invoices", "receipts"]
  }
}

DocsRouter Extensions

The docsrouter object in each model contains OCR-specific metadata:

FieldTypeDescription
categorystringModel type: vision_llm, traditional_ocr, or specialized_ocr
recommendedbooleanWhether this model is recommended for general use
supports_tablesbooleanCan extract tables as structured data
supports_handwritingbooleanGood at recognizing handwritten text
supports_formsbooleanCan extract form key-value pairs
best_forarrayList of use cases this model excels at
cost_per_1m_tokensnumberCost per million tokens (USD)

Model Reference

IDProviderCost / 1M TokensBest For
google/gemini-2.0-flash-001Google$0.10Recommended. Best balance of speed, cost, and accuracy.
google/gemini-2.5-flash-previewGoogle$0.15Complex layouts, multi-column documents.
google/gemini-2.5-pro-previewGoogle$2.50Legal documents, contracts, scientific papers.
openai/gpt-4o-miniOpenAI$0.15Simple documents, printed text.
openai/gpt-4oOpenAI$5.00Structured extraction, complex reasoning.
anthropic/claude-sonnet-4Anthropic$3.00Document analysis, summarization.
anthropic/claude-3.5-sonnetAnthropic$3.00Handwriting, dense technical documents.
mistralai/pixtral-large-2411Mistral$2.00European documents, multi-language.

Selecting a Model

const response = await client.chat.completions.create({
  model: 'google/gemini-2.0-flash-001',
  messages: [...]
});

In Native OCR Endpoint

{
  "model": "anthropic/claude-3.5-sonnet",
  "url": "https://example.com/document.png"
}

Default Model

If no model is specified, google/gemini-2.0-flash-001 is used. This model offers the best balance of:

  • Speed: ~1-2 seconds per page
  • Cost: $0.10 per million tokens
  • Accuracy: 95%+ on printed documents

Model Categories

Vision LLMs

General-purpose models that can understand images and extract text. Best for most use cases.

Traditional OCR (Coming Soon)

Specialized OCR engines like Google Cloud Vision, AWS Textract, and Azure Document Intelligence. Better for high-volume, simple documents.

Specialized OCR (Coming Soon)

Domain-specific models for invoices, receipts, IDs, and other document types.

Additional Endpoints

Get Model Statistics

GET https://api.docsrouter.com/v1/models/stats

Returns aggregate statistics about available models:

{
  "total": 127,
  "by_provider": {
    "google": 15,
    "openai": 8,
    "anthropic": 6,
    "meta-llama": 12,
    ...
  },
  "by_category": {
    "vision_llm": 127
  }
}
GET https://api.docsrouter.com/v1/models/recommended

Returns only the models we recommend for OCR tasks based on quality and cost-effectiveness.

Get Models by Category

GET https://api.docsrouter.com/v1/models/categories

Returns models grouped by category (vision_llm, traditional_ocr, specialized_ocr).

On this page