AI Model APIs

The complete platform for comparing AI models. Find pricing, capabilities, and the perfect model for your use case.

contact@aimodelapis.com

© 2026 AI Model APIs. All rights reserved.

Venice AI

Hermes 3 Llama 3.1 405b API

About

Hermes 3 Llama 3.1 405b is an open-weights model with a 128K-token context window, suited to long-document analysis. It is priced at $1.10 per 1M input tokens and $3.00 per 1M output tokens, and is available through the Venice AI API with up to 16K output tokens per response.

Capabilities

Input Modalities: text
Output Modalities: text
Reasoning: No
Structured Output: No
Tool Use: No
Weights: Open
Temperature: Adjustable
Attachment: Not Supported

Limits

Context Window: 128K tokens
Input Limit: 128K tokens
Max Output: 16K tokens
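The limits above can be turned into a simple pre-flight check before sending a request. The sketch below is illustrative only: it assumes "128K" and "16K" mean 128,000 and 16,000 tokens (the listing does not say whether binary or decimal thousands are meant), and it assumes prompt and completion share the same context window. The function name `clampMaxTokens` is hypothetical.

```typescript
// Limits taken from the listing; the exact token counts are an assumption
// ("128K" read as 128,000 and "16K" as 16,000).
const CONTEXT_WINDOW = 128_000;
const INPUT_LIMIT = 128_000;
const MAX_OUTPUT = 16_000;

/**
 * Returns the largest output-token budget that can be requested for a prompt
 * of `inputTokens` tokens, capped at `requestedOutput`.
 * Returns 0 when the prompt alone exceeds the input limit or fills the window.
 * Assumes input and output tokens count against the same context window.
 */
function clampMaxTokens(inputTokens: number, requestedOutput: number): number {
  if (inputTokens > INPUT_LIMIT) {
    return 0; // prompt is too long to send at all
  }
  const remaining = CONTEXT_WINDOW - inputTokens;
  return Math.max(0, Math.min(requestedOutput, MAX_OUTPUT, remaining));
}
```

For a short prompt the cap is the 16K output limit; for a prompt close to the window size, the remaining space becomes the binding constraint.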

Pricing

Standard (per 1M tokens)

Input: $1.10
Output: $3.00

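As a worked example of the per-token pricing, the sketch below estimates the cost of a single request from its token counts. The rates are the ones listed here; the function name `estimateCostUsd` is hypothetical, and real invoices may differ if the provider applies rounding, minimums, or discounts not shown on this page.

```typescript
// Listed standard rates, in USD per 1M tokens.
const INPUT_PRICE_PER_MILLION = 1.1;
const OUTPUT_PRICE_PER_MILLION = 3.0;

/** Estimates the cost in USD of one request from its token counts. */
function estimateCostUsd(inputTokens: number, outputTokens: number): number {
  if (inputTokens < 0 || outputTokens < 0) {
    throw new RangeError("token counts must be non-negative");
  }
  const inputCost = (inputTokens / 1e6) * INPUT_PRICE_PER_MILLION;
  const outputCost = (outputTokens / 1e6) * OUTPUT_PRICE_PER_MILLION;
  return inputCost + outputCost;
}
```

For instance, a request with 100,000 input tokens and 10,000 output tokens comes to about $0.11 + $0.03 = $0.14.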
Frequently Asked Questions

Hermes 3 Llama 3.1 405b by Venice AI costs $1.10 per 1M input tokens and $3.00 per 1M output tokens.

Model Details

ID: hermes-3-llama-3.1-405b
Provider: Venice AI
Family: hermes
Release Date: Sep 25, 2025
Knowledge Cutoff: Apr 1, 2024

API Integration

NPM Package: venice-ai-sdk-provider
Environment Variables: VENICE_API_KEY
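A minimal sketch of how the environment variable and model ID might be wired into a request follows. It does not use the `venice-ai-sdk-provider` package, whose exports are not described on this page. Instead it assumes Venice AI exposes an OpenAI-style chat-completions endpoint under the base URL shown in the code; that base URL, the request shape, and the helper name `buildChatRequest` are all assumptions to verify against the provider's documentation.

```typescript
// ASSUMPTION: base URL and OpenAI-style request body; check the provider docs.
const ASSUMED_BASE_URL = "https://api.venice.ai/api/v1";
// Model ID as shown in the listing.
const MODEL_ID = "hermes-3-llama-3.1-405b";

interface ChatRequest {
  url: string;
  init: {
    method: "POST";
    headers: Record<string, string>;
    body: string;
  };
}

/**
 * Builds (but does not send) a chat request. `env` is passed in explicitly so
 * the function is easy to test; in Node.js you would pass `process.env`.
 */
function buildChatRequest(
  env: Record<string, string | undefined>,
  prompt: string,
  maxTokens: number = 1024,
): ChatRequest {
  const apiKey = env["VENICE_API_KEY"];
  if (!apiKey) {
    throw new Error("VENICE_API_KEY is not set");
  }
  return {
    url: `${ASSUMED_BASE_URL}/chat/completions`,
    init: {
      method: "POST",
      headers: {
        Authorization: `Bearer ${apiKey}`,
        "Content-Type": "application/json",
      },
      body: JSON.stringify({
        model: MODEL_ID,
        messages: [{ role: "user", content: prompt }],
        max_tokens: maxTokens,
      }),
    },
  };
}

// Usage (hypothetical, in an async context):
//   const { url, init } = buildChatRequest(process.env, "Summarize this report.");
//   const response = await fetch(url, init);
```

Keeping request construction separate from the network call makes it possible to inspect the payload and to fail early when the API key is missing.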