AI Model APIs

The complete platform for comparing AI models. Find pricing, capabilities, and the perfect model for your use case.

contact@aimodelapis.com


© 2026 AI Model APIs. All rights reserved.

Cloudflare Workers AI

@cf/meta/llama-3.1-8b-instruct-fp8 API

About

@cf/meta/llama-3.1-8b-instruct-fp8 is a text-in, text-out LLM available via the Cloudflare Workers AI API, suited to text generation and natural language processing. It costs $0.15 per 1M input tokens and $0.29 per 1M output tokens, supports native tool calling, and has open weights. The context window is 32K tokens, with up to 32K output tokens.

Capabilities

Input Modalities: text
Output Modalities: text
Reasoning: No
Structured Output: No
Tool Use: Yes
Weights: Open
Temperature: Adjustable
Attachment: Not Supported
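
The listing reports tool use as supported and temperature as adjustable. A minimal sketch of what a tool-calling request body might look like follows, assuming the endpoint accepts the OpenAI-style `tools` array (the listing only names an OpenAI-compatible package, so the exact payload shape is an assumption). The `get_weather` tool, its parameters, and the helper `buildToolRequest` are hypothetical names introduced for illustration.

```typescript
// Sketch only: assumes an OpenAI-style chat payload with a `tools` array.
// `get_weather` is a hypothetical tool, not part of the listing.

interface ToolDefinition {
  type: "function";
  function: {
    name: string;
    description: string;
    parameters: Record<string, unknown>; // JSON Schema describing the arguments
  };
}

interface ChatMessage {
  role: "system" | "user" | "assistant";
  content: string; // text only: the listing reports no attachment support
}

interface ToolChatRequest {
  model: string;
  messages: ChatMessage[];
  tools: ToolDefinition[];
  temperature: number;
}

const weatherTool: ToolDefinition = {
  type: "function",
  function: {
    name: "get_weather",
    description: "Look up the current weather for a city",
    parameters: {
      type: "object",
      properties: { city: { type: "string" } },
      required: ["city"],
    },
  },
};

// Builds the request body; it does not send anything over the network.
function buildToolRequest(prompt: string, temperature = 0.2): ToolChatRequest {
  return {
    model: "@cf/meta/llama-3.1-8b-instruct-fp8",
    messages: [{ role: "user", content: prompt }],
    tools: [weatherTool],
    temperature,
  };
}
```

Because structured output is listed as unsupported, any tool arguments the model returns are probably best validated against the schema before use.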
Limits
Context Window: 32K tokens
Input Limit: 32K tokens
Max Output: 32K tokens
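
These limits can be turned into a simple output budget. The sketch below rests on two assumptions that the listing does not confirm: that "32K" means 32,768 tokens, and that prompt and completion tokens share the same context window. The helper `maxOutputBudget` is a hypothetical name.

```typescript
// Assumption: "32K" is read as 32,768 tokens; the listing does not give an exact figure.
const CONTEXT_WINDOW_TOKENS = 32_768;
const MAX_OUTPUT_TOKENS = 32_768;

// Returns how many output tokens can be requested for a prompt of the given size,
// assuming prompt and completion tokens count against one shared context window.
function maxOutputBudget(promptTokens: number, requestedOutput: number): number {
  if (promptTokens < 0 || requestedOutput < 0) {
    throw new RangeError("token counts must be non-negative");
  }
  const remaining = Math.max(0, CONTEXT_WINDOW_TOKENS - promptTokens);
  return Math.min(requestedOutput, MAX_OUTPUT_TOKENS, remaining);
}
```

Under these assumptions, a 30,000-token prompt leaves room for at most 2,768 output tokens, even though the listed maximum output is 32K.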

Pricing

Standard (per 1M tokens)

Input: $0.15
Output: $0.29
Frequently Asked Questions

@cf/meta/llama-3.1-8b-instruct-fp8 by Cloudflare Workers AI costs $0.15 per 1M input tokens and $0.29 per 1M output tokens.
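
As a worked illustration of these rates, the following sketch estimates the cost of a workload from its token counts. It uses only the two listed prices; the helper `estimateCostUsd` is a hypothetical name, and the calculation ignores any free tier, rounding, or billing rules the provider may apply.

```typescript
// Listed standard rates in USD per 1M tokens.
const INPUT_USD_PER_MILLION = 0.15;
const OUTPUT_USD_PER_MILLION = 0.29;

// Estimates the cost in USD for a given number of input and output tokens.
function estimateCostUsd(inputTokens: number, outputTokens: number): number {
  if (inputTokens < 0 || outputTokens < 0) {
    throw new RangeError("token counts must be non-negative");
  }
  const inputCost = inputTokens * INPUT_USD_PER_MILLION;
  const outputCost = outputTokens * OUTPUT_USD_PER_MILLION;
  return (inputCost + outputCost) / 1_000_000;
}
```

For example, 2,000,000 input tokens and 500,000 output tokens come to about $0.30 + $0.145 = $0.445.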

Model Details
ID: llama-3.1-8b-instruct-fp8
Provider: Cloudflare Workers AI
Family: llama
Release Date: Jul 25, 2024
Knowledge Cutoff: N/A
API Integration
NPM Package: @ai-sdk/openai-compatible
Environment Variables: CLOUDFLARE_ACCOUNT_ID, CLOUDFLARE_API_KEY
API Base URL: https://api.cloudflare.com/client/v4/accounts/${CLOUDFLARE_ACCOUNT_ID}/ai/v1
Documentation
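
The integration details above can be combined into a request roughly as follows. This is a sketch, not the provider's documented client: it assumes the base URL exposes the OpenAI-style `/chat/completions` path and accepts the API key as a Bearer token, neither of which the listing states explicitly. The helpers `workersAiBaseUrl` and `prepareChatRequest` are hypothetical names, and the code only prepares the request without sending it.

```typescript
interface PreparedRequest {
  url: string;
  init: {
    method: "POST";
    headers: Record<string, string>;
    body: string;
  };
}

// Fills the account ID into the base URL given in the listing.
function workersAiBaseUrl(accountId: string): string {
  return `https://api.cloudflare.com/client/v4/accounts/${encodeURIComponent(accountId)}/ai/v1`;
}

// Prepares (but does not send) a chat request.
// Assumptions: OpenAI-style `/chat/completions` path and Bearer authentication.
function prepareChatRequest(
  accountId: string,
  apiKey: string,
  prompt: string,
): PreparedRequest {
  if (!accountId || !apiKey) {
    throw new Error("CLOUDFLARE_ACCOUNT_ID and CLOUDFLARE_API_KEY must both be set");
  }
  return {
    url: `${workersAiBaseUrl(accountId)}/chat/completions`,
    init: {
      method: "POST",
      headers: {
        Authorization: `Bearer ${apiKey}`,
        "Content-Type": "application/json",
      },
      body: JSON.stringify({
        model: "@cf/meta/llama-3.1-8b-instruct-fp8",
        messages: [{ role: "user", content: prompt }],
      }),
    },
  };
}
```

In a runtime with a global `fetch` (for example Node 18 or later), the prepared request could be sent with `fetch(request.url, request.init)`, with the two arguments read from the `CLOUDFLARE_ACCOUNT_ID` and `CLOUDFLARE_API_KEY` environment variables.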