Qwen 3 Embedding 4B is a high-performance LLM available via the Inference API, ideal for scalable text generation and natural language processing. This model delivers cost-effective pricing at $0.01/1M input and Free/1M output tokens, open weights architecture. Access Qwen 3 Embedding 4B via the Inference API with up to 2K output tokens.
Tokens
Tokens
Tokens
Qwen 3 Embedding 4B by Inference costs $0.01 per 1M input tokens and Free per 1M output tokens.