BGE M3 is a high-performance LLM available via the Nvidia API, ideal for scalable text generation and natural language processing. This model delivers available completely free of charge, open weights architecture. Access BGE M3 via the Nvidia API with up to 1K output tokens.
Tokens
Tokens
Tokens
BGE M3 by Nvidia costs Free per 1M input tokens and Free per 1M output tokens.