llama-3_2-nemoretriever-300m-embed-v1 is an embedding model available via the Nvidia API, suited to scalable text retrieval and natural language processing. The model is completely free of charge and has an open-weights architecture. Access llama-3_2-nemoretriever-300m-embed-v1 via the Nvidia API with up to 2K output tokens.
llama-3_2-nemoretriever-300m-embed-v1 by Nvidia is free to use: there is no charge per 1M input tokens or per 1M output tokens.