Nvidia announced the Llama Nemotron family of open large language models (LLMs) on Monday. The company said that with the rise of artificial intelligence (AI) agents, newer and more sophisticated AI models are required to handle agentic AI workflows. Highlighting the need for more capability and higher efficiency, the tech giant stated that the Nemotron family models can be used to create and deploy AI agents across various applications. The company said the AI models will be available to enterprises via Nvidia NIM microservices.
In a blog post, the tech giant announced its new series of open-source LLMs dubbed Nemotron. The series also includes Cosmos Nemotron vision language models (VLMs), which can be used to build AI agents that analyse and respond to images and videos. Nvidia said these vision-focused agents can be deployed in autonomous machines, hospitals, stores and warehouses, as well as at sports events, in movies, and in news.
Built on Meta's Llama foundation models, the Nvidia Llama Nemotron models are said to be optimised for building and deploying AI agents. While the company did not reveal the architecture or other technical details, it claimed the models are trained using the “latest techniques and high-quality datasets”. The models are said to support agentic capabilities such as instruction following, chat, function calling, coding, and mathematics. Nemotron is also said to optimise model size so that the resulting AI agents are easier to deploy.
Nvidia stated that SAP, ServiceNow, and other AI agent platform providers will be among the first to use the new Llama Nemotron models.
The Nemotron and Cosmos Nemotron models will be available in three parameter sizes — Nano, Super, and Ultra. Nano is the most cost-effective model built with low latency as the primary focus. Super is a high-accuracy model that can be run on a single GPU. Finally, Ultra is the highest-accuracy model designed for data centre-scale applications.
Nvidia highlighted that enterprises can access the Nemotron model family as downloadable models and as NIM microservices. The models will also be available as hosted application programming interfaces (APIs). While the models are described as open source, they are currently available only for academic and research use.
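For readers curious what the hosted API route looks like in practice, the sketch below shows roughly how a Nemotron model might be queried, assuming Nvidia's OpenAI-compatible NIM endpoint at integrate.api.nvidia.com; the model identifier used here is a placeholder, since Nvidia has not published exact catalogue names for the Llama Nemotron sizes in this announcement.

```python
# Minimal sketch: querying a hosted Nemotron model via Nvidia's
# OpenAI-compatible NIM API. The model name below is a placeholder;
# the real identifier for the Nano/Super/Ultra variants may differ.
import os
import requests

API_URL = "https://integrate.api.nvidia.com/v1/chat/completions"
API_KEY = os.environ["NVIDIA_API_KEY"]  # API key issued by Nvidia

payload = {
    "model": "nvidia/llama-nemotron-nano",  # placeholder model identifier
    "messages": [
        {"role": "user", "content": "Summarise today's support tickets."}
    ],
    "temperature": 0.2,
    "max_tokens": 256,
}

response = requests.post(
    API_URL,
    headers={"Authorization": f"Bearer {API_KEY}"},
    json=payload,
    timeout=60,
)
response.raise_for_status()

# Print the assistant's reply from the OpenAI-style response body.
print(response.json()["choices"][0]["message"]["content"])
```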