NVLM 1.0

Open-source, frontier-class multimodal LLMs for state-of-the-art vision-language tasks.

NVLM 1.0 is a family of open, frontier-class multimodal LLMs that achieve state-of-the-art results on vision-language tasks, rivaling GPT-4o, Llama 3-V 405B, and InternVL 2.

Vision-Language Tasks

Open-Source LLMs

NVLM 1.0 Alternatives

Unsloth AI

Open Source Training & Fine-tuning of LLMs

Llama 3.1 by Meta

Open-source AI you can customize and deploy anywhere.

BerriAI

Call every LLM API like it's OpenAI [100+ LLMs]

GradientJ

Platform to build large language model applications

Airtrain AI

No-code LLM fine-tuning and evaluation.

Llama

3.1-405B: an open source model to rival GPT-4o / Claude-3.5

Xylem AI

Fast, Scalable Infrastructure for Fine-tuning and Inferencing LLMs.

Automorphic

Infuse knowledge into language models with just 10 samples

Atla

We build LLMs to evaluate other LLMs

Ollama

Get up and running with large language models, locally.

Encord

All the tools you need to build better vision models, faster

Felafax

Expanding AI Infra beyond NVIDIA

Datacurve

Curated data for training LLMs

NVLM 1.0

NVLM 1.0 is a family of frontier-class multimodal large language models (LLMs) that achieve state-of-the-art results on vision-language tasks, rivaling the leading proprietary models (e.g., GPT-4o) and open-access models (e.g., Llama 3-V 405B and InternVL 2). Remarkably, after multimodal training, NVLM 1.0 shows improved accuracy on text-only tasks over its LLM backbone. We are open-sourcing the model weights and training code in Megatron-Core for the community.

Product Highlights

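Feature 1: State-of-the-art results on vision-language tasks, rivaling leading proprietary and open-access models.
Feature 2: Improved accuracy on text-only tasks over its LLM backbone after multimodal training.
Feature 3: Open source, with the model weights and the Megatron-Core training code being released to the community.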

Use Cases

Use case 1: Answering questions about images and text.
Use case 2: Generating descriptive text for images.
Use case 3: Analyzing text and images together and reasoning logically over them.

Target Audience

NVLM 1.0 is aimed at researchers and developers building multimodal applications.
