ai/meta-llama
The Llama model Docker image provides Meta's Llama 3.1 model packaged with vLLM for efficient deployment on NVIDIA GPUs (CUDA 12.6). Designed for advanced natural language processing (NLP) applications, the image handles complex language queries and offers a range of model sizes to suit different performance and resource needs. Use cases include interactive chatbots, summarization tools, and automated content generation.
To get started, use the following command to launch the Llama model:
docker run -it --rm --gpus=all -p 8000:8000 --name vllm ai/meta-llama:3.1-8B-Instruct-cuda-12.6 --cpu-offload-gb 5 --max-model-len 30576
This starts the model server with GPU support, offloading up to 5 GB of model weights to CPU memory (--cpu-offload-gb 5) and capping the context length at 30,576 tokens (--max-model-len 30576). You can test the model's NLP capabilities with an OpenAI-compatible completions request:
curl -s http://localhost:8000/v1/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "llm",
    "prompt": "Enter your prompt here",
    "max_tokens": 10,
    "temperature": 0.5
  }'
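For the interactive chatbot use case, vLLM's OpenAI-compatible server also serves a chat route at /v1/chat/completions. A minimal sketch, assuming the served model name is "llm" as in the completions example above and that the image's server has the chat endpoint enabled (vLLM enables it for instruct models with a chat template, which Llama 3.1 Instruct provides):

# Chat-style request against the OpenAI-compatible /v1/chat/completions route
curl -s http://localhost:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "llm",
    "messages": [
      {"role": "system", "content": "You are a helpful assistant."},
      {"role": "user", "content": "Summarize this paragraph in one sentence: ..."}
    ],
    "max_tokens": 50,
    "temperature": 0.5
  }'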
3.1-8B-Instruct, 3.1-8B-Instruct-cuda-12.6: Optimized for general-purpose NLP applications; balances performance with resource use.

The Llama models developed by Meta are distributed under the Meta Llama Community License, which grants users a non-exclusive, worldwide, non-transferable, and royalty-free limited license to use, reproduce, distribute, and modify the Llama Materials. For comprehensive details, refer to the full license text available in Meta's official GitHub repository.
docker pull ai/meta-llama
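If the default settings exceed your GPU's memory, the two vLLM flags shown earlier are the main knobs: raising --cpu-offload-gb moves more model weights into system RAM, and lowering --max-model-len reduces the per-request memory the KV cache must accommodate. A sketch with illustrative values only, not tested recommendations; adjust them to your hardware:

# Example: more aggressive CPU offload and a shorter context window
docker run -it --rm --gpus=all -p 8000:8000 --name vllm \
  ai/meta-llama:3.1-8B-Instruct-cuda-12.6 \
  --cpu-offload-gb 10 --max-model-len 8192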