Llama 3.1 70B

Meta

Chat, Code, Open Source

About

Strong open-weight model balancing capability and efficiency. Great for production workloads requiring solid reasoning without the cost of the 405B variant. Open weights allow for fine-tuning and self-hosting.

Provider

Meta

Context Window

128k tokens

Max Output

4,096 tokens

Model ID

meta/llama-3.1-70b

Capabilities

Open weights, 128k context, Tool use, Streaming

API Usage

Drop-in compatible with any OpenAI client library.

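// Run as an ES module (e.g. a .mjs file) so that top-level await is allowed.
import OpenAI from "openai";

// Point the standard OpenAI client at the DeployAI endpoint.
const client = new OpenAI({
  baseURL: "https://api.deployai.dev/v1",
  apiKey: process.env.DEPLOYAI_API_KEY, // read the key from the environment
});

// Send a single-turn chat request to Llama 3.1 70B.
const completion = await client.chat.completions.create({
  model: "meta/llama-3.1-70b",
  messages: [
    { role: "user", content: "Hello, how are you?" }
  ],
});

// Print the text of the first (and by default only) choice.
console.log(completion.choices[0].message.content);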
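
The limits and capabilities listed above (4,096 output tokens, streaming) can be folded into a small request builder. This is a minimal sketch, not part of any SDK: `buildChatRequest`, `MAX_OUTPUT_TOKENS`, and the 1,024-token default are hypothetical names and values chosen for illustration. It also assumes the endpoint honors the standard OpenAI `max_tokens` and `stream` parameters, which this page implies but does not state explicitly.

```javascript
// Values taken from the model card above.
const MODEL_ID = "meta/llama-3.1-70b";
const MAX_OUTPUT_TOKENS = 4096;

// Hypothetical helper: builds the parameter object that would be passed to
// client.chat.completions.create(). It performs no network calls.
function buildChatRequest(messages, { maxTokens = 1024, stream = false } = {}) {
  if (!Array.isArray(messages) || messages.length === 0) {
    throw new Error("messages must be a non-empty array");
  }
  // Clamp the requested output length into [1, MAX_OUTPUT_TOKENS].
  const clamped = Math.min(Math.max(1, Math.floor(maxTokens)), MAX_OUTPUT_TOKENS);
  return {
    model: MODEL_ID,
    messages,
    max_tokens: clamped, // assumption: the endpoint accepts OpenAI's max_tokens
    stream,              // assumption: the endpoint accepts OpenAI's stream flag
  };
}

// Usage with the client from the example above (needs an API key and network):
//   const stream = await client.chat.completions.create(
//     buildChatRequest([{ role: "user", content: "Hello" }], { stream: true })
//   );
//   for await (const chunk of stream) {
//     process.stdout.write(chunk.choices[0]?.delta?.content ?? "");
//   }
```

Clamping on the client side is a design choice: it keeps an oversized `maxTokens` from reaching the API at all, so behavior does not depend on how the server treats out-of-range values.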

Related Models

Llama 3.1 405B

Open Source

Largest open-weight model. State-of-the-art performance across benchmarks with full open access.

Meta

Llama 3.1 8B

Value

Efficient open-weight model for fast inference. Good for focused tasks where speed matters most.

Meta

GPT-4o

Popular

Most capable multimodal model. Excels at complex reasoning, coding, creative writing, and vision tasks.

OpenAI