Strong open-weight model balancing capability and efficiency. Great for production workloads requiring solid reasoning without the cost of the 405B variant. Open weights allow for fine-tuning and self-hosting.
Meta
128k tokens
4,096 tokens
meta/llama-3.1-70b
Drop-in compatible with any OpenAI client library.
import OpenAI from "openai";
const client = new OpenAI({
baseURL: "https://api.deployai.dev/v1",
apiKey: process.env.DEPLOYAI_API_KEY,
});
const completion = await client.chat.completions.create({
model: "meta/llama-3.1-70b",
messages: [
{ role: "user", content: "Hello, how are you?" }
],
});
console.log(completion.choices[0].message.content);Largest open-weight model. State-of-the-art performance across benchmarks with full open access.
Efficient open-weight model for fast inference. Good for focused tasks where speed matters most.
Most capable multimodal model. Excels at complex reasoning, coding, creative writing, and vision tasks.