Google's ultra-fast multimodal model with native tool use, code execution, and a 1-million-token context window, the largest available. Excels at processing very long documents, handling multimodal inputs, and tasks that require code execution within the model's response.
Context window: 1M tokens
Max output: 8,192 tokens
Model ID: google/gemini-2.0-flash
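The two token figures above (1M and 8,192) are read here as the context window and the maximum output length. Under that reading, a rough pre-flight check like the sketch below can help decide whether a long document is likely to fit before a request is sent. The helper names (`estimateTokens`, `checkBudget`) are hypothetical, and so is the 4-characters-per-token heuristic; the model's real tokenizer will give different counts. Whether output tokens count against the 1M window is not stated, so the sketch conservatively reserves room for them.

```typescript
// Limits copied from the listing; the interpretation of 8,192 as the
// maximum output length is an assumption of this sketch.
const CONTEXT_WINDOW_TOKENS = 1_000_000;
const MAX_OUTPUT_TOKENS = 8_192;

// Assumption: roughly 4 characters per token for English text.
// This is a heuristic, not the model's actual tokenizer.
const CHARS_PER_TOKEN = 4;

// Hypothetical helper: coarse token estimate from character count.
function estimateTokens(text: string): number {
  return Math.ceil(text.length / CHARS_PER_TOKEN);
}

interface BudgetCheck {
  promptTokens: number;         // estimated tokens in the prompt
  reservedOutputTokens: number; // requested output, clamped to the output limit
  fits: boolean;                // true if prompt + reserved output <= context window
}

// Hypothetical helper: checks whether a prompt plus the requested output
// is likely to fit. Conservatively counts output against the context window.
function checkBudget(
  prompt: string,
  maxOutputTokens: number = MAX_OUTPUT_TOKENS,
): BudgetCheck {
  const reservedOutputTokens = Math.min(maxOutputTokens, MAX_OUTPUT_TOKENS);
  const promptTokens = estimateTokens(prompt);
  return {
    promptTokens,
    reservedOutputTokens,
    fits: promptTokens + reservedOutputTokens <= CONTEXT_WINDOW_TOKENS,
  };
}

// Example: a 400,000-character document.
const report = checkBudget("x".repeat(400_000));
console.log(report);
```

If the check passes, `reservedOutputTokens` could be passed as `max_tokens` in a chat completion request; whether this endpoint honors that parameter for this model is not confirmed by the listing.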
Drop-in compatible with any OpenAI client library.
import OpenAI from "openai";

// Point the OpenAI client at the DeployAI endpoint.
const client = new OpenAI({
  baseURL: "https://api.deployai.dev/v1",
  apiKey: process.env.DEPLOYAI_API_KEY,
});

// Top-level await requires an ES module context.
const completion = await client.chat.completions.create({
  model: "google/gemini-2.0-flash",
  messages: [
    { role: "user", content: "Hello, how are you?" },
  ],
});

console.log(completion.choices[0].message.content);

Powerful multimodal model with a massive context window. Excels at long-document understanding and analysis.
Most capable multimodal model. Excels at complex reasoning, coding, creative writing, and vision tasks.
Affordable and efficient small model for fast, focused tasks. Great cost-to-performance ratio.