OpenRouter | Google: Gemini 3.5 Flash

Last updated: 0 days ago

Model By: Google

Type: MText; Context: 1049K tokens; Max output: 65500 tokens; Latency: 2620 ms; Throughput: 132 qps Google's most cost-effective video generation model, designed for high-volume applications and rapid iteration. Veo 3.1 Lite generates 720p and 1080p video from text or image prompts with native synch It is highly optimized for coding proficiency and parallel agentic execution loops, supporting text, image, video, audio, and PDF inputs.

Google's most cost-effective video generation model, designed for high-volume applications and rapid iteration. Veo 3.1 Lite generates 720p and 1080p video from text or image prompts with native synch

It is highly optimized for coding proficiency and parallel agentic execution loops, supporting text, image, video, audio, and PDF inputs.

MTexthigh-throughputlong-contextmultimodalfunction callingstructured outputcode completionprovider-api

Latency

2620ms

Throughput

132 QPS

Total Context

1000K

Max Output

65,500 tokens

Input Price

$0.08333/M

Output Price

$1.5/M

API Parameters & Capabilities

Model Type

MText

Parameter Size

Input Modality

text, image, video, file, audio

Output Modality

text

Inference Speed

165 token/s

Success Rate

Peak Concurrency

132 QPS

Release Date

5/19/2026

Integration & Pricing Details

Pricing Mode

Free Tier

Supported Languages

SDK

OpenRouter SDK / OpenAI-compatible API

API Key Acquisition

OpenRouter dashboard API keys

Rate Limit

Docs, Examples & Related Links

API Endpoint Official Docs Related Link

Example Code

import { OpenRouter } from "@openrouter/sdk"; const openrouter = new OpenRouter({  apiKey: "<OPENROUTER_API_KEY>"}); // Stream the response to get reasoning tokens in usageconst stream = await openrouter.chat.send({  model: "google/gemini-3.5-flash",  messages: [    {      role: "user",      content: "How many r's are in the word 'strawberry'?"    }  ],  stream: true}); let response = "";for await (const chunk of stream) {  const content = chunk.choices[0]?.delta?.content;  if (content) {    response += content;    process.stdout.write(content);  }   // Usage information comes in the final chunk  if (chunk.usage) {    console.log("\nReasoning tokens:", chunk.usage.reasoningTokens);  }}

User Reviews

0 verified user reviews

Loading reviews...

Overall Rating

0.0/ 5

0 Reviews

API Usability0.0

Stability0.0

Speed0.0

Docs Quality0.0

Add Review & Rating

Review Title

Overall Rating5.0

API Usability5.0

Stability5.0

Speed5.0

Documentation Quality5.0

Review Content

FAQ & Compliance

What use cases is this API best for?

It is well-suited for chatbots, code generation, content summarization, and enterprise knowledge Q&A scenarios requiring strong reasoning.

How does the pricing work?

Pricing is based on token usage for both input and output. Check the pricing section for detailed rates.

How does authentication work?

This API uses API key authentication. You can generate and manage your API keys in the developer dashboard.

Are there rate limits?

Yes, there are rate limits depending on your subscription plan. Free tier has lower limits compared to paid plans.