Gemini

Google: Gemini 2.5 Flash Lite

Chat
google/gemini-2.5-flash-lite

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance across common benchmarks compared to earlier Flash models. By default, [thinking] (i.e. multi-pass reasoning) is disabled to prioritize speed, but developers can enable it via the Reasoning API parameter to selectively trade off cost for intelligence.

上下文視窗
1M
最大輸出 Token
66K
發布日期
2025-07-22
能力
視覺函式呼叫提示快取PDF 輸入
可用供應商
GoogleCloudVertex
支援的協定
OpenAIopenaiGeminigemini

供应商

GoogleCloudVertex
輸入 Token
$0.1/M
輸出 Token
$0.4/M
快取讀取
$0.025/M
快取寫入
$1/M
音訊輸入
$0.3/M
快取音訊
$0.3/M
網路搜尋
$0.035/R
接入协议
OpenAIopenai/v1/chat/completions
Geminigemini

程式碼範例

from google import genai
client = genai.Client(
api_key="YOUR_OFOX_API_KEY",
http_options={"api_version": "v1beta", "base_url": "https://api.ofox.io/gemini"},
)
response = client.models.generate_content(
model="google/gemini-2.5-flash-lite",
contents="Hello!",
)
print(response.text)

運行狀態

常見問題

Google: Gemini 2.5 Flash Lite 在 Ofox.ai 上的價格為輸入 $0.1/M/百萬 Token,輸出 $0.4/M/百萬 Token。按量計費,無月費。