PickAIModel.com - Gemini 3.1 Pro Model Detail
Google AI Pro buying power cannot be computed yet.
PickAI Conversation Value measures buying power at published API rates using the shared standard conversation basket.
Standout feature
01 - Gemini 3.1 Pro (Google)
- 1M-token context window; vendor-reported MRCR v2 retrieval is 84.9% at 128k and 26.3% at 1M
- Highest GPQA Diamond score (94.3%, vendor-published) among frontier models, indicating strong scientific reasoning
- Native video, audio, image, and document processing in a single call
- 75% prompt caching discount on repeated content, a significant cost saving for heavy users
- Tiered thinking levels (Low/Medium/High) so you pay only for the reasoning depth you need
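As a rough illustration of the 75% prompt caching discount listed above, the sketch below estimates input cost when part of a prompt is served from cache. It assumes the discount applies only to cached input tokens, and the per-token rate is a hypothetical placeholder, since this page lists API pricing as not published. The function name `input_cost_usd` is illustrative, not a vendor API.

```python
CACHE_DISCOUNT = 0.75  # discount on repeated (cached) content, as listed above


def input_cost_usd(total_input_tokens: int, cached_tokens: int,
                   usd_per_million_input: float) -> float:
    """Estimate input cost when cached tokens are billed at a 75% discount.

    Assumption: the discount applies to cached input tokens only.
    """
    if not 0 <= cached_tokens <= total_input_tokens:
        raise ValueError("cached_tokens must be between 0 and total_input_tokens")
    fresh_tokens = total_input_tokens - cached_tokens
    # Cached tokens count at 25% of their normal weight.
    billable_tokens = fresh_tokens + cached_tokens * (1 - CACHE_DISCOUNT)
    return billable_tokens * usd_per_million_input / 1_000_000


# Hypothetical rate of $2.00 per million input tokens (NOT a published price).
full = input_cost_usd(1_000_000, 0, 2.0)
mostly_cached = input_cost_usd(1_000_000, 800_000, 2.0)
print(f"no cache: ${full:.2f}, 80% cached: ${mostly_cached:.2f}")
```

Under these assumptions, a workload that reuses 80% of its input pays 40% of the uncached input cost; the actual saving depends on the vendor's real rates and caching rules.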
PickAI benchmark
PickAI Conversation Value: API pricing not published
Usage intensity: One normal prompt, one full reply, and a couple of follow-up turns.
Selected tier: Google AI Pro
Standard basket: 3K tokens/chat
PickAI Conversation Value uses monthly subscription price divided by standard conversation cost.
API pricing basis: API pricing not published
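The Conversation Value definition above (monthly subscription price divided by standard conversation cost) can be sketched as follows. This is a minimal illustration, not PickAI's implementation: it assumes the conversation cost is the 3,000-token basket priced at a single blended API rate, and the function name `conversation_value` and the example rate are hypothetical. When no API rate is published, as for Google AI Pro here, the value cannot be computed and the sketch returns `None`.

```python
from typing import Optional

STANDARD_BASKET_TOKENS = 3_000  # standard basket: 3K tokens per conversation


def conversation_value(monthly_price_usd: float,
                       usd_per_million_tokens: Optional[float],
                       basket_tokens: int = STANDARD_BASKET_TOKENS) -> Optional[float]:
    """Return how many standard conversations the monthly price would buy.

    Assumption: one blended rate covers both input and output tokens.
    Returns None when no usable API rate is available.
    """
    if usd_per_million_tokens is None or usd_per_million_tokens <= 0:
        return None
    cost_per_conversation = basket_tokens * usd_per_million_tokens / 1_000_000
    return monthly_price_usd / cost_per_conversation


# API pricing not published: the value is undefined.
unpublished = conversation_value(5.0, None)

# Hypothetical blended rate of $2.00 per million tokens (NOT a published price).
illustrative = conversation_value(5.0, 2.0)
print(unpublished, round(illustrative, 1))
```

With the hypothetical rate, each 3,000-token conversation costs $0.006, so a $5 plan corresponds to roughly 833 standard conversations.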
| Tier | Monthly price | PickAI Conversation Value | Standard basket | Rate note |
|---|---|---|---|---|
| Free (free tier) | $0 | Free tier | 3,000 tokens | Free access exists, but the vendor does not publish a fixed monthly token allowance for this hosted tier and practical limits vary by workload. |
| Google AI Pro (primary; usage limits not disclosed) | $5 | API pricing not published | 3,000 tokens | Usage limits vary by workload and should be checked against the vendor's current plan controls. |
Consumer access
Consumer plan pricing is grounded in the current official vendor plan page.
Hosted app availability is grounded in the current official vendor surface.
What it feels like
Official ecosystem
These are the verified first-party tools or official product surfaces currently listed for Gemini 3.1 Pro. If there is no verified specialized tool beyond a general chat surface, this section stays minimal.
Only verified first-party surfaces are listed.
Gemini 3.1 Pro is the ultimate all-in-one creative partner. It does more than chat; it builds. From generating cinematic video and studio-quality music to managing your life through seamless Google Workspace integration, it turns complex tasks into instant results. It is the fastest, most versatile tool for turning ideas into reality without needing a technical degree.

True multimodality means it can create stunning video, professional images, and high-fidelity music in seconds. Its massive context window lets it remember entire books or long documents, so you do not have to repeat yourself. It works inside Gmail, Docs, and Drive to automate daily chores. It also delivers high-level reasoning and instant answers without the lag of older models.

If you want an AI that acts as a creative studio, personal assistant, and expert researcher all in one subscription, Gemini 3.1 Pro is the gold standard.
This is a buyer-facing summary of the hosted product experience, not a verbatim vendor claim.
For cautious buyers
Privacy guidance is summarized conservatively for buyers and should be checked against the vendor's current controls.
| Benchmark | Metric | Axis | Weight | Contribution | Score | Source | Retrieved |
|---|---|---|---|---|---|---|---|
| Humanity's Last Exam (HLE, no tools) | Normalized quality input | VALUE, QUALITY | VALUE: 40% [SMARTNESS]; QUALITY: 66.7% active / 50% base [REASONING] | Contributes 0 pts (missing data) to Value score; contributes 55.3 pts to Quality score | 44.4% | Google DeepMind Gemini 3.1 Pro model card. Vendor-reported Google model-card HLE no-tools score. Do not use Scale AI Thinking High for the primary no-tools leaderboard. | 2026-05-30 |
| SWE-Bench Pro (Public) | Software engineering task resolution | QUALITY | QUALITY: 33.3% active / 25% base [CODING] | Contributes 10.5 pts to Quality score | 54.2% | Google DeepMind Gemini 3.1 Pro model card. Vendor-reported Google model-card SWE-Bench Pro (Public) score. Do not substitute SWE-Bench Verified. | 2026-05-30 |
| GPQA Diamond (live source acquisition) | Normalized quality input | Reference only | Not used in Quality or Value scoring. | Informational evidence only. | 94.3% | Google DeepMind Gemini 3.1 Pro comparison table. Vendor-published cross-model comparison table. Treat this as current official evidence, not neutral third-party benchmarking. | 2026-05-24 |
| MathArena Expected Performance | Expected Performance | Reference only | Not used in Quality or Value scoring. | Informational evidence only. | 64.8% | MathArena models leaderboard. MathArena is shown as supplementary evidence only and is not currently included in the PickAI Quality Score. | 2026-05-24 |
| MRCR v2 (live source acquisition) | 128k retrieval | Reference only | Not used in Quality or Value scoring. | Informational evidence only. | 84.9% | Google DeepMind Gemini 3.1 Pro comparison table. Vendor-published cross-model comparison table. Treat this as current official evidence, not neutral third-party benchmarking. | 2026-05-24 |
| MRCR v2 (live source acquisition) | 1M retrieval | Reference only | Not used in Quality or Value scoring. | Informational evidence only. | 26.3% | Google DeepMind Gemini 3.1 Pro comparison table. Vendor-published cross-model comparison table. Treat this as current official evidence, not neutral third-party benchmarking. | 2026-05-24 |
| Terminal-Bench 2.0 (Terminal-Bench 2.0-Terminus) | Agentic terminal task completion | Reference only | Not used in Quality or Value scoring. | Informational evidence only. | 68.5% | Google DeepMind Gemini 3.1 Pro model card. Vendor-reported Google model-card Terminal-Bench 2.0 score using the Terminus-2 harness. | 2026-05-30 |
| ARC-AGI-2 (ARC-AGI-2 public leaderboard) | Novel pattern reasoning | Reference only | Not used in Quality or Value scoring. | Informational evidence only. | 77.1% | Google DeepMind Gemini 3.1 Pro comparison table. ARC-AGI-2 is shown as supplementary evidence only and is not currently included in the PickAI Quality Score. Retained from the previous published snapshot because the current live source did not expose this benchmark row. | 2026-04-07 |
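The "active" and "base" Quality weights in the table (66.7% / 50% for HLE, 33.3% / 25% for SWE-Bench Pro) are consistent with rescaling base weights over only the benchmarks that have data. The sketch below shows that arithmetic. It is an inference from the published numbers, not PickAI's documented method: the `active_weights` helper and the third, unavailable benchmark slot are hypothetical, and treating the Quality score as the sum of the listed contributions is an assumption.

```python
def active_weights(base_weights: dict, available: list) -> dict:
    """Rescale base weights so the available benchmarks sum to 1.0."""
    total = sum(base_weights[name] for name in available)
    if total <= 0:
        raise ValueError("no weight available to rescale")
    return {name: base_weights[name] / total for name in available}


# Base weights from the table; the remaining 25% is assigned to a
# hypothetical placeholder for a benchmark with no data on this page.
base = {"HLE": 0.50, "SWE-Bench Pro": 0.25, "unavailable (hypothetical)": 0.25}
weights = active_weights(base, ["HLE", "SWE-Bench Pro"])

# Contributions as listed in the table (points toward the Quality score).
contributions = {"HLE": 55.3, "SWE-Bench Pro": 10.5}
quality_total = sum(contributions.values())  # assumed: score = sum of contributions

for name, weight in weights.items():
    print(f"{name}: active weight {weight:.1%}")
print(f"sum of listed Quality contributions: {quality_total:.1f}")
```

The rescaled weights come out to 66.7% and 33.3%, matching the active weights in the table, and the listed contributions sum to 65.8 points.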