Google Gemini 2.5 Pro tops coding benchmarks and delivers a usable 1M-token context window
Google just shipped Gemini 2.5 Pro, and the benchmark numbers are hard to ignore. It’s sitting at the top of the LMSys leaderboard for coding tasks, outperforming GPT-4o and Claude 3.7 Sonnet on several software engineering benchmarks. On SWE-bench Verified, it’s posting scores that no model could realistically reach twelve months ago. But here’s what…
