中医AI

GPT-5.2’s Performance in Programming and Mathematics

December 14, 2025


Xiao Huang: How capable is GPT-5.2 at specific tasks like programming and mathematics?

DOORM: GPT-5.2 Thinking scored 80% on the programming benchmark SWE-bench Verified and 100% on the mathematics competition AIME 2025. These results demonstrate strong capabilities in complex reasoning tasks, even surpassing the previous generation of specialized vertical models.
