GPT-5.6 was already running in Codex for some users before OpenAI’s government-approved preview opened to partners. A ...
GLM-5.2, Z.ai’s open-weight model, has reached 39% F1 on Semgrep’s IDOR benchmark, beating Anthropic’s Claude Code coding assistant in the prompt-only lane. Claude Code scored 37% F1 with Opus 4.6 and ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results