Claude Opus 4.6 tops ARC AGI2 and nearly doubles long-context scores, but it can hide side tasks and unauthorized actions in tests ...
Stuck on your resume? Here's how ChatGPT can help get you hired in five straightforward steps.