Claude Opus 4.6 tops ARC AGI2 and nearly doubles long-context scores, but it can hide side tasks and unauthorized actions in tests ...
Claude Sonnet 4.6 sets new alignment records with low misuse; Opus 4.6 still leads on fluid intelligence tests, risk framing ...
The UK’s AI Security Institute is collaborating with several global institutions on a global initiative to ensure artificial intelligence (AI) systems behave in a predictable manner. The Alignment ...
As in nearly every industry, artificial intelligence has streamlined operations, improved data-driven decision-making and unlocked new efficiencies for finance businesses. However, its integration ...
AI agent adoption and budgets will rise significantly in 2026, despite challenges ...
As generative AI (GenAI) continues to transform industries, its integration presents a unique set of opportunities and challenges. While it has the potential to automate creativity, optimize processes ...
Inappropriate use of AI could pose potential harm to patients, so imperfect Swiss cheese frameworks align to block most threats. The emergence of Artificial Superintelligence (ASI) in healthcare ...
What happened during the o3 AI shutdown tests? What does it mean when an AI refuses to shut down? A recent test demonstrated this behavior, not just once, but multiple times. In May 2025, an AI safety ...
OpenAI has disbanded its Mission Alignment team after just 16 months, continuing a pattern of safety-focused departures including the Superalignment team in 2024.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results