CMU Builds Self-Improving AI Agents via Constitutional RL
Carnegie Mellon researchers introduce Constitutional RL, a framework enabling AI agents to self-improve while following …
136 articles about 'AI Safety'
Carnegie Mellon researchers introduce Constitutional RL, a framework enabling AI agents to self-improve while following …
Governments worldwide race to regulate AI, but striking the right balance between fostering innovation and protecting th…
Former Google CEO Eric Schmidt says artificial general intelligence could emerge within 3 years, raising urgent question…
A bipartisan Senate committee proposes a new regulatory framework targeting foundation model developers with transparenc…
OpenAI publishes new research on superalignment techniques aimed at keeping frontier language models safe and aligned wi…
Experts warn Trump's AI safety testing framework faces major pitfalls despite vindicating Biden's original approach.
Security researchers uncover a universal jailbreak vulnerability that bypasses safety guardrails across GPT-4, Claude, G…
Anthropic publishes new Constitutional AI 2.0 paper advancing scalable oversight methods for safer, more aligned AI syst…
Former OpenAI employees testify before U.S. Senate, raising alarms about internal safety culture and calling for federal…
After dismissing Biden's AI safety framework, the Trump administration now embraces testing requirements following alarm…
Meta overhauls its age-verification system with AI that analyzes bone structure and height after simple disguises fooled…
Mira Murati testified under oath that Sam Altman made false representations about safety review processes for a new AI m…