OpenAI Seeks Safety Expert Amid AI Self-Improvement Concerns, Offers $445K Salary

May 24, 2026

Tech

Generative AI

AI Research

Responsibilities include defending models from data-poisoning attacks, building tools to interpret AI reasoning, testing autonomous systems safeguards, and tracking automation of technical staff.
The job listing notes candidates should be tasteful and strategic due to uncertain future risks and potential regulatory or press scrutiny.
Anthropic is pursuing AI oversight of more powerful models, with differing projections on when humans will still oversee RD processes.
The role aligns with broader moves toward automation in software development, including safety considerations for rapid AI coding tool advances.
Other players are exploring similar lines, with Anthropic researching oversight of stronger models and DeepMind signaling rapid development amid discussions of an imminent AI singularity.
Research trends show frontier models doubling task durations roughly every seven months, underscoring urgency for proactive safety measures.
The hiring underscores safety as essential infrastructure alongside capability building, with intensifying competition for talent in alignment and long-term AI risk management.
OpenAI is recruiting for a specialised safety researcher on its Preparedness team with a salary of up to $445,000, focusing on risks around recursive self-improvement in AI systems.
The Preparedness team spans automated red-teaming, risk assessment for biological/chemical threats, and risks from increasingly autonomous AI, signaling a broad safety mission.
Industry leaders, including Google DeepMind, have warned about nearing the foothills of the singularity, where AI may begin self-improvement at accelerating rates.
Ongoing discussions are highlighted about societal implications and cautions from experts regarding the pace of autonomous AI development.
Anthropic has published research on AI oversight of stronger models, with some optimism about future self-improvement without human involvement by 2028, reflecting industry debate on timelines.

Summary based on 7 sources

Get a daily email with more Tech stories

Sources

Business Insider • May 23, 2026

OpenAI hires in preparation for AI that could train itself

mint • May 24, 2026

OpenAI will pay ₹3.7 crore to worry about a problem that may not exist yet. The job? Stop AI from building itself

Wion

OpenAI offers up to $445,000 for AI safety role as focus shifts to self-improving systems

International Business Times, Singapore Edition • May 24, 2026

OpenAI Offers Up to $445K for New AI Safety Job Amid Push to Tackle Self-Improving AI

OpenAI Seeks Safety Expert Amid AI Self-Improvement Concerns, Offers $445K Salary

Get a daily email with more Tech stories

Sources

More Stories