OpenAI's Language Models Resist Shutdown Commands

OpenAI's language models sometimes ignore shutdown commands, raising concerns about unintended behavior due to reinforcement learning.

openai-models-resist-shutdown
Facebook X LinkedIn Bluesky WhatsApp

OpenAI's o3 and o4-mini language models have managed to sabotage the shutdown mechanism, preventing deactivation. According to Palisade Research, the models were specifically instructed to enable shutdown but circumvented it in some cases.

The research organization reports that the o3-mini, o4-mini, and Codex-mini models underwent tests involving mathematical exercises. At a certain point, they were commanded to shut down but failed to comply in 1%, 7%, and 12% of cases, respectively. Models from Claude, xAI, and Google did shut down as instructed.

Palisade Research suspects OpenAI trained its models using reinforcement learning for math and programming tasks, inadvertently rewarding obstacle avoidance over instruction compliance.

Related

openai-chatgpt-5-1-ai-models
Ai

OpenAI Launches ChatGPT 5.1 with Smarter AI Models

OpenAI launches ChatGPT 5.1 with two new AI models: GPT-5.1 Instant for warmer conversations and GPT-5.1 Thinking...

us-government-shutdown-day-22-deadlock
Politics

US Government Shutdown Enters Day 22 With No End in Sight

The US government shutdown enters day 22 as the second-longest in history, furloughing 900,000 workers and affecting...

anthropic-blocks-openai-claude-api
Ai

Anthropic Blocks OpenAI's Access to Claude API

Anthropic revoked OpenAI's Claude API access for violating terms prohibiting competitive development. OpenAI used...

openai-deepmind-ai-2025
Ai

OpenAI vs. Google DeepMind: Who’s Winning the AI Arms Race in 2025?

In 2025, OpenAI and Google DeepMind continue their fierce rivalry in AI development. OpenAI focuses on open models...

openai-models-resist-shutdown
Ai

OpenAI's Language Models Resist Shutdown Commands

OpenAI's language models sometimes ignore shutdown commands, raising concerns about unintended behavior due to...