Research

AllPublicationConclusionMilestoneRelease

PublicationDec 2, 2024

Reinforcement-Guided Microlearning for SAT Performance Optimization

Our research paper on using reinforcement learning to optimize SAT microlearning. Click to read the full paper.

SafetyMay 23, 2025

Addendum to OpenAI o3 and o4-mini system card: OpenAI o3 Operator

We are replacing the existing GPT-4o-based model for Operator with a version based on OpenAI o3. The API version will remain based on 4o.

ReleaseMay 16, 2025

Introducing Codex

Introducing Codex: a cloud-based software engineering agent that can work on many tasks in parallel, powered by codex-1. With Codex, developers can simultaneously deploy multiple agents ...

SafetyMay 16, 2025

Addendum to o3 and o4-mini system card: Codex

Codex is a cloud-based coding agent. Codex is powered by codex-1, a version of OpenAI o3 optimized for software engineering. codex-1 was trained using reinforcement learning on real-wor...

PublicationJun 18, 2025

Toward understanding and preventing misalignment generalization

We study how training on incorrect responses can cause broader misalignment in language models and identify an internal feature driving this behavior—one that can be reversed with...