All Sources (19)

OpenAI Blog

OpenAI Blog

openai.com/blog/
48
Articles
6月19日 02:01
Last updated
No Image

Preparing for future AI risks in biology

OpenAI Blog
tool
Toward understanding and preventing misalignment generalization

Toward understanding and preventing misalignment generalization

We study how training on incorrect responses can cause broader misalignment in language models and identify an internal feature driving this behavior—one that can be reversed with minimal fine-tuning.

OpenAI Blog
tool