OpenAI Blog
openai.com/blog/ 48
Articles
6月19日 02:01
Last updated
No Image
Preparing for future AI risks in biology
OpenAI Blog
tool

Toward understanding and preventing misalignment generalization
We study how training on incorrect responses can cause broader misalignment in language models and identify an internal feature driving this behavior—one that can be reversed with minimal fine-tuning.
OpenAI Blog
tool