A new study made a version of GPT-5 Thinking admit its own misbehavior. But it's not a quick fix for bigger safety issues.
Source: ADnet
Source Link: https://www.zdnet.com/article/openai-is-training-models-to-confess-when-they-lie-what-it-means-for-future-ai/