
Engadget
shared a link post in group #Gadgets

www.engadget.com
OpenAI's new confession system teaches models to be honest about bad behaviors
OpenAI is working on a framework that will train AI models to acknowledge when they've engaged in undesirable behavior.
