Found Angels Models Interview

OpenAI found features in AI models that correspond to different ‘personas’

OpenAI researchers say they’ve discovered hidden features inside AI models that correspond to misaligned “personas,” according to new research published by the company on Wednesday. By looking at an ...


