OpenAI’s flagship AI model has gotten more trustworthy but easier to trick
Image: Microsoft

OpenAI’s GPT-4 large language model may be more trustworthy than GPT-3.5 but also more vulnerable to jailbreaking and bias, according to research backed by Microsoft.
The paper was written by researchers from the University of Illinois Urbana-Champaign, Stanford University, the University of California, Berkeley, the Center for AI Safety, and Microsoft Research.