Facebook’s New AI System Has a ‘High Propensity’ for Racism and Bias


From the article: In a paper accompanying the release, Meta researchers write that the model “has a high propensity to generate toxic language and reinforce harmful stereotypes, even when provided with a relatively innocuous prompt.” This means it’s easy to get biased and harmful results even when you’re not trying. The system is also vulnerable to “adversarial prompts,” where small, trivial changes in phrasing can be used to evade the system’s safeguards and produce toxic content.

The researchers further warn that the system has an even higher risk of generating toxic results than its predecessors, writing that “OPT-175B has a higher toxicity rate than either PaLM or Davinci,” referring to two previous language models. They suspect this is in part due to the training data including unfiltered text taken from social media conversations, which increases the model’s tendency to both recognize and generate hate speech.



Source link

Related articles

Markets Weekly Outlook: Gold and Oil Diverge as Market Sentiment Improves

Market sentiment improves amid US-China commerce speak optimism, regardless of considerations over tariff impacts on the worldwide economic system. Key financial knowledge releases are anticipated throughout Asia, Europe, and the US, with a concentrate...

Telos Company 2025 Q1 – Outcomes – Earnings Name Presentation (NASDAQ:TLS)

This text was written byObserveSearching for Alpha's transcripts staff is liable for the event of all of our transcript-related tasks. We presently publish hundreds of quarterly earnings calls per quarter on our website...

Credit score Agricole: Right here is why we preserve an above-consensus USD outlook

Credit score Agricole maintains an above-consensus medium-term bullish outlook on the USD, anticipating a restoration in H2 2025 and 2026 pushed by supportive fiscal coverage, easing monetary situations, and sticky inflation. Whereas some...

I requested the Google Pixel 9a to make a picture of a profitable particular person and the outcomes had been depressingly predictable

If a brand new telephone gave me an occasional electrical shock, I wouldn’t suggest it. Even when it solely shocked me often, after I open a selected app, I might say no. If...
spot_img

Latest articles

LEAVE A REPLY

Please enter your comment!
Please enter your name here

WP2Social Auto Publish Powered By : XYZScripts.com