More Than 1,000 Images of Child Sexual Abuse Found in Data Used to Train AI

More than a thousand images of child sexual abuse material were found in a massive public dataset used to train popular AI image-generating models, a new report has found.

New York (CNN) — More than a thousand images of child sexual abuse material were found in a massive public dataset used to train popular AI image-generating models, Stanford Internet Observatory researchers said in a study published earlier this week.

The presence of these images in the training data may make it easier for AI models to create new and realistic AI-generated images of child abuse content, or “deepfake” images of children being exploited.

The findings also raise a slew of new concerns surrounding the opaque nature of the training data that serves as the foundation of a new crop of powerful generative AI tools.

The massive dataset that the Stanford researchers examined, known as LAION 5B, contains billions of images that have been scraped from the internet, including from social media and adult entertainment websites.

Of the more than five billion images in the dataset, the Stanford researchers said they identified at least 1,008 instances of child sexual abuse material.

LAION, the German nonprofit behind the dataset, said in a statement on its website that it has a “zero tolerance policy for illegal content.”

The organization said that it received a copy of the report from Stanford and is in the process of evaluating its findings. It also noted that datasets go through “intensive filtering tools” to ensure they are safe and comply with the law.

“In an abundance of caution we have taken LAION 5B offline,” the organization added, saying that it is working with the UK-based Internet Watch Foundation “to find and remove links that may still point to suspicious, potentially unlawful content on the public web.”

LAION said it plans to complete a full safety review of LAION 5B by the second half of January and to republish the dataset at that time.

The Stanford team, meanwhile, said that removal of the identified images is currently in progress after the researchers reported the image URLs to the National Center for Missing and Exploited Children and the Canadian Centre for Child Protection.

In the report, the researchers said that while developers of LAION 5B did attempt to filter certain explicit content, an earlier version of the popular image-generating model Stable Diffusion was ultimately trained on “a wide array of content, both explicit and otherwise.”

A spokesperson for Stability AI, the London-based startup behind Stable Diffusion, told CNN in a statement that this earlier version, Stable Diffusion 1.5, was released by a separate company and not by Stability AI.

And the Stanford researchers do note that Stable Diffusion 2.0 largely filtered out results that were deemed unsafe, and as a result had little to no explicit material in the training set.

“This report focuses on the LAION-5b dataset as a whole,” the Stability AI spokesperson told CNN in a statement. “Stability AI models were trained on a filtered subset of that dataset. In addition, we subsequently fine-tuned these models to mitigate residual behaviors.”

The spokesperson added that Stability AI hosts only versions of Stable Diffusion that include filters that keep unsafe content from reaching the models.

“By removing that content before it ever reaches the model, we can help to prevent the model from generating unsafe content,” the spokesperson said, adding that the company prohibits use of its products for unlawful activity.

But the Stanford researchers note in the report that Stable Diffusion 1.5, which is still used in some corners of the internet, remains “the most popular model for generating explicit imagery.”

As part of their recommendations, the researchers said that models based on Stable Diffusion 1.5 should be “deprecated and distribution ceased where feasible.”

More broadly, the Stanford report said that massive web-scale datasets are highly problematic, even with attempts at safety filtering, not just because they may include child sexual abuse material but also because of other privacy and copyright concerns that arise from their use.

The report recommended that such datasets should be restricted to “research settings only” and that only “more curated and well-sourced datasets” should be used for publicly distributed models.

Catherine Thorbecke