Stanford researchers find Mastodon has a massive child abuse material problem (www.theverge.com)

submitted 1 year ago* (last edited 1 year ago) by corb3t@lemmy.world to c/technology@lemmy.ml

185 comments fedilink hide all child comments

Not a good look for Mastodon - what can be done to automate the removal of CSAM?

top 50 comments

sorted by: hot top controversial new old

[-] fubo@lemmy.world 20 points 1 year ago* (last edited 1 year ago)

https://stacks.stanford.edu/file/druid:vb515nd6874/20230724-fediverse-csam-report.pdf

I'd suggest that anyone who cares about the issue take the time to read the actual report, not just drama-oriented news articles about it.

[-] Dee@lemmy.world 4 points 1 year ago

So if I'm understanding right, based on their recommendations this will all be addressed as more moderation and QOL tools are introduced as we move further down the development roadmap?

[-] duncesplayed@lemmy.one 4 points 1 year ago

If I can try to summarize the main findings:

Computer-generated (e.g.., Stable Diffusion) child porn is not criminalized in Japan, and so many Japanese Mastodon servers don't remove it
Porn involving real children is removed, but not immediately, as it depends on instance admins to catch it, and they have other things to do. Also, when an account is banned, the Mastodon server software is not sending out a "delete" for all of their posted material (which would signal other instances to delete it)

Problem #2 can hopefully be improved with better tooling. I don't know what you do about problem #1, though.

[-] CaptainBasculin@lemmy.ml 3 points 1 year ago

Such a signal exists in the ActivityPub protocol, so I wonder why it's not being used.

[-] HughJanus@lemmy.ml 2 points 1 year ago

I don't know what you do about problem #1, though.

Well the simple answer is that it doesn't have to be illegal to remove it.

The legal question is a lot harder, considering AI image generation has reached levels that are almost indistinguishable from reality.

[-] sugar_in_your_tea@sh.itjust.works 2 points 1 year ago* (last edited 1 year ago)

In which case, admins should err on the side of caution and remove something that might be illegal.

I personally would prefer to have nothing remotely close to CSAM, but as long as children aren't being harmed in any conceivable way, I don't think it would be illegal to post art containing children. But communities should absolutely manage things however they think is best for their community.

In other words, I don't think #1 is a problem at all, imo things should only be illegal if there's a clear victim.

load more comments (14 replies)

[-] mindbleach@lemmy.world 1 points 1 year ago

4.1 Illustrated and Computer-Generated CSAM

Stopped reading.

Child abuse laws "exclude anime" for the same reason animal cruelty laws "exclude lettuce." Drawings are not children.

Drawings are not real.

Half the goddamn point of saying CSAM instead of CP is to make clear that Bart Simpson doesn't count. Bart Simpson is not real. It is fundamentally impossible to violate Bart Simpson's rights, because he doesn't fucking exist. There is nothing to protect him from. He cannot be harmed. He is imaginary.

This cannot be a controversial statement. Anyone who can't distinguish fiction from real life has brain problems.

You can't rape someone in MS Paint. Songs about murder don't leave a body. If you write about robbing Fort Knox, the gold is still there. We're not about to arrest Mads Mikkelsen for eating people. It did not happen. It was not real.

If you still want to get mad at people for jerking off to the wrong fantasies, that is an entirely different problem from photographs of child rape.

[-] DrQuint@lemmy.world 3 points 1 year ago

Oh, wait, Japanese in the other comment, now I get it. This conversation is a about AI Loli porn.

Pfft, of course, that's why no one is saying the words they mean, because it suddenly becomes much harder to take the stance since hatred towards Loli Porn is not universal.

[-] sugar_in_your_tea@sh.itjust.works 4 points 1 year ago

I mean, I think it's disgusting, but I don't think it should be illegal. I feel the same way about cigarettes, 2 girls 1 cup, and profane language. It's absolutely not for me, but that shouldn't make it illegal.

As long as there's no victim, knock yourself out with whatever disgusting, weird stuff you're into.

load more comments (72 replies)

[-] priapus@sh.itjust.works 19 points 1 year ago

Seems odd that they mention Mastodon as a Twitter alternative in this article, but do not make any mention of the fact that Twitter is also rife with these problems, more so as they lose employees and therefore moderation capabilities. These problems have been around on Twitter for far longer, and not nearly enough has been done.

[-] Dave@lemmy.nz 11 points 1 year ago* (last edited 1 year ago)

The actual report is probably better to read.

It points out that you upload to one server, and that server then sends the image to thousands of others. How do those thousands of others scan for this? In theory, using the PhotoDNA tool that large companies use, but then you have to send the every image to PhotoDNA thousands of times, once for each server (because how do you trust another server telling you it's fine?).

The report provides recommendations on how servers can use signatures and public keys to trust scan results from PhotoDNA, so images can be federated with a level of trust. It also suggests large players entering the market (Meta, Wordpress, etc) should collaborate to build these tools that all servers can use.

Basically the original report points out the ease of finding CSAM on mastodon, and addresses the challenges unique to federation including proposing solutions. It doesn't claim centralised servers have it solved, it just addresses additional challenges federation has.

load more comments (2 replies)

[-] whatsarefoogee@lemmy.world 18 points 1 year ago

Mastodon is a piece of software. I don't see anyone saying "phpBB" or "WordPress" has a massive child abuse material problem.

Has anyone in the history ever said "Not a good look for phpBB"? No. Why? Because it would make no sense whatsoever.

I feel kind of a loss for words because how obvious it should be. It's like saying "paper is being used for illegal material. Not a good look for paper."

What is the solution to someone hosting illegal material on an nginx server? You report it to the authorities. You want to automate it? Go ahead and crawl the web for illegal material and generate automated reports. Though you'll probably be the first to end up in prison.

load more comments (7 replies)

[-] HughJanus@lemmy.ml 11 points 1 year ago* (last edited 1 year ago)

“We got more photoDNA hits in a two-day period than we’ve probably had in the entire history of our organization of doing any kind of social media analysis, and it’s not even close,”

How do you have "probably" and "it's not even close" in the same sentence?

Here's the thing, and what I've been saying for a long time about The Fediverse:

I don't care what platform you have, if it is sufficiently popular, you're GOING to have CSAM. You're going to have alt-right assholes. You're going to have transphobia, you're going to have racism and every other kind of discrimination.

People point fingers at Meta for "allowing" this but there's no amount of money that can reasonably moderate 3 b-b-billion users. Meta, and probably every other platform that's not Twitter or False social, does what they can about this.

Masto and Fedi admins need to be cognizant of the amount of users on their instances and need to have a sufficient number of moderators to manage those users. If they don't have them, they need to close registrations.

But ultimately the Fediverse can also create safe-havens for these sorts of things. Making it easy to set up a discriminatory network that has no outside moderation. This is the downside of free speech.

[-] Grimpen@lemmy.ca 4 points 1 year ago

Heck, Truth Social uses Mastodon, IIRC.

Ultimately, it's software. Even if my home instance does a good job of enforcing it's CoC, and every instance it federated with does as well, someone else can spin up their own instance, load up on whatever, and I'll never know or even be aware if it's never federated with my instance.

[-] HughJanus@lemmy.ml 2 points 1 year ago

I think it uses SOME code

[-] clutchmatic@lemmy.world 1 points 1 year ago

People point fingers at Meta for "allowing" this but there's no amount of money that can reasonably moderate 3 b-b-billion users.

This is a prime use case for AI technology

[-] mindbleach@lemmy.world 7 points 1 year ago

"Now you have two problems."

load more comments (1 replies)

[-] Dubious_Fart@lemmy.ml 3 points 1 year ago

No, cause then you end up with a case like the guy who lost 15+ years of emails, his phone number, all his photos, his contacts, and everything else he had tied to a google account, because Googles automated detection triggered on a naked photo of their baby, that they sent to the doctor during covid, that the doctor requested, about a rash on the babies diaper area.. and no amount of common sense would stay their hand or reverse their ignorant judgement that this man was a child pornographer, and even called the police on him.

load more comments (2 replies)

[-] Serinus@lemmy.ml 10 points 1 year ago

Don't bother with the bullshit clickbait article. Honestly, don't give them the views.

The underlying study is good though, and worth reading.

https://purl.stanford.edu/vb515nd6874

[-] southsamurai@sh.itjust.works 7 points 1 year ago

These articles are written by idiots, serving the whims of a corporate stooge to try and smear any other than corporate services and it isn't even thinly veiled. Look at who this all comes from

[-] TWeaK@lemm.ee 4 points 1 year ago

The article written by WaPo and regurgitated by The Verge is crap, but the study from Stanford is solid. However, it's nowhere near as doom and gloom as the articles, and suggests plenty of ways to improve things. Primarily they suggest better tools for moderation.

[-] voluble@lemmy.world 4 points 1 year ago

better tools for moderation

Where have I heard that before?

[-] mindbleach@lemmy.world 3 points 1 year ago

The study from Stanford conflates pencil drawings of imaginary characters with actual evidence of child rape.

Half the goddamn point of saying CSAM instead of CP is to make that difference blindingly obvious. Somehow, they still missed it. Somehow they are talking about sexual abuse as if it's something that can happen to pixels.

[-] azdood85@lemmy.world 3 points 1 year ago

Its weird how this headline shows up only when other headlines start covering how popular Mastadon is now.

Coincidence? Sure smells like it. God, I love astroturfing in the morning.

[-] TrismegistusMx@lemmy.world 4 points 1 year ago

According to corporate news everything outside of the corporate internet is pedophiles.

load more comments (3 replies)

[-] Feyter@programming.dev 3 points 1 year ago

I'm always suspicious if someone argues pro Contents Filter with "protection of children" as the main argument...

[-] sugar_in_your_tea@sh.itjust.works 3 points 1 year ago

As a parent, I'm always worried about any policy with "protect the children" as the main argument. There are lots of stupid policies proposed and sometimes implemented that are justified this way, such as:

facial recognition to prevent underage kids from playing certain video games
proof of ID to access social media and porn
complicated parental controls on devices and services

And so on.

Most of these have easy ways to circumvent these rules and absolutely violate privacy, so I will be teaching my kids how to do that. In fact, once our home Internet gets fast enough, I may route all traffic through a VPN just to avoid most of these stupid rules and instead rely on trust with my kids to keep them safe on the Internet.

[-] Spiracle@kbin.social 3 points 1 year ago

Direct link to the (short) report this article refers to:

https://stacks.stanford.edu/file/druid:vb515nd6874/20230724-fediverse-csam-report.pdf

https://purl.stanford.edu/vb515nd6874

After reading it, I’m still unsure what all they consider to be CSAM and how much of each category they found. Here are what they count as CSAM categories as far as I can tell. No idea how much the categories overlap, and therefore no idea how many beyond the 112 PhotoDNA images are of actual children.

112 instances of known CSAM of actual children, (identified by PhotoDNA)
713 times assumed CSAM, based on hashtags.
1,217 text posts talking about stuff related to grooming/trading. Includes no actual CSAM or CSAM trading/selling on Mastodon, but some links to other sites?
Drawn and Computer-Generated images. (No quantity given, possibly not counted? Part of the 713 posts above?)
Self-Generated CSAM. (Example is someone literally selling pics of their dick for Robux.) (No quantity given here either.)

Personally, I’m not sure what the take-away is supposed to be from this. It’s impossible to moderate all the user-generated content quickly. This is not a Fediverse issue. The same is true for Mastodon, Twitter, Reddit and all the other big content-generating sites. It’s a hard problem to solve. Known CSAM being deleted within hours is already pretty good, imho.

Meta-discussion especially is hard to police. Based on the report, it seems that most CP-material by mass is traded using other services (chat rooms).

For me, there’s a huge difference between actual children being directly exploited and virtual depictions of fictional children. Personally, I consider it the same as any other fetish-images which would be illegal with actual humans (guro/vore/bestiality/rape etc etc).

[-] sugar_in_your_tea@sh.itjust.works 3 points 1 year ago

If we took this to its logical conclusion, most popular games would be banned. How many JRPGs have underage protagonists? How many of those have some kind of love story going on in the background? What about FPS games where you're depicted killing other people? What about fantasy RPGs where you can kill and control animals?

Things should always be legal unless there's a clear victim. And communities should absolutely be allowed to filter out anything they want, even if it's 100% legal. So the lack of clear articulation of the legal issues is very worrisome since it implies a moral obligation to remove legal but taboo content.

[-] Strangle@lemmy.world 3 points 1 year ago

Hasn’t Twitter had the same problem for years?

https://www.iwf.org.uk/news-media/news/iwf-publishes-platform-specific-data-for-child-sexual-abuse-imagery/

[-] Arotrios@kbin.social 2 points 1 year ago

This is one of the reasons I'm hesitant to start my own instance - the moderation load expands exponentially as you scale, and without some sort of automated tool to keep CSAM content from being posted in the first place, I can only see the problem increasing. I'm curious to see if anyone knows of lemmy or mastodon moderation tools that could help here.

That being said, it's worth noting that the same Standford research team reviewed Twitter and found the same dynamic in play, so this isn't a problem unique to Mastodon. The ugly thing is that Twitter has (or had) a team to deal with this, and yet:

“The investigation discovered problems with Twitter's CSAM detection mechanisms and we reported this issue to NCMEC in April, but the problem continued,” says the team. “Having no remaining Trust and Safety contacts at Twitter, we approached a third-party intermediary to arrange a briefing. Twitter was informed of the problem, and the issue appears to have been resolved as of May 20.”

Research such as this is about to become far harder—or at any rate far more expensive—following Elon Musk’s decision to start charging $42,000 per month for its previously free API. The Stanford Internet Observatory, indeed, has recently been forced to stop using the enterprise-level of the tool; the free version is said to provide read-only access, and there are concerns that researchers will be forced to delete data that was previously collected under agreement.

So going forward, such comparisons will be impossible because Twitter has locked down its API. So yes, the Fediverse has a problem, the same one Twitter has, but Twitter is actively ignoring it while reducing transparency into future moderation.

load more comments (5 replies)

load more comments

this post was submitted on 24 Jul 2023

21 points (70.6% liked)

Technology

35218 readers

273 users here now

This is the official technology community of Lemmy.ml for all news related to creation and use of technology, and to facilitate civil, meaningful discussion around it.

Ask in DM before posting product reviews or ads. All such posts otherwise are subject to removal.

Rules:

1: All Lemmy rules apply

2: Do not post low effort posts

3: NEVER post naziped*gore stuff

4: Always post article URLs or their archived version URLs as sources, NOT screenshots. Help the blind users.

5: personal rants of Big Tech CEOs like Elon Musk are unwelcome (does not include posts about their companies affecting wide range of people)

6: no advertisement posts unless verified as legitimate and non-exploitative/non-consumerist

7: crypto related posts, unless essential, are disallowed

founded 5 years ago

MODERATORS

MinutePhrase@lemmy.ml