Better HN

0 points · prepend · 1mo ago · 0 comments

How else do you want companies to remove and prevent CSAM? It seems like you must have some human involvement to train and monitor.

It’s a terrible job, I wouldn’t want to do it, but someone needs to. Perhaps one day, AI will be accurate enough to not need it, but even then you need someone to process complaints and waivers (like someone’s home photos being inaccurately flagged).

44 comments · 8 top-level

abdullahkhalids · 1mo ago · 22 in thread

CSAM exists on social media because the platforms are so large that it's not possible to moderate them effectively. To me this is a no-go. If a business is so large that it cannot respect laws, it needs to be shut down.

The correct way to organize social media is in a federated way. Each server only holds on average a few hundred or a few thousand people. Server moderators should be legally responsible for content on their server. CSAM on social media will be 100x suppressed because banning people is way easier on small servers.

Not many moderators will have to look at CSAM because the structure of the system makes it unappealing to even try sharing CSAM, knowing you will be immediately blocked.

scarmig · 1mo ago

Having tens of thousands of decentralized, independently moderated servers would result in an order of magnitude more CSAM being shared than having a few oligopolies. The abusers just have to find the weakest link, and that weakest link will have fewer resources than multi-trillion-dollar companies. You would also likely not hear many news stories about it, because the small servers won't have the expertise to even detect it.

That's a tradeoff you can choose to make, but you need to enter into it with open eyes.

bigbadfeline · 1mo ago

> Having tens of thousands of decentralized, independently moderated servers would result in an order of magnitude more CSAM being shared than having a few oligopolies.

It doesn't matter how many are shared but how many are viewed. On a small server community policing works just fine, bad actors are easier and faster to block and, to top it off, the smaller reach of each server makes it unprofitable to target multiple servers, fish for their weak points, etc. The dirty jobs become unprofitable, which is what matters most.

With the help of AI, small players can do a better job at removing CSAM.

1 more reply

freejazz · 1mo ago

>That's a tradeoff you can choose to make, but you need to enter into it with open eyes.

No it's not. It's certainly not my choice. No one asked me if it's okay for Facebook to distribute CSAM because you insist it would be worse if it didn't.

1 more reply

camgunz · 1mo ago

This isn't an either/or. X isn't the only place CSAM is, there are gazillions of other sources. It's probably the easiest place to find it tho.

devilbunny · 1mo ago

> Server moderators should be legally responsible for content on their server.

And therefore anything that is remotely questionable will be blocked. Not just kiddie porn. Pissed off a local business with a bad review? Blocked.

Child abusers are twisted people, and I really don’t care much what happens to them, but making it impossible for them to use the internet means sterilizing the whole thing.

prmoustache · 1mo ago

>And therefore anything that is remotely questionable will be blocked. Not just kiddie porn. Pissed off a local business with a bad review? Blocked.

This is already the case. There is a lot of lawful, useful, medical or educational content that is actively censored on social media because it includes words or pictures of organs, while the same social media actively encourage and develop algorithms to push underage girls (and possibly boys) posting pictures of themselves in sexual poses, attire and contexts.

Big tech and social media networks love and push CSAM, they just hide the genitals but the content really is the same.

1 more reply

abdullahkhalids · 1mo ago

You are just saying that physical life doesn't function. People get banned or removed from all sorts of informal and formal groups all the time for completely illegitimate reasons. That's just human politics embedded so deeply in our psychology it will never go away. They simply move to different groups, and similarly online they can move to a different federated server.

But that's not possible in today's oligopoly of social media. An invisible algorithm will ban you, and there is no way back, and few alternatives. Big Social Media is way worse from a sanitizing perspective than some federated social media.

1 more reply

haritha-j · 1mo ago

Also, if you've gone from zero to one of the biggest corporations in the country, and have billions to throw at the 'metaverse', I find it hard to believe that removing CSAM is where you struggle.

abdullahkhalids · 1mo ago

No. It's a legitimately difficult problem because not all naked pictures of kids are illegal. The false positive problem is bad for business, but it would also be bad in general even if big social media were benevolent.

Moderators need to actually understand the context of the picture/video, which requires knowledge of culture and language of the people sharing the pictures. It's really difficult to do that without hiring moderators from every culture in the world.

But small federated servers can often align along real world human social networks, so it's easier for the server admin to understand what should be removed.

red_admiral · 1mo ago

The amount of CSAM online is completely out of control. There's already nation-level and sometimes international cooperation to catch any known images with perceptual hashing (think: the opposite of cryptographic hashing) as well as other automated and manual tools.
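For anyone unfamiliar with the term, here is a minimal sketch of what "the opposite of cryptographic hashing" means, assuming a toy "average hash" over a 4x4 grayscale grid. Real matching systems use far more robust algorithms; the function names and pixel values below are made up purely for illustration. The point is only that a small change to the image leaves the perceptual hash unchanged, while the cryptographic hash changes completely.

```python
import hashlib


def average_hash(pixels):
    """Toy perceptual hash: one bit per pixel, 1 if brighter than the mean."""
    flat = [p for row in pixels for p in row]
    mean = sum(flat) / len(flat)
    return "".join("1" if p > mean else "0" for p in flat)


def sha256_bits(pixels):
    """Cryptographic hash of the raw pixel bytes, as a 256-character bit string."""
    data = bytes(p for row in pixels for p in row)
    return bin(int(hashlib.sha256(data).hexdigest(), 16))[2:].zfill(256)


def hamming(a, b):
    """Number of positions at which two equal-length bit strings differ."""
    return sum(x != y for x, y in zip(a, b))


# Hypothetical 4x4 grayscale image: dark left half, bright right half.
original = [
    [10, 20, 200, 210],
    [15, 25, 205, 215],
    [12, 22, 202, 212],
    [11, 21, 201, 211],
]
# The same picture, slightly brightened (as a re-encode or filter might do).
brightened = [[min(255, p + 5) for p in row] for row in original]

perceptual_distance = hamming(average_hash(original), average_hash(brightened))
crypto_distance = hamming(sha256_bits(original), sha256_bits(brightened))

print("perceptual hash bits changed:", perceptual_distance)
print("SHA-256 bits changed:", crypto_distance)
```

A matcher built on this idea would treat two images as "the same" when the Hamming distance between their perceptual hashes is below some threshold, which is what lets known images be caught after resizing or recompression.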

My impression is it would take Manhattan-Project levels of effort and funds to come close to "solving" this problem, especially without someone getting on a watchlist for having a telehealth-first primary care provider insurance plan and asking for advice on their toddler's chickenpox.

Human review? Meta already has small armies' worth of content moderators who tend to burn out with psychological problems and have a suicide rate where you're probably better off going to fight in a real war. (This includes workers hired by Sama in Kenya, to link back to the OP.)

I will reluctantly grant Meta that they're up against a really hard problem here.

1 more reply

GrinningFool · 1mo ago

Isn't this more about disincentivizing the posting of it in the first place by increasing the chances of getting banned? Once you have to remove it, it's too late.

Barrin92 · 1mo ago

>CSAM on social media will be 100x suppressed because banning people is way easier on small servers.

No it isn't. Small servers often don't have paid security or moderation, are run in anonymous fashion, and have no profit motive that can even be used to incentivize them against hosting illegal content.

That's visible when it comes to porn. There are a million bootleg porn sites hosted on the internet that show off illegal content. The only site that was ever forced to curate its content was Pornhub, because they're sufficiently large, operate in a jurisdiction that has laws and can be held accountable. From a content moderation standpoint going after a million web forums is an absolute pain in the ass compared to going after Facebook.

Which is the first argument any decentralization advocate always brings up (and they're correct to do so): censorship is harder and evasion of law enforcement easier when dealing with a network of independent actors.

red_admiral · 1mo ago

What stops Humbert Humbert from joining hundreds of small servers?

You now have 100x the total human effort for mods to review and ban him.

Aurornis · 1mo ago

> Server moderators should be legally responsible for content on their server.

So if you want to send someone to jail, just talk your way into joining their server, upload some illegal content, and report them for it?

> Not many moderators will have to look at CSAM because the structure of the system makes it unappealing to even try sharing CSAM, knowing you will be immediately blocked.

Why would someone join a server with active moderation if they wanted to share CSAM with their social media friends?

They would seek out one of those servers that was set up specifically for those groups, where it was known to be a safe space.

This is what many people don't get about federated networks: The people in those little servers DGAF if you block them. They want to be surrounded by their likeminded friends away from the rules of some bigger service like Facebook or Twitter. Federated social media is the perfect platform for them because they can find someone who set up a server in some other country with their own idea of rules and join that, not be subject to the regulations of mainstream social media.

genewitch · 1mo ago

right, and you have other users on the fediverse that notice that server leaking, and if the content is bad enough, report the service to an authority. Having all of the pedophiles and other creeps on a tiny subset of servers, isolated islands of them; well, that ought to make enforcement easier.

It also makes it relatively easy to avoid, as server admins share blocklists. I know a dozen servers offhand that i'd block if i ran another fediverse server.

Fosstodon fediverse server doesn't have this issue, for example.

I replied this way because the way you wrote it, it sounds like an indictment of a system that's designed to avoid advertisers getting user profiles, over all else.

The problem is the people who participate in this (the illegal and immoral), and not "the network."

1 more reply

devmor · 1mo ago

The one thing I can add to this conversation is that I think the government simply does not care, either. It's mainly only in response to mass public outrage, or when someone is a political target, that it gets dealt with at a law enforcement level.

Anecdotally, when I was a young adult I was a volunteer moderator for a large forum. We got reports of CSAM several times a month and had a process for escalating and reporting it to the FBI IC3 - we retained a lot of information about the users that posted it.

One of the administrators of the website mentioned to me that over the years since the inception of the forum, they'd reported almost a thousand incidents of CSAM distribution - and the FBI followed up with them to get information less than 10 times in total.

devilbunny · 1mo ago

That seems reasonable though. The FBI isn’t interested in busting one perv in a closet, they want the ones making the stuff.

1 more reply

2ndorderthought · 1mo ago

Yep. If you cannot both safely and legally provide the thing you are selling, you are no longer a legitimate company; you are a criminal enterprise profiting off of exploitation.

esyir · 1mo ago

If car manufacturers cannot bring car-related deaths to zero, they too should no longer be legitimate companies.

3 more replies

Yokohiii · 1mo ago

I am not sold on the federated thing to solve CSAM or similar issues.

Actually, companies should be bullied about privacy and copyright so they are unable to share any content at scale with third parties. Then they would have to solve it on their own and be forced to realize their business model is shit.

muglug · 1mo ago

> Banning people is way easier on small servers

Big “citation needed” here. My bet is that Meta have far better moderation systems than any other social media company on the planet.

genewitch · 1mo ago

when i ran a fediverse server for myself and 3 people, but allowed public signups if someone came by, it was very easy to ban people, and very easy to null-route entire swaths of the fediverse, because i didn't want their content on my service.

That's more what i got from that pull-quote. I know a company that has hundreds of individual forums, and those are all moderated quickly and correctly (last i heard). They're moderated so effectively they often get DDoSed by Russian IPs for banning users for scam posts from that country.

SlinkyOnStairs · 1mo ago · 12 in thread

> How else do you want companies to remove and prevent CSAM?

Different situation.

Facebook has to do CSAM moderation because it's a publishing platform. People will post CSAM on Facebook, so they must do moderation.

And "just don't have Facebook" isn't a solution because every publication of any sort has to deal with this problem; any newspaper accepting mail has this problem (albeit a much more scaled-down version). People have been nailing obscene things to bulletin boards for all of recorded history.

---

In contrast, OpenAI has no such problem. It did not have CSAM pushed onto it, it actively collected such data itself. It could have, at any point before and after, simply stopped scraping all of the web indiscriminately and switched to using more curated sources of scraped data.

The downside would be "worse LLMs" or "LLMs being created later", which is a perfectly acceptable compromise.

---

This is not to say that genuine content-flagging firms have no reason to curate such data & build tools to automatically flag content before human moderators have to. (But then they also shouldn't be outsourcing this and traumatizing contract workers for $2-3 an hour.)

But OpenAI is not such a firm. It's a general AI company.

GrinningFool · 1mo ago

> traumatizing contract workers for $2-3 an hour)

Is there an hourly rate at which this should be acceptable?

SlinkyOnStairs · 1mo ago

There's no dollar amount, but proper support during and after employment is a minimum, and a large paycheque will both offset some of the human cost and make it easier for people to be pushed to quit the job, so that they aren't doing it for too long.

The current support systems for police in this subject are already insufficient. Facebook's treatment of their moderation staff is abhorrent. The point of including the pay figure is to further illustrate just how damning this subcontracting practice is.

arw0n · 1mo ago

There is labor that is necessary for our societies to function, but is a direct threat to the people doing the work. Someone has to do it, and it should be seen as a great service to society and rewarded accordingly. In a just world, we would be paying significantly extra for threats to health that come from work. In the one we are currently in, we use the threat of worse harm instead.

1 more reply

bonesss · 1mo ago

We have coal miners destroying their bodies and lungs, cobalt mining slavery, cocoa nut child labour and de facto slavery, sex workers, CPS investigators, first responders, and doctors with high rates of suicide…

Not only is there an acceptable market rate for trauma, it’s sometimes competitive and requires licensing.

1 more reply

genewitch · 1mo ago

Emergency Department^ doctors, what do they make? take the people who have to review the worst humanity has to offer and pay them that. and while we're at it, ambulance personnel should get a huge pay bump. Take it from nurses' pay.

^ i originally said "triage doctors" but i meant the resident ER doc.

2 more replies

expedition32 · 1mo ago

Rookie police officers in my country are paid 2500 euro per month and they have to deal with the underbelly of society.

They have access to better counselling and are ostensibly trained for the job. But there are still suicides.

fragmede · 1mo ago

OpenAI runs ChatGPT where users submit text and photos and OpenAI generates and sends text and photos back. So users could be submitting CSAM. And yes, OpenAI could be generating CSAM. It's not limited to being a pull operation. What am I missing?

SlinkyOnStairs · 1mo ago

What you're missing is that they're "separate" parts of the business.

The core Facebook product is users' posts. It's not possible to separate those two. Nor can one downscale Facebook in a way that stops the problem: as mentioned, Facebook has this problem because it's one we've had since the medieval days of the town bulletin board.

With OpenAI, the way ChatGPT was built and user submissions are separate things. The GPT models could have been trained without this mess. OpenAI could be more selective in what data it scrapes.

While OpenAI cannot stop users sending god knows what in their prompt text and images, OpenAI can choose to not interact with that data beyond the minimum legal retention, by e.g. not using it for training the next generation of models. This would massively downscale the problem.

AI output is another such problem, where A) Maybe this'd be less of a problem if they didn't recklessly include a bunch of CSAM into the training data by accident, and B) LLMs just aren't the kind of fundamental human right that "having a public opinion" is. It would be fine if they were less good, invented years later, or even not invented at all.

The main counterargument to the latter has been the "But China is inventing evil AI" spiel, which is fairly weak. If China builds an orphaned baby crushing machine, we do not need to build an orphaned baby crushing machine of our own. (And the reality is that China is only chasing AI so aggressively because the west does. They're reasonable people, it would have been entirely possible for both the west and China to make a mutual "no orphan crushing" agreement and just accept slower rollout of technology. This is exactly what has been done with human genetic engineering, and China did in fact enforce these norms.)

prepend (OP) · 1mo ago

People upload images to OpenAI and have it generate and modify. And it has to not generate CSAM.

I guess that they process billions of images every day.

I don’t think they’re getting CSAM from scraping (thankfully, I expect there isn’t much publicly available CSAM).

They aren’t as big as Facebook, but they must have this functionality or many users will be hurt.

BobbyJo · 1mo ago

> In contrast, OpenAI has no such problem. It did not have CSAM pushed onto it, it actively collected such data itself. It could have, at any point before and after, simply stopped scraping all of the web indiscriminately and switched to using more curated sources of scraped data.

You've just thrown the garbage over your fence. Instead of OpenAI contracting Sama to classify CSAM, the "Curators" have to.

At the end of the day, someone needs to classify it. If you say the platforms need to, and they miss some, and it ends up in OAI training data, OAI is going to be the entity paying the price.

sahilagarwal · 1mo ago

Not really different. They would need to report CSAM if it is ever uploaded by a user.

Any website that allows users to upload videos needs some sort of service that can identify and report CSAM.

deaux · 1mo ago

This is of course incredibly illegal, but megacorps (by valuation) and oligarchy members are above the law so who cares. I assume there could be a regulatory framework which can make this legal for an extremely specific purpose, but there is zero chance that OpenAI was part of this/abiding by this in 2022, absolutely none.

IncreasePosts · 1mo ago · 2 in thread

Couldn't you just use multiple classifiers? Like an "is a minor" classifier coupled with an "is sexual content" classifier?
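As a rough sketch of what that combination could look like: flag an item only when both scores clear a threshold. Everything here is hypothetical (the scores, the 0.9 thresholds, the function names), and the false-positive arithmetic at the end only holds if the two classifiers make independent errors, which is a strong assumption that may well not hold in practice.

```python
def should_flag(minor_score, sexual_score,
                minor_threshold=0.9, sexual_threshold=0.9):
    """Flag for review only when BOTH hypothetical classifiers are confident."""
    return minor_score >= minor_threshold and sexual_score >= sexual_threshold


def combined_false_positive_rate(fpr_a, fpr_b):
    """False-positive rate of the AND rule, ASSUMING independent errors."""
    return fpr_a * fpr_b


# Made-up classifier scores in [0, 1] for three items.
items = {
    "item_1": (0.95, 0.97),  # both confident, so flagged
    "item_2": (0.95, 0.10),  # only the first classifier fires
    "item_3": (0.20, 0.99),  # only the second classifier fires
}

for name, (minor_score, sexual_score) in items.items():
    print(name, should_flag(minor_score, sexual_score))

# With independent errors, 1% and 2% false-positive rates combine to about 0.02%.
print(combined_false_positive_rate(0.01, 0.02))
```

The AND rule trades recall for precision: an item that either classifier misses is never flagged, so the miss rates add up even as the false positives shrink.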

superfrank · 1mo ago

How would you test that that works?

IncreasePosts · 1mo ago

There are databases of known child porn available for this kind of work.

Yokohiii · 1mo ago

These workers prepare data for AI. I don't think the need for them will go away anytime soon.

Westerners are too expensive and unwilling to do it. AI is a business model that requires poverty and extreme inequality to function. Yes, other businesses do that too, but they don't claim to be a solution to everything while actually having very special human requirements.

frm88 · 1mo ago

This is the Swedish newspaper report quoted in the submitted article: https://www.svd.se/a/K8nrV4/metas-ai-smart-glasses-and-data-...

There are more reasons why these jobs are located in developing countries; it's not only the price of labour. Imagine for a second that these annotations had to be done in the US. The public outrage would probably be audible across the Atlantic. This is another form of imperialism.

duxup · 1mo ago

I agree that there’s no good way to do this other than like… no user-generated content ever, or just ban everyone for their baby pics etc. … and nobody can post them.

Granted, the latter is kinda happening distantly on YouTube, where you can’t talk about “suicide” so everyone self-censors…

freejazz · 1mo ago

I don't understand why their size is an excuse for them to not remove and prevent CSAM.

cnd78A · 1mo ago

> It’s a terrible job

You must be extremely privileged to think that way. Even as someone in the EU, I would be glad to do it for the minimum salary. For your info, a terrible job for most humans is one that is so physically hard it destroys your health. That said, like many people, I would find it much more interesting than many boring jobs. [If someone reads this, please hire me for this; in exchange I would work the first 5 hours for free.]
