Dropsitenews published a list of websites Facebook uses to train its AI on. Multiple Lemmy instances are on the list as noticed by user BlueAEther

Hexbear is on there too. Also Facebook is very interested in people uploading their massive dongs to lemmynsfw.
Full article here.
Link to the full leaked list download: Meta leaked list pdf


Definitely called this. Can we have private voting now? These people are scraping the fediverse and the current state of things is a privacy nightmare.
You cannot have private voting. The Fediverse is open, that information has to be shared for it to work unless you want to make it more open to vote manipulation.
Even the PieFed implementation wasn’t great, basically giving every user a second account that sends the vote instead.
Vote manipulation only matters if votes matter. Just make down votes placebo or get rid of them entirely. There are other engagement metrics to use for sorting. Just make votes a small portion of a bigger algorithm and it dilutes the problem away. On the other hand, it seems like a ton of people on here outright refuse to consider that this is a problem, and are I stead choosing to live with their head in the sand.
Either way, right now public voting does nothing to stop vote manipulation, it just gives the sockpuppet and astroturfing accounts great feedback to target certain demographics.
The piefed implementation was a great compromise imo, and the only reason it was abandoned was idiotic forum politics. It did exactly what it set out to do - provide a layer of protection against large scale data mining and long term storage, and added a significant barrier to vote stalking, while still leaving mechanisms to ban voting agents.
I don’t want engagement metrics, I want the collective opinion of users.
People may engage may more with content they dislike, that doesn’t mean they want it to be on the front page.
Once people stop expecting privacy from an open publicly broadcasting platform the better.
So your argument is that meaningless internet points are more important than user privacy? I just want to make sure we have that on record.
The quickest path to enshitification of the fediverse is precisely this kind of large scale scraping and data mining. There are extremely simple ways to avoid this but the collective admin cohort has decided they like this tiny bit of internet power over innovation, because innovation is a tiny bit more difficult.
There is no user privacy on an open system. Just as there is no privacy when you walk down the street. If you want privacy go into your house and talk (use signal or any other privacy app).
Likewise peoples opinions are not meaningless.
The enshitification of the fediverse will come from corporate or so aligned instances that play it safe for brand. The scraping is irrelevant. Enshitification is a social issue, not a technical one.
There is no privacy, or there can’t be privacy?
By intent there is none, and it should remain that way. This works on public openness, everything needs to be visible not further hidden away out of our reach on our platform.
Well that is surely not my intent.
i was surprised how we vote left the instance. smh just send a count