Follow up to the proposal for working on GDPR issues and moderation dashboard for Lemmy.

rglullis@communick.news · 8 months ago

Follow up to the proposal for working on GDPR issues and moderation dashboard for Lemmy.

Max-P@lemmy.max-p.me · 8 months ago

Aren’t you also supposed to ensure that the third party handling the PII is GDPR compliant before the user consents to sharing it? Pretty sure my work training said so, but they could be erring on the safe side.

If not, that sounds like a giant loophole: you could just ask for consent, funnel all the data out of reach of the GDPR, and do all the analytics and profiling you want. Like, when Threads joins, what’s stopping them from swallowing all your users’ data? They can get it, they’re implicitly allowed to process it, and yet the data is now unencumbered by any further consent requests from the user. They don’t even have a way of knowing if the user is potentially from the EU.

Meta would of course be obligated to delete the data if the user goes to them and requests it to be deleted, but the user might not even know Meta’s processing their data, and there are a lot of privacy enthusiasts on Lemmy.

How can a user possibly consent to this properly, other than practically waiving their GDPR rights, which the law doesn’t allow?

Is there any new documentation around on that topic from actual lawyers analyzing the implications? It feels like everything about the GDPR I see is opinions and personal interpretations of the law, which may be biased towards “it’s probably okay” as obviously we all want the fediverse to succeed.

In particular, ActivityPub pushes the data out for the most part, so one can’t argue “well I can’t stop people scraping my site illegally”. One could instead argue that instance admins should vet new instances before opening the data firehose.

It feels very much like, depending on the case and who got harmed how, a judge could decide the admins should have put technical safeguards in place. I mean, we’re in the era of holding porn sites responsible for letting minors access the site and demanding they ID everyone to make sure. Lawmakers barely understand technology, let alone something like the Fediverse. I could see things go sour real fast.

rglullis@communick.news · 8 months ago

User generated content != PII.

Like, when Threads joins, what’s stopping them from swallowing all your users’ data?

What’s stopping you (or anyone else) from just bypassing authorized fetch and swallowing the data stream from anyone?

Max-P@lemmy.max-p.me · 8 months ago

User generated content != PII.

Aren’t the usernames an identifier and therefore PII? As far as I understand you can’t even use a cookie or the user’s IP to determine unique visitors on a site because it identifies the user personally.

On the fediverse, every comment, every vote, every moderator action is completely public, and tied to the username. Unless the username is a throwaway and the user never ties it to their real identity in any way, that builds a ridiculously detailed profile of the user’s habits online. And even then, you get enough of a profile that I don’t doubt Google or Meta could easily connect it to the profile they already have on you, unless you’re actively using a different persona.

It’s all completely public and available to anyone that wants it.

It’s even worse: images aren’t proxied right now, so you can actually tie a username to an IP rather easily if the user doesn’t use a VPN or block outside resources by default.

[Image: Your IP]

Not exactly a new threat to be fair, but really the only thing not being broadcast everywhere about the user is their email address.

I guess the best one can do is clearly inform the user about the risks involved and honor incoming deletion requests properly, but man, if a child gets abused on the fediverse and you can barely yank the content, I can see a judge ruling that the fediverse as a whole is reckless.
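For illustration, the image-proxying idea mentioned above could look roughly like the sketch below: before a comment is rendered, every external Markdown image URL is rewritten to go through the reader's home instance, so the remote host only ever sees the instance's IP. The `PROXY_PREFIX` endpoint, the `HOME_HOST` value, and the `proxy_images` helper are hypothetical names made up for this example; this is not Lemmy's actual implementation.

```python
import re
from urllib.parse import quote, urlsplit

# Hypothetical proxy endpoint and hostname (assumptions, not Lemmy's real routes).
PROXY_PREFIX = "https://instance.example/image_proxy?url="
HOME_HOST = "instance.example"

# Matches Markdown images of the form ![alt](http://... or https://...)
IMAGE_PATTERN = re.compile(r"!\[([^\]]*)\]\((https?://[^)\s]+)\)")


def proxy_images(markdown: str) -> str:
    """Rewrite external image URLs so they are fetched via the home instance."""

    def rewrite(match: re.Match) -> str:
        alt, url = match.group(1), match.group(2)
        # Images already hosted on the home instance need no proxying.
        if urlsplit(url).hostname == HOME_HOST:
            return match.group(0)
        # Percent-encode the whole original URL so it survives as a query value.
        return f"![{alt}]({PROXY_PREFIX}{quote(url, safe='')})"

    return IMAGE_PATTERN.sub(rewrite, markdown)


print(proxy_images("![Your IP](https://tracker.example/ip.png)"))
```

This only covers Markdown image syntax; link previews, avatars, and raw HTML would need the same treatment for the protection to hold.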

What’s stopping you (or anyone else) from just bypassing authorized fetch and swallowing the data stream from anyone?

Exactly.

rglullis@communick.news · 8 months ago

To my understanding, the key part is that you are supposed to disclose any type of information that you are sharing with third parties through back channels.

If you set a third-party tracking cookie on your site, then yes, the third party can use the cookie to correlate users from different sites. But if you do what you just did and place an image that displays the IP, how can any third party access this information? You have my IP and a request log, so what? Is there any way that another Lemmy instance can use this to identify me?

On the fediverse, every comment, every vote, every moderator action is completely public, and tied to the username.

And distribution/collection of public information is not what the GDPR is trying to regulate!

Kayn@dormi.zone · 8 months ago

Can you show where the GDPR excludes public information? Because if it doesn’t, and the information can uniquely identify a person, then it’s still subject to this regulation.

rglullis@communick.news · 8 months ago

Let’s say you go to a public forum and ask “please remove my PII”. To comply, they don’t need to remove your comments and posts, they just need to remove your username. Granted, the website owner might have a policy of deleting all the content, but you’ll have a hard time arguing before the legal system that they are not complying with the GDPR if they delete only the one thing that actually identifies you uniquely.
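A minimal sketch of that approach, assuming a simplified in-memory model: the `Comment` type, the `PLACEHOLDER` value, and the `scrub_author` helper are hypothetical and do not reflect how Lemmy or any particular forum stores its data. The author field is replaced while the comment body is kept as written.

```python
from dataclasses import dataclass, replace

# Hypothetical placeholder shown instead of the removed username.
PLACEHOLDER = "[deleted]"


@dataclass(frozen=True)
class Comment:
    author: str
    body: str


def scrub_author(comments: list[Comment], username: str) -> list[Comment]:
    """Return a copy of the comments with `username` replaced by a placeholder.

    Only the author field changes; the body text is left untouched.
    """
    return [
        replace(c, author=PLACEHOLDER) if c.author == username else c
        for c in comments
    ]


thread = [
    Comment(author="alice", body="First post."),
    Comment(author="bob", body="A reply."),
]
scrubbed = scrub_author(thread, "alice")
```

Note that this deliberately does nothing about identifying details inside the body text itself, so whether it is enough depends on what the user actually wrote.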

Kayn@dormi.zone · 8 months ago

But what if some of my comments include information that can uniquely identify me?

That can be something like “message me on Matrix at …”

rglullis@communick.news · 8 months ago

It’s not “uniquely identifiable”. What if someone else writes your information as their own?

Again, I feel like there is a lot of conjecture here, when the best thing to do would be to get an actual lawyer to write a report indicating what about Lemmy today is in violation of the GDPR. For all the crying around it, I’d bet that the issues are not insurmountable, and I think that we should stick with common sense: those who care about actual privacy should not be using a social media platform anyway, and they should always treat anything they put online as something that will never be deleted and will stay available to any sufficiently motivated actor.