From the paper everyone's bitching about:

> Since the Mastodon user may be unaware of their data being public and reusable for research purposes we disposed of the information about the users and we fully anonymized them by hashing the Mastodon user identifier.

very heavy handed sarcasm 

@djsumdog "From the paper everyone's bitching about:" you have such a way with words


@joshua Yea, sorry. I shouldn't have put it that way, especially without going over the data or artifacts first. I should know better than gut reactions.

I still feel like it's fine to scrape anything public. That's how works and it's kinda important to preventing censorship .. but it can also be used in call-out culture 🤷‍♂️ .. this paper is honestly crap anyway. They do trivial analysis. There's no depth or understanding of how ActivityPub works in it.

@djsumdog they included the urls of every post tho, which means that the usernames are still there.
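
A minimal sketch of the point above, assuming the dataset stores a hashed user identifier next to the original post URL, and assuming the common Mastodon URL shape `https://<instance>/@<username>/<post id>`. The record layout, the function names, and the `mastodon.example` instance are hypothetical and not taken from the paper; SHA-256 stands in for whatever hash the authors used.

```python
import hashlib
from typing import Optional
from urllib.parse import urlparse


def hash_user_id(user_id: str) -> str:
    """Hash a user identifier (SHA-256 is an assumption, not the paper's method)."""
    return hashlib.sha256(user_id.encode("utf-8")).hexdigest()


def handle_from_post_url(url: str) -> Optional[str]:
    """Recover '@user@instance' from a post URL shaped like https://instance/@user/id."""
    parsed = urlparse(url)
    parts = [p for p in parsed.path.split("/") if p]
    if parts and parts[0].startswith("@"):
        return f"{parts[0]}@{parsed.netloc}"
    return None


# Hypothetical record: the user id is hashed, but the post URL is kept verbatim.
record = {
    "user": hash_user_id("12345"),
    "url": "https://mastodon.example/@alice/100000000000000001",
}

# The hash never has to be reversed; the URL alone names the account.
print(handle_from_post_url(record["url"]))
```

If the URLs ship with the data, hashing the identifier column hides nothing, since the handle can be read straight out of each URL.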

@lemememeringue Yea .. this paper is kinda crap. They're not very knowledgeable. They're not doing anything really deep either; they're going purely off self-marked content. There's nothing really insightful here.

I didn't realize that about the URLs. They don't mention in the paper where they publish the artifacts. Someone had this pastebin of just the instance info earlier:

I should probably look at how they anonymized stuff before I say anything else.

@djsumdog wow amazing what do people expect 'public' means anyway
