kbc
@kbc
@rileybeans As a researcher I'm conflicted about this article you shared in your tg channel: https://gizmodo.com/researchers-dump-2-billion-scraped-discord-messages-online-2000605471 GDPR has a clause for using secondary data and publicly available data for research purposes. I don't know if that allows for publishing the data. If you ever tweeted between 2010 and 2023, chances are high your tweet has been analysed by some scientist. There's also a push for researchers to publish their dataset with their paper so that others can rerun the analysis and check for issues. IMO, the scraping of the data is an issue if the Discord server admin was not aware of it. The proper way to do this is to include in the guidelines that this community takes part in research project XYZ and that by joining the server you agree to participate.
rileybeans
@rileybeans
and thank you for calling it out! really appreciate it! always happy to expand on the short comments or reasons for sharing
rileybeans
@rileybeans
the reporting says that users and server admins were NOT made aware of said scraping. but otherwise yes, I generally agree. that's why I said in my comment that people shouldn't really be using public servers.