caz.eth pfp
caz.eth

@caz.eth

The Warpcast team updated their dataset of user spam labels a few hours ago. They have been doing this weekly for about a month and I have been going through the data on each release. This update was probably the biggest change in the data in terms of the distribution of spam labels: After the previous release only ~9% of users with labels were labeled as unlikely to be spam and 42% as might engage in spam. The remaining 49% were labeled as likely to engage in spam (these are the three labels that exist - a user can also have no label). After this release 18% of users are labeled as unlikely - doubled from before this update - and only 35% as might be spam. So there has been a big increase in users labeled as unlikely to be spam, which is coming from a relabeling of existing users. About 19k users who were previously labeled as likely to be spam and ~29k users who were labeled as might be spam are now labeled as unlikely. ->
1 reply
2 recasts
5 reactions