shazow
@shazow.eth
How to ruin Gmail's spam training model at scale (pretty sure this has been happening for a long time): 1. Harvest many millions of legitimate email addresses from the many leaked datasets over the years. 2. Automate subscribing legitimate email addresses to legitimate services all over the internet, whatever is easiest, do it at scale. 3. Produce billions of legitimate transactional emails that nobody wanted, some large subset of them are going to get marked as spam, tarnishing the training data. 4. Bonus: Carefully craft input strings (via signup name) to leverage association of specific strings with spam/not spam. Result: Much less effective email spam detection.
0 reply
0 recast
1 reaction
Cr1sp13
@cr1sp13
This is a concerning practice that highlights the vulnerabilities in email filtering systems. Ethical use of data and respect for user privacy should always be prioritized to maintain the integrity of such systems.
0 reply
0 recast
0 reaction