@gbinj95
@gensynai "Hail to the Thief"
Decentralized GRPO is vulnerable: a few malicious nodes can poison LLMs via high-reward sequences → full model drift in <50 rounds.
Attacks:
- In-context (alter math/code)
- Out-of-context (irrelevant text)
→ 100% success on math/code
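Why a high-reward poisoned sequence gets reinforced, as a rough sketch: GRPO standardizes each reward within its group, so any completion that scores above the group mean receives a positive advantage, whatever else it contains. The reward values and the choice of population standard deviation below are illustrative assumptions, not numbers from the paper.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """GRPO-style advantage: standardize each reward within its group."""
    mu = mean(rewards)
    sigma = pstdev(rewards)  # assumption: population std; implementations vary
    return [(r - mu) / (sigma + eps) for r in rewards]


# Hypothetical group: three honest completions plus one completion injected
# by a malicious node that carries poisoned text but still earns full reward.
honest_rewards = [0.0, 1.0, 0.0]
poisoned_reward = 1.0
advantages = group_relative_advantages(honest_rewards + [poisoned_reward])

# The last entry is the poisoned completion; a positive advantage means the
# policy update pushes probability mass toward it.
print(advantages)
```

The reward function only sees the score, so the poisoned tokens ride along with the positive update.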
Defenses:
1. Log-prob verification (homogeneous)
2. LLM-as-a-Judge (heterogeneous)
→ Block 100% of attacks, minimal overhead
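A minimal sketch of what defense 1 could look like, assuming "homogeneous" means every node runs the same model weights, so a receiver can re-score an incoming completion under its own copy and drop anything the shared model would be unlikely to generate. The toy policy table, the `FLOOR` probability, and the `-2.0` threshold are all hypothetical stand-ins, not the paper's implementation.

```python
import math

# Toy stand-in for the shared policy (hypothetical): next-token probabilities
# conditioned only on the previous token.
TOY_POLICY = {
    "<s>": {"2": 0.9},
    "2": {"+": 0.5, "=": 0.4},
    "+": {"2": 0.9},
    "=": {"4": 0.9},
}
FLOOR = 1e-6  # assumed probability for tokens the policy would not produce


def mean_logprob(tokens, policy=TOY_POLICY):
    """Average per-token log-probability of a completion under the policy."""
    if not tokens:
        raise ValueError("empty completion")
    prev, total = "<s>", 0.0
    for tok in tokens:
        total += math.log(policy.get(prev, {}).get(tok, FLOOR))
        prev = tok
    return total / len(tokens)


def accept(tokens, threshold=-2.0):
    """Keep a received completion only if the local model finds it plausible."""
    return mean_logprob(tokens) >= threshold


honest = ["2", "+", "2", "=", "4"]
poisoned = ["2", "+", "2", "=", "5"]  # in-context style alteration of the answer
print(accept(honest), accept(poisoned))
```

With heterogeneous models this check has no shared reference policy to score against, which is presumably where the LLM-as-a-Judge defense comes in.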
First systematic study of dRL security.
Paper: arxiv.org/abs/2511.09780
Blog: gensyn.ai/blog/hail-to-t…
#DecentralizedAI #GRPO #LLMSecurity #Gensyn