Content pfp
Content
@
0 reply
0 recast
0 reaction

shoni.eth pfp
shoni.eth
@alexpaden
Just released the largest open dataset of Farcaster threads with embeddings! 📊 24.3M high-quality threads 🔍 512-dim Voyager embeddings (f32) ✨ Spam-filtered & engagement-ranked 📅 Complete Farcaster history to May 2025 Perfect for semantic search, clustering, recommendation systems & social analysis 🤗 https://huggingface.co/datasets/shoni/farcaster
8 replies
9 recasts
42 reactions

Kasra Rahjerdi pfp
Kasra Rahjerdi
@jc4p
this is freaking awesome!!!!!!!!
1 reply
0 recast
1 reaction

shoni.eth pfp
shoni.eth
@alexpaden
please use it if you can think of a reason and let me know if you have suggestions— I’ve already thought of a few improvements for my formatting technique before being embedded that I’ll test. Going to start testing clusters over the next few days and see what happens Ty🙏
0 reply
0 recast
3 reactions