@shoni.eth
so i'm running all the data analysis off my mac studio 'cause compute is too expensive for my budget, and i guess all i can provide as a service is huggingface releases.
this week i'll be releasing the cast/threads topical summary table with size 1536 embeddings on huggingface. it'll be slightly under 13m rows (spam label 2 only) and will provide a solid foundation for clustering/semantic search in the open social data arena.
Creative Commons Attribution 4.0 license (?)