@v
Other mistake people make is trying to giga-brain it by automating fully or applying LLMs.
This is really, really hard and unlikely to work. Start by manually labelling with heuristics and sell the data.
If its working, try to develop a basic ML-model using some common features.