@dwr
Let's say you have a corpus of text — 10 million words — about a specific topic.
1. What's the best way to "train a model" on that text?
2. Is that even the right term? Or is it using an existing foundational model and then augmenting it? Fine-tuning it? Something else?