@nicolay
The more specific your task and the more it diverges from being a general purpose chatbot, the more likely you are to get good results from finetuning vs prompting alone. With the caveat, if large models completely fail, even a finetuned model will likely fail.