@blobs
part 2 of the series on backpropagation, hyperparameters and evaluation metrics.
writing this gave me some basic understanding of how a LLM works, instead of thinking of a language model as a complete black box. let me know if it helps you too!
https://michaelhly.com/posts/tune-llm-two