Radek Bartyzal blog

[Video notes] How to Train an LLM in 2024

[Video notes] How to Train an LLM in 2024

ANN architecures: From Perceptron to Transformers

ANN architecures: From Perceptron to Transformers