Radek Bartyzal blog

[Video notes] How to Train an LLM in 2024

ANN architectures: From Perceptron to Transformers
