i write about things i find interesting, mostly deep learning, neural nets and insights from papers.

how neural networks think at scale

features, neurons, polysemanticity and superposition

mathematical intuition for transformers

embeddings, attention and the math behind the transformer architecture

notes on neural nets

backpropagation, regularization, cross-entropy loss and more