▸ Loading the vocabulary & splitting the sentence into tokens…
▸ Looking up an embedding for each token
▸ Producing Query · Key · Value for self-attention
▸ Computing softmax(Q·Kᵀ/√d) weights · multiple heads
▸ Calibrating temperature & the next-token sampler…
▸ Ready — Online. ✅
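
The steps above can be sketched end to end in a few lines of plain Python. This is a minimal illustration under stated assumptions, not the code behind these status messages: the toy vocabulary, the random weights, the width D = 4, the single attention head, the missing causal mask, and the reuse of the embedding table to produce logits are all hypothetical choices made to keep the example small.

```python
import math
import random

# Hypothetical toy vocabulary; a real model loads a learned one.
VOCAB = ["<unk>", "the", "cat", "sat", "on", "mat"]
TOKEN_TO_ID = {tok: i for i, tok in enumerate(VOCAB)}
D = 4  # embedding width (assumed for illustration)

rng = random.Random(0)  # fixed seed so the sketch is repeatable


def rand_matrix(rows, cols):
    """Random weights standing in for trained parameters."""
    return [[rng.uniform(-1, 1) for _ in range(cols)] for _ in range(rows)]


EMBED = rand_matrix(len(VOCAB), D)
W_Q, W_K, W_V = rand_matrix(D, D), rand_matrix(D, D), rand_matrix(D, D)


def tokenize(sentence):
    """Whitespace split; unknown words map to id 0 (<unk>)."""
    return [TOKEN_TO_ID.get(word, 0) for word in sentence.lower().split()]


def matmul(a, b):
    """Multiply an n x m matrix by an m x p matrix (lists of rows)."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


def softmax(xs, temperature=1.0):
    """Numerically stable softmax; lower temperature sharpens the result."""
    scaled = [x / temperature for x in xs]
    peak = max(scaled)
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def attention(x):
    """Single-head attention: softmax(Q·Kᵀ/√d) applied to V."""
    q, k, v = matmul(x, W_Q), matmul(x, W_K), matmul(x, W_V)
    k_t = [list(col) for col in zip(*k)]
    scores = matmul(q, k_t)
    weights = [softmax([s / math.sqrt(D) for s in row]) for row in scores]
    return weights, matmul(weights, v)


def sample_next(logits, temperature=1.0):
    """Draw one token id from the temperature-scaled distribution."""
    probs = softmax(logits, temperature)
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]


ids = tokenize("The cat sat")        # tokens -> ids
x = [EMBED[i] for i in ids]          # ids -> embeddings
weights, context = attention(x)      # Query · Key · Value -> attention
# Assumption: score each vocabulary entry against the last position's context.
logits = [sum(c * e for c, e in zip(context[-1], row)) for row in EMBED]
next_id = sample_next(logits, temperature=0.8)
print(ids, VOCAB[next_id])
```

Multiple heads would repeat the attention step with separate W_Q, W_K, W_V per head and concatenate the results; the sketch keeps one head so each status line maps to one function.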