1. makemore by Andrej Karpathy
  2. minbpe by karpathy
  3. attention? attention! Lilian Weng ![[Attention_Attention.pdf]]
  4. gpt-2 again by karpathy
  5. llama3 from scratch by naklecha
  6. llm training in simple, raw by c/cuda karpathy
  7. decoding strategies in large language models mlabonne
  8. how to make llms go fast by vgel ![[How to make LLMs go fast.pdf]]
  9. a visual guide to quantization maarten![[A visual guide to quantization.pdf]]
  10. extending the RoPE by eleutherai ![[Extending the RoPE.pdf]]
  11. the novice's llm training guide by alpin ![[The Novice LLM Training Guide.pdf]]
  12. a survey on evaluation of large language models paper ![[2307.03109v9.pdf]]
  13. mixture of experts explained huggingface ![[Mixture of Experts Explained.pdf]]
  14. vision transformer by aman-arora ![[Vision Transformer.pdf]]
  15. clip, siglip and paligemma by umar-jamil