Part 4: Modern Architectures and Large Language Models

Explore the transformer architecture, pre-trained language models, fine-tuning techniques, efficient inference, and vision transformers.

Chapters in This Part

Self-attention, multi-head attention, positional encoding, and transformer blocks.

BERT, GPT, tokenization, and the Hugging Face ecosystem.

LoRA, RLHF, DPO, and parameter-efficient fine-tuning.

Flash attention, MoE, state space models, and inference optimization.

ViT, Swin Transformer, and vision-only applications.