Curriculum overview

Learn AI the way professionals actually use it.

This roadmap starts with basic vocabulary and grows into the topics that matter for real AI jobs: paper reading, Transformer basics, LLM behavior, retrieval systems, agents, data pipelines, training, fine-tuning, and infrastructure. It is intentionally selective, so beginners can focus on useful professional skills instead of unnecessary detours.

Learn tokens, vectors, embeddings, attention, positional information, and why Transformer architecture changed sequence modeling.

LLM engineeringML researchAI product work

Vocabulary
Transformer intuition
Attention math
Paper reading

Understand next-token prediction, context windows, decoding, prompting, instruction tuning, model limits, and evaluation basics.

AI applicationsModel evaluationPrompt systems

Token prediction
Prompt behavior
Context limits
Evaluation basics

Build systems that retrieve outside knowledge, rank evidence, cite sources, and reduce hallucinations with measurable grounding.

SearchSupport assistantsEnterprise knowledge

Embeddings
Chunking
Retrieval
Grounded answers

Design model workflows that plan, call tools, inspect results, recover from errors, and stop safely.

AutomationResearch workflowsOperations tools

Tool schemas
Agent loops
Tracing
Safety boundaries

Prepare, clean, version, and evaluate the data that powers retrieval, training, fine-tuning, and production feedback loops.

RAGFine-tuningEvaluation

Data cleaning
Versioning
Labeling
Quality checks

Understand datasets, loss, optimization, checkpoints, validation, regression tests, and how model behavior changes during training.

ML engineeringFine-tuningModel research

Loss curves
Checkpoints
Validation
Regression tests

Learn when to adapt a model with examples, how to format training data, and how to compare the result against prompting or RAG.

Domain assistantsStyle controlTask specialization

Training data
Eval sets
Regression risk
Deployment checks

Operate AI features with serving, latency, caching, monitoring, tracing, cost controls, and incident response.

Production appsPlatform teamsReliability work

Serving design
Caching
Monitoring
Cost model

Create a tiny evaluation harness that sends the same task examples to two prompts or models, scores the outputs, and prints a pass/fail report.

npm run eval:first-llm

All starter examples run without changing the evaluation code, and the harness is ready to expand toward 10 or more cases.
The report shows baseline score, candidate score, and every failed case.

Build a local question-answering app over a small document folder, retrieve source chunks, and answer only when evidence is available.

npm run rag:dev

Questions with known answers cite the right source document.
Out-of-scope questions return a refusal instead of an invented answer.

Build a step-limited agent with one read-only tool, strict argument validation, structured observations, and a trace of every decision.

npm run agent:first

The agent calls the tool when the answer requires external information.
Malformed or out-of-scope tool arguments are rejected before execution.

Prepare a small supervised fine-tuning dataset plan, split it correctly, define quality checks, and decide whether fine-tuning is justified.

npm run dataset:check

Every example has an input, ideal output, category, and review status.
No duplicate or near-duplicate examples cross train/test boundaries.

Add tracing, metrics, and a pre-release evaluation gate to one AI workflow so production changes are measurable and reversible.

npm run release:gate

Every eval run records model, prompt, data, and tool versions.
The gate fails when quality drops, cost exceeds budget, or unsupported answers increase.

First deep module

Attention Is All You Need

The Transformer lesson teaches tokens, embeddings, self-attention, multi-head attention, positional encoding, encoder-decoder flow, and why the architecture became central to modern AI.

Read the Transformer paper section by section and explain the authors' argument.
Define tokens, embeddings, hidden states, attention scores, softmax, residual connections, masking, dropout, and label smoothing from first principles.
Derive the scaled dot-product attention equation and explain every symbol.
Trace how the encoder, decoder, multi-head attention, feed-forward layers, positional encodings, and output softmax move information through the architecture.

Open lesson Continue to LLM Foundations Continue to RAG Systems Continue to Agents and Tool Use Continue to Data Pipelines Continue to Training Runs Continue to Fine-Tuning Continue to AI Systems

Learn AI the way professionals actually use it.

Start with foundations, then move across the AI system stack.

Basics of Transformers

LLM Foundations

RAG Systems

Agents and Tool Use

Data Pipelines

Training Runs

Fine-Tuning

AI Systems

Small runnable projects that prove the concepts.

First LLM Eval Project

First RAG App

First Tool-Using Agent

First Fine-Tuning Dataset and Eval Plan

First Production AI Observability and Eval Gate

Attention Is All You Need