Essentials and applications of generative AI unit outline

Lesson 44: State of the art in generative AI (LLM landscape and foundations)

Focus: Survey of state-of-the-art LLMs with foundational concepts for understanding and comparing models

Topic	Purpose
The landscape	Open vs closed weights, licensing, text and code generation models
Evaluating models	Model sizes and parameters, context length, benchmarks
Looking ahead	Key trends (efficiency, reasoning, agents) and limitations

Key concepts: Open vs closed weights, licensing, model sizes, context length, benchmarks, LLM landscape

Lesson 45: Practical LLM deployment (Working with large language models)

Focus: How to configure, interact with, and deploy LLMs - structured as a walk through the application stack

Topic	Purpose
Model layer	Decoding strategies, hyperparameters, performance optimizations
Inference layer	Local (direct loading, Ollama), cloud, and API deployment options; tokens and cost
Framework layer	Why use a framework; LangChain, LlamaIndex, Haystack, and others
Application layer	System prompts, message format (system/user/assistant), streaming vs batch

Builds on 44: Applies foundation model knowledge to practical configuration and deployment

Key concepts: Application stack layers, decoding strategies, hyperparameters, deployment strategies, LLM frameworks, chat message format

Lesson 46: Advanced prompt engineering techniques: Part 1 (Prompting strategies)

Focus: Fundamental prompting approaches for eliciting better LLM responses

Topic	Purpose
Zero-shot prompting	Task completion without examples
Few-shot prompting	Learning from in-context examples
Prompt structure	Role definitions, temperature settings
Practical examples	Translation, classification, text analysis

Builds on 45: Uses LLM APIs learned previously, focuses on prompt design

Key concepts: In-context learning, example-based prompting, prompt engineering fundamentals

Lesson 47: Advanced prompt engineering techniques: Part 2 (Advanced reasoning)

Focus: Sophisticated prompting techniques for complex reasoning tasks

Topic	Purpose
Chain of thought (CoT)	Step-by-step reasoning prompts
Self-consistency	Multiple reasoning paths, majority voting
Tree of thoughts (ToT)	Branching reasoning, exploration of solution space
Template formats	Jinja2 templates, f-strings, custom templates
Dynamic message generation	Programmatic prompt construction

Builds on 46: Extends basic prompting to complex multi-step reasoning

Key concepts: Reasoning chains, multi-path exploration, dynamic prompt generation

Lesson 48: LangChain for LLM application development: Part 1 (Core components)

Focus: Building blocks of LangChain applications

Topic	Purpose
Models	Chat models, LLM wrappers, direct API calls
Prompts	ChatPromptTemplate, structured prompts
Output parsers	Structured output extraction, format control
Basic chains	Connecting models, prompts, and parsers

Builds on 47: Formalizes prompt engineering with LangChain abstractions

Key concepts: Model-Prompt-Parser pipeline, LangChain component architecture

Lesson 49: LangChain for LLM application development: Part 2 (Advanced patterns)

Focus: Retrieval, memory, and autonomous agents

Topic	Purpose
Document loaders	TextLoader, PyPDFLoader for various formats
Text splitters	RecursiveCharacterTextSplitter for chunking
Embeddings	HuggingFace and OpenAI embeddings
Vector stores	Chroma for similarity search and persistence
Sequential chains	Multi-step workflows with context passing
Memory	Conversation history, state management
Agents	Autonomous tool use (llm-math, wikipedia)
Local LLMs	Running Falcon models locally

Builds on 48: Extends basic chains to RAG, memory, and agent patterns

Key concepts: Retrieval-augmented generation (RAG), vector databases, autonomous agents

Lesson 50: LLM fine-tuning, customization, and tool integration (Model adaptation)

Focus: Adapting pre-trained models and extending capabilities with tools

Topic	Purpose
Data preparation	Dataset formatting, preprocessing for fine-tuning
Fine-tuning techniques	LoRA, QLoRA for parameter-efficient fine-tuning
Domain adaptation	Adapting models to specific domains (legal, medical, etc.)
Evaluation	Assessing fine-tuned model performance
Model Context Protocol (MCP)	Standardized protocol for LLM-tool communication
MCP servers	Building and deploying MCP servers for custom tools
Tool integration	Connecting LLMs to external APIs, databases, and services
Function calling	Structured tool use with modern LLMs

Builds on 49: Extends agent capabilities with custom tools and domain-specific models

Key concepts: Transfer learning, parameter-efficient fine-tuning, tool integration protocols, MCP architecture

Lesson 51: Advanced agents and production deployment (Capstone)

Focus: Complex multi-agent systems and deploying LLM applications to production

Topic	Purpose
Multi-agent architectures	Coordinating multiple specialized agents
Agent planning	ReAct, reflection, and iterative refinement patterns
Agent memory systems	Long-term memory, knowledge graphs for agents
Evaluation frameworks	Testing agent reliability and performance
Production deployment	Containerization, API design, monitoring
Cost optimization	Caching, prompt optimization, model selection
Safety and alignment	Guardrails, content filtering, output validation
Real-world applications	Case studies of production agent systems

Builds on 50: Advanced patterns building on basic agents from Lesson 49

Key concepts: Multi-agent coordination, production engineering, safety considerations, deployment best practices

Course progression summary

Foundation (Lesson 44): LLM landscape, model sizes, context length, quantization, benchmarks
↓
Configuration & Deployment (Lesson 45): Decoding strategies, hyperparameters, chat vs completion, hosting options
↓
Prompt Engineering (Lessons 46-47): Zero-shot → Few-shot → CoT → ToT
↓
LangChain Development (Lessons 48-49): Core components → RAG + Agents
↓
Customization & Tools (Lesson 50): Fine-tuning, MCP servers, tool integration
↓
Advanced Production (Lesson 51): Multi-agent systems, deployment, safety

Key pedagogical flow: The course moves from surveying the generative AI landscape to practical LLM deployment, systematically building skills in prompting, application development, model customization, tool integration, and production deployment with advanced agent systems.