Module 1: Introduction to AI, ML, Deep Learning, GenAI, and LLMs

Module Goal

Build a rock-solid mental model of the AI landscape so that every later module makes sense. By the end of this module you will be able to draw the AI to ML to DL to GenAI to LLM hierarchy from memory and explain when each one is the right tool.

Estimated Duration

3 to 5 hours.

Skills Learned

Distinguishing AI, machine learning, deep learning, generative AI, and LLMs
Reading AI news without getting confused by buzzwords
Identifying which problems suit which technique
Spotting marketing dressed up as "AI"

Real-world Importance

In 2026 every product team is "doing AI." Most of them confuse machine learning with LLMs and waste months. The first skill of an AI engineer is calling the right tool by its right name and choosing it for the right reason.

Lessons in this module

What is AI, really? (and what it is not)
The AI to ML to DL to GenAI to LLM hierarchy
Discriminative versus Generative AI in 60 seconds
The 2026 LLM landscape and how we got here
Where LLMs shine, and where they should not be used

Lesson 1.1: What is AI, really? (and what it is not)

Hook / Why This Matters

If you cannot define AI in one sentence, every later concept will feel slippery. AI is also the single most misused word in technology marketing today. Getting this right is your shortcut to seeing through hype.

Beginner Analogy

Think of a thermostat. It senses temperature and decides whether to turn the heater on. That is a tiny piece of intelligence in a machine. AI is the same idea, scaled up: a machine that senses, decides, and acts in a way that we would call intelligent if a human did it.

Concept Explanation

Artificial Intelligence is the broad goal of building machines that perform tasks which normally require human intelligence: understanding language, recognizing images, making decisions, planning, and learning from experience.

AI is the goal. Everything else you will hear about (machine learning, deep learning, LLMs) is a technique used to reach that goal.

Technical Breakdown

AI splits into two families based on how the intelligence is built:

Symbolic AI (rule-based): Humans write explicit rules. If X then Y. The chess engine Deep Blue (1997) was largely symbolic. Strengths: predictable, explainable. Weaknesses: brittle, scales badly.
Statistical / learned AI: The machine learns patterns from data instead of being told rules. Everything modern (ChatGPT, image recognition, recommendation systems) is in this family. Strengths: handles messy real-world data. Weaknesses: less interpretable, needs lots of data.

In 2026, "AI" almost always means the second category.

Visual Learning Suggestion

Insert a diagram here: a large circle labeled "AI", with two child circles inside labeled "Symbolic AI" and "Learned AI". Inside "Learned AI", a smaller circle "Machine Learning", and inside that "Deep Learning", and inside that "Generative AI", and inside that "LLMs". This nested-circles diagram is the most important visual of Module 1.

Interactive Element

Open ChatGPT or Gemini. Ask: "Give me three tasks where AI is genuinely useful and three where humans calling something AI is mostly marketing." Compare its answer to your own gut. Notice where you disagreed. That gap is where your intuition needs reps.

Hands-on Lab

No code yet. Do this writing exercise (15 minutes):

Open a document. List 10 apps or features you used today.
For each, write whether it uses AI (Y/N) and why you think so.
For the Y items, guess whether it is symbolic or learned.
Save this. We will revisit it at the end of Module 2 and you will be amazed how much your mental model sharpens.

Mini Exercise

Classify these as AI or not AI: (a) the recommendation row on Netflix, (b) a CAPTCHA, (c) Excel's SUM function, (d) Google Translate, (e) Siri's wake-word detection. Answers at the end of the lesson.

Common Mistakes

Calling every automation "AI". A SQL query that updates a column is automation, not AI.
Believing AI is conscious. It is not. It is statistics at scale.
Thinking AI is new. The term was coined in 1956. The current boom is one of many waves.

Debugging Tips

When you hear "this product uses AI", ask three questions:

What data did it learn from?
What is the prediction or output?
What happens when it is wrong?

If a salesperson cannot answer all three, it is probably not AI in any useful sense.

Knowledge Check Questions

What is the difference between symbolic and learned AI?
Why do most 2026 products use the learned approach?
Give one task where symbolic AI is still the right choice.

Quiz Questions

AI is best defined as: a) Any computer program that automates a task b) Machines performing tasks that normally require human intelligence c) Robots d) ChatGPT Answer: b
Deep Blue (1997 chess engine) is mostly an example of: a) Generative AI b) Statistical AI c) Symbolic AI d) An LLM Answer: c

Challenge Task

Write a one-page "AI vs not-AI" explainer for a non-technical friend. Use only analogies, no jargon. Publish it as a GeekHub post and tag it #ai-beginners.

Real-world Use Cases

Email spam filters (learned AI, classification)
Voice typing on your phone (learned AI, speech recognition)
Self-driving lane assist (learned AI, computer vision)
A traffic light timer (not AI, just a rule)

Industry Insight

In hiring, "AI experience" without context is meaningless on a resume. Specify which family and which task: "trained a classification model on customer churn data" or "built a RAG app with the OpenAI API". This is the language that gets interviews.

Interview Questions

Define AI in one sentence.
What is the difference between rule-based and learned AI? Give an example of each.
Why is the current AI boom different from the 1980s boom?

Summary

AI is the goal of making machines act intelligently. In 2026 it is dominated by learned approaches, where machines find patterns in data rather than being told explicit rules. Everything else in this course is a technique inside this umbrella.

Mini Exercise answers: a) AI (recommendation model), b) AI (image classification model behind the scenes), c) Not AI (it is a formula), d) AI (neural machine translation), e) AI (small on-device model).

Lesson 1.2: The AI to ML to DL to GenAI to LLM hierarchy

Hook / Why This Matters

This is the single most useful diagram in the entire course. If you internalize it, every news headline and job description suddenly slots into the right bucket.

Beginner Analogy

Think of Russian dolls. The biggest doll is AI. Open it: inside is Machine Learning. Open that: inside is Deep Learning. Open that: Generative AI. Open that: Large Language Models. Each is a special case of the one outside it.

Concept Explanation

AI: anything that makes a machine act intelligently.
Machine Learning (ML): AI that learns from data instead of being programmed with rules.
Deep Learning (DL): ML that uses neural networks with many layers.
Generative AI (GenAI): DL models that create new content (text, images, audio, video).
Large Language Models (LLMs): GenAI for text, trained on huge text corpora.

ChatGPT is an LLM. An LLM is a GenAI model. A GenAI model is a deep learning model. Deep learning is machine learning. Machine learning is AI.

Technical Breakdown

Each layer adds a constraint and a power:

Layer	Adds	Trades off
ML over AI	Learns from data	Needs lots of clean data
DL over ML	Handles unstructured data (text, images, audio)	Needs lots of compute
GenAI over DL	Produces new content, not just predictions	Hard to evaluate "good"
LLM over GenAI	Specifically for language	Hallucinates, costly

Visual Learning Suggestion

Insert the nested circles diagram from Lesson 1.1, this time labeled with one real example in each ring: "Symbolic AI: Deep Blue. ML: Spam filter. DL: Image classifier. GenAI: DALL-E. LLM: ChatGPT."

Interactive Element

Read three AI news headlines from this week (any tech site). For each, write which layer it is talking about. Be strict: if it says "AI" but means "an LLM call", note that. This pattern recognition compounds fast.

Hands-on Lab

Open a blank doc. From memory, draw the nested-circles diagram. Place these in the right ring:

Netflix recommendations
ChatGPT
Self-driving Tesla
Midjourney
Gmail's spam filter
DeepFake video
A bank's credit-scoring model

Then check yourself: most are ML, some are DL, two are GenAI, one is an LLM.

Mini Exercise

True or false: Every LLM is a deep learning model, but not every deep learning model is an LLM. Justify.

Common Mistakes

Treating "AI" and "LLM" as synonyms. LLMs are a tiny sliver.
Calling image generators LLMs. They are GenAI but not LLMs.
Believing ML always uses neural networks. It often does not (decision trees, gradient boosting, linear regression).

Debugging Tips

When reading a paper or product spec, ask: which ring are we in? It changes what evaluation methods, costs, and risks apply.

Knowledge Check Questions

Is every machine learning model a deep learning model? Why or why not?
Why is generative AI a subset of deep learning specifically?
Where would you place a stock price prediction model in the hierarchy?

Quiz Questions

ChatGPT is best described as: a) Machine learning that is not deep learning b) A deep learning generative language model c) A symbolic AI system d) An image generator Answer: b
Decision trees are a form of: a) Deep learning b) Machine learning that is not deep learning c) Generative AI d) An LLM Answer: b

Challenge Task

Pick any AI startup from a YC batch in 2025 or 2026. Read its landing page. Write a 200-word note placing its tech in the hierarchy and identifying the layer where the actual differentiation lives.

Real-world Use Cases

ML (not DL): credit scoring, fraud detection on tabular data
DL (not GenAI): face recognition, voice typing
GenAI (not LLM): Midjourney, Suno music, Sora video
LLM: ChatGPT, Claude, Gemini, Llama

Industry Insight

A common 2026 product mistake: teams use an LLM where a tiny ML classifier would be 100x cheaper and more accurate. Knowing the hierarchy saves money.

Interview Questions

Draw the AI to ML to DL to GenAI to LLM hierarchy on a whiteboard with one example each.
Give an example where ML is the right choice and an LLM would be overkill.
Is DALL-E an LLM? Justify your answer.

Summary

AI is the umbrella. ML learns from data. DL is ML with neural networks. GenAI generates content. LLMs are GenAI for text. Internalize this and the rest of the course unfolds cleanly.

Lesson 1.3: Discriminative versus Generative AI in 60 seconds

Hook / Why This Matters

Every AI model in the world is doing one of two things: drawing a line between things, or creating new things. Once you see this split, model choice becomes obvious.

Beginner Analogy

A bouncer at a club is discriminative: looks at you and decides "in" or "out". An artist drawing a portrait is generative: produces something new that did not exist before.

Concept Explanation

Discriminative models answer: "What category does this input belong to?" or "What is the predicted value?". Examples: spam or not spam, house price, image of a cat or dog.
Generative models answer: "Given this prompt, produce a plausible new sample." Examples: write an essay, draw a logo, compose music.

Technical Breakdown

Mathematically, discriminative models learn P(label | input). Generative models learn P(input) and then sample from it. You do not need to compute this, but knowing the framing helps you read papers.

Visual Learning Suggestion

A two-column diagram. Left column "Discriminative" with arrows: image -> label, email -> spam or not, transaction -> fraud or not. Right column "Generative" with arrows: prompt -> essay, prompt -> image, prompt -> music.

Interactive Element

In ChatGPT, run two prompts. First: "Classify this email as spam or not: [paste]". Second: "Write a sample spam email about a fake lottery win." First is discriminative use of an LLM, second is generative.

Hands-on Lab

Pick three real apps you use. For each, identify whether the AI inside is discriminative, generative, or both. Examples to start: TikTok, Photoshop, Gmail.

Mini Exercise

Is Google Translate discriminative or generative? Defend either answer in two sentences.

Common Mistakes

Assuming LLMs are always generative. They can be used discriminatively (classify, score, route).
Confusing "creative" with generative. A creative-looking output can still be a classification underneath.

Debugging Tips

If your AI app is unreliable, ask whether you are using a generative model for what is really a discriminative job. Often, a tiny classifier solves it.

Knowledge Check Questions

Define discriminative vs generative in one sentence each.
Can the same LLM be used both ways? Give an example.
Why are generative models harder to evaluate?

Quiz Questions

A model that decides if a tweet is positive or negative is: a) Generative b) Discriminative c) Both d) Neither Answer: b

Challenge Task

Take your favorite SaaS app. Sketch two AI features: one discriminative, one generative. Specify inputs, outputs, and which would be more valuable to ship first.

Real-world Use Cases

Discriminative: fraud detection, content moderation, search ranking
Generative: writing assistants, image generation, code completion
Mixed: AI agents that classify intent then generate a response

Industry Insight

Most 2026 production AI features are 80% discriminative under the hood, with a generative layer on top to make outputs natural. Knowing this lets you architect for cost and reliability.

Interview Questions

Difference between discriminative and generative models. Give an example each.
Why might you use a discriminative classifier in front of an LLM call?

Summary

Every AI task is either picking a label or producing new content. LLMs can do both, but treat them as expensive generative tools and reach for classifiers when classification is what you need.

Lesson 1.4: The 2026 LLM landscape and how we got here

Hook / Why This Matters

You will hear a hundred model names this year. Without context, you will pick wrong. This lesson gives you the map.

Beginner Analogy

If LLMs were cars, GPT is the Toyota that proved the model, Claude is the Lexus famous for safety, Gemini is the Tesla integrated with the Google ecosystem, Llama is the open-source Honda you can mod yourself.

Concept Explanation

The current LLM era began with the 2017 paper "Attention Is All You Need" (Transformer architecture). Key milestones:

2018: GPT-1 and BERT. Proved Transformers scale.
2020: GPT-3. Showed few-shot prompting works.
2022: ChatGPT. The consumer breakthrough.
2023: GPT-4, Claude, Llama 2 (open source).
2024: Gemini, Llama 3, multimodal mainstream.
2025: Reasoning models (o1-style), agents go mainstream.
2026: On-device LLMs, longer contexts, cheaper inference, better tool use.

Technical Breakdown

The 2026 LLM market has four tiers:

Frontier closed models: OpenAI GPT family, Anthropic Claude family, Google Gemini family. State of the art quality, paid API, easy to use.
Frontier open weights: Meta Llama, Mistral, DeepSeek. Strong quality, downloadable, can be self-hosted.
Small efficient models: Phi, Gemma, Qwen small. Run on a laptop, surprisingly capable for narrow tasks.
Specialized models: code (DeepSeek Coder), vision-language (PaliGemma), embeddings (text-embedding-3, voyage-3).

Visual Learning Suggestion

A timeline horizontal chart from 2017 to 2026 with key models on it. Below the timeline, a 2x2 quadrant: x-axis "closed to open", y-axis "frontier to small", with the 4 tiers placed.

Interactive Element

Open openai.com/api, anthropic.com, ai.google.dev, and llama.com side by side. Note the latest model and price per million tokens for each. This snapshot will be your baseline for the rest of the course.

Hands-on Lab

Create a "LLM scorecard" spreadsheet with columns: Provider, Model, Input price per 1M tokens, Output price per 1M tokens, Context window, Strengths, Weaknesses. Fill in 6 current 2026 models. You will reuse this in Module 5.

Mini Exercise

In 2026 dollars per million output tokens, sort the four current frontier models from cheapest to most expensive. Note which has the largest context window.

Common Mistakes

Picking a model because it is famous, not because it fits the task.
Choosing the largest model when a small one would do.
Forgetting that prices and capabilities change every quarter. Re-check before any production decision.

Debugging Tips

If your app is too slow or expensive, the fix is almost never a fancier prompt. It is a different (smaller) model.

Knowledge Check Questions

What was the architectural breakthrough behind modern LLMs?
Name one frontier closed model and one frontier open-weights model.
When would you pick a small model over a frontier model?

Quiz Questions

The Transformer architecture was introduced in: a) 2012 b) 2017 c) 2020 d) 2022 Answer: b
Llama is best described as: a) A closed frontier model b) An open-weights frontier model c) A symbolic AI system d) A small on-device model Answer: b

Challenge Task

Pick a real use case (for example: summarizing customer support tickets). Argue in 200 words which tier and which specific model you would use, including cost reasoning.

Real-world Use Cases

ChatGPT for general writing and brainstorming
Claude for long-context document analysis
Gemini for Google Workspace integration
Llama for self-hosted, privacy-sensitive deployments
Phi or Gemma for on-device features in a mobile app

Industry Insight

The "best model" question is the wrong question. The right one is "which model is best for this specific task within this specific budget and latency target". Engineers who internalize this ship reliable products. Those who do not chase the leaderboard every week.

Interview Questions

Walk me through the major LLM milestones from 2017 to today.
Compare frontier closed vs open-weights models. When would you pick each?
What is the largest context window you have worked with, and what did it unlock?

Summary

The 2026 LLM landscape has four tiers across closed-versus-open and frontier-versus-small. The best engineers pick the smallest, cheapest model that does the job, not the largest brand-name one.

Lesson 1.5: Where LLMs shine, and where they should not be used

Hook / Why This Matters

Half of failed AI projects in 2026 used an LLM where it had no business being used. This lesson saves you that mistake.

Beginner Analogy

A Swiss army knife is great. But you would not use it to chop down a tree. LLMs are the same: powerful, general, but wrong for some jobs.

Concept Explanation

LLMs shine when:

Input is natural language and output is natural language
Tasks need flexibility and creativity
There is no clean labeled dataset
"Pretty good and fast" beats "perfect and slow"

LLMs are wrong when:

Outputs must be exactly correct (medical dosages, legal citations) without checks
Strong privacy or compliance requires no data leaving your machine
The task is deterministic and small (a regex would do)
Latency must be under 50 ms
You need explainable, auditable decisions

Technical Breakdown

Reach for an LLM when the task is fuzzy: summarization, rewriting, classification with subtle context, drafting. Reach for traditional tools when the task is sharp: arithmetic, exact lookups, deterministic rules.

In practice, the best 2026 systems combine both. An LLM routes the request, a deterministic system does the precise work, an LLM formats the response.

Visual Learning Suggestion

A two-column "Reach for an LLM" vs "Do not reach for an LLM" table, with 5 to 8 example tasks in each.

Interactive Element

Ask ChatGPT to multiply two 6-digit numbers. Then ask it to summarize the plot of any movie. Notice which one it nails and which one it can flub. Math without tools is its classic weakness.

Hands-on Lab

In a spreadsheet, list 15 features your favorite app could add. For each, decide: LLM, classical ML, traditional code, or hybrid. Defend each pick in one sentence.

Mini Exercise

Pick three: (a) password validation, (b) summarizing a legal contract, (c) translating English to Hindi, (d) detecting credit card fraud in real time. Which suit an LLM? Why?

Common Mistakes

Calling an LLM for tasks a regex solves
Using an LLM for arithmetic without giving it a calculator tool
Sending sensitive data to a public LLM API without checking your provider's data retention policy
Promising users a "smart AI assistant" that turns out to be a $0.10-per-query glorified chatbot

Debugging Tips

If your LLM app behaves inconsistently, ask whether the task should have been deterministic. Often the fix is to remove the LLM from the critical path.

Knowledge Check Questions

Name three task types where LLMs are the right tool.
Name three task types where LLMs are the wrong tool.
Why are LLMs poor at exact arithmetic by default?

Quiz Questions

Validating that a string is a valid email address: a) Is a great use of an LLM b) Should use a regex, not an LLM c) Requires GPT-4 d) Needs RAG Answer: b

Challenge Task

Audit any AI feature in a product you use. Decide whether the team made the right tool choice. Write a 300-word memo.

Real-world Use Cases

LLM-suited: drafting emails, classifying support tickets, generating product descriptions, code completion, summarization
Not LLM-suited: calculating taxes, validating IDs, real-time price computation, exact database lookups

Industry Insight

The phrase you want on your resume is: "I used an LLM where it was the right tool, and a classical method where it was not." This single sentence sets you apart from 90% of bootcamp grads.

Interview Questions

Give an example of a task where you replaced an LLM with a simpler approach. Why?
How do you handle math inside LLM workflows?
What is the cost-quality tradeoff when picking a model for production?

Summary

LLMs are excellent at fuzzy language tasks and terrible at exact, deterministic ones. Real engineers compose LLMs with classical code and use each tool for what it is best at.

Module 1 Recap

You can now place any AI buzzword in its correct ring, distinguish discriminative from generative, name the 2026 LLM landscape, and decide when an LLM is and is not the right tool.

Next Module

Module 2: How ChatGPT and Transformers Work