GeekHub Learn
Module
Lesson 9.88 of 8 in this module2 min read Module 9: Building a PDF Chatbot (RAG Project)

Deployment and stretch goals

The capstone moment: ship the PDF chatbot live. Stretch goals turn it from "tutorial project" into "founder-able product".

You built a working bicycle. Now you go ride it in public.

Deploy on Streamlit Cloud as in Module 6.6. Add secrets for OPENAI_API_KEY. Add a requirements.txt:

streamlit
openai
chromadb
pypdf
tiktoken
python-dotenv

Stretch goals:

  1. Multi-PDF library with per-document selection
  2. Hybrid search (vector + BM25)
  3. Reranker (Cohere Rerank or Voyage Rerank)
  4. Auth via Supabase
  5. Persistent user history
  6. Conversational follow-ups ("based on my last question...")
  7. Image extraction from PDFs (vision LLM call)
  8. Cost meter in the sidebar
  9. "Suggested questions" generator from indexed content
  10. Multi-language support

Hybrid search example:

from rank_bm25 import BM25Okapi
docs = [c["text"] for c in all_chunks]
bm25 = BM25Okapi([d.split() for d in docs])
def hybrid(question, k=10):
    vec_top = retrieve(question, k=k)
    kw_top = bm25.get_top_n(question.split(), docs, n=k)
    return dedupe_and_rerank(vec_top, kw_top)

Visualize it

A "v1 -> v2 -> v3" roadmap with the 10 stretch goals on the right side.

Try it now

Deploy. Share the URL. Get one friend to try and tell you the first thing they wished worked differently.

Hands-on lab

Deploy. Pick one stretch goal. Ship it within a week. Update README.

Try it now

What is the smallest stretch goal that adds the most user trust?

Common mistakes

  • Deploying without ingest progress UI (users assume it broke)
  • Skipping cost meter (one user with a 1000-page PDF can shock you)
  • Picking 4 stretch goals at once (none ship)

Debugging tip

If your deployed app fails on large PDFs, you may be exceeding Streamlit Cloud memory. Switch to Hugging Face Spaces (more RAM) or a paid tier.

Challenge

Ship 3 stretch goals over 3 weekends. Document each in a separate PR.

Where this shows up

  • Founder MVPs in legal, education, support
  • Internal company tools
  • Public-facing AI utilities

From the field

A polished deployed RAG product on your GitHub is hireable on its own in 2026. Many junior AI roles ask for exactly this artifact.

Recap

Deploy v1. Add one stretch at a time. The capstone of this course lives here.


Quick recall

3 prompts · think before you flip

Prompt 1 of 3

Why a cost meter?

Quiz time

1 question · tap an answer to check it

  1. 1. The single highest-trust stretch goal is usually

Finished lesson 9.8?

Mark complete to update your module progress and unlock the streak.

Loading