Lesson 12 — Capstone: My College AI Portfolio | Class 12

Recap

All 12 Lessons — What You Built

Twelve lessons. Eleven characters. Applied AI mastery from LLM fine-tuning to college admissions. Click any card to revisit it.

L01

Capstone

You are here ⭐

Portfolio Builder

Your 3-Flagship GitHub Portfolio

Fill in these four sections to create your college-application-ready portfolio summary. Save this page or copy the text into your README files.

🏆 Flagship Project #1

🏆 Flagship Project #2

🏆 Flagship Project #3

📝 College Application Essay (1 paragraph)

Wisdom

One Insight from Each Lesson

Lesson	Character	Key Insight
L01	Aishwarya, Pune	QLoRA fine-tunes a 7B model in 4-bit on a free Colab T4 — 99% of full-fine-tune quality at 1% of the cost. The barrier to LLM customisation is no longer compute.
L02	Karthik, Chennai	Hybrid retrieval (BM25 + dense + reranker) beats pure vector search on accuracy. Production RAG is three retrievers, not one.
L03	Ishaan, Bengaluru	PyTorch DDP turns 14 hours of training into 2 hours with 12 lines of code. The linear scaling rule (LR × world_size + warmup) is the one rule you must remember.
L04	Sneha, Kolkata	UPI fraud is a graph problem, not a tabular one. Same data + GNN architecture cut false positives from 32% to 9% — features alone could not.
L05	Tara, Goa	Diffusion models are denoisers — generate by starting from noise and removing it 50 times. ControlNet preserves your composition; safety controls prevent your tools from being weaponised.
L06	Manav, Lucknow	Voice cloning without consent is illegal under DPDPA, the IT Rules, and personality rights case law. Refuse the offer, even when paid.
L07	Priya, Hyderabad	Two towers + negative sampling scales recommendations to billions of items. Engagement is not the only metric — guardrails (kill-switch, opt-out, "why am I seeing this") matter long-term.
L08	Rohit, Mumbai	vLLM + PagedAttention serves 50K daily users on a ₹40K/month A10 GPU — 1/8th the OpenAI cost. Self-hosting wins above ~5K DAU; below that, the API price is worth it.
L09	Anjali, Delhi	Prophet + Indian holidays + multiplicative seasonality + the right confidence-interval cuts ₹40K/month inventory waste. The forecast is one input — the inventory decision is asymmetric-cost optimisation.
L10	Devansh, Ahmedabad	An 84%-accurate model shipped well outperforms a 92%-accurate model shipped badly. Shadow → canary → A/B → 100% with kill switches. The PM craft multiplies (or destroys) the value of any model.
L11	Ananya, Coimbatore	Three flagship projects + a Kaggle bronze + one specific cold-email beats 30 generic ones. College brand opens the first door — your work keeps it open.

Certificate

Class 12 Completion Certificate

🎓 Certificate of Completion

First pass the Capstone Quiz below (80% or higher). The Generate button unlocks once you pass.

This certifies that

has successfully completed
Class 12 — Applied AI Mastery
12 lessons · 96 quiz questions · Mitra AI Life Education

Skills: LLM Fine-tuning · Hybrid RAG · Distributed Training · GNNs · Diffusion · Speech AI Ethics · Recommender Systems · vLLM at Scale · Time Series · AI Product Management · College Portfolio

What's Next

After Class 12 → Engineering College

🏛️

JEE / BITSAT / UGEE

Crack the entrance test for IIT, IIIT-H, BITS Pilani, NIT, IIIT-D.

📂

Pin 3 Flagships

Polish READMEs, deploy demos, link from your college application form.

📧

Cold-Email Researchers

Get a summer research internship before Year 1 of college begins.

🥉

First Kaggle Medal

Compete in an Indic NLP or vision competition. A bronze is a real signal.

🚀

Open-Source PRs

Contribute to AI4Bharat, vLLM, transformers. A merged PR opens doors.

🌍

MS Abroad (Year 4+)

Funded MS at CMU/MIT/Stanford/NUS — research assistantship pays tuition.

You are college-ready. Class 11 built your AI engineering foundation. Class 12 applied that foundation to research-grade techniques and shipped you a 3-flagship GitHub portfolio plus a college roadmap. From here, the next chapter is engineering college, internships, and your first AI role. The Mitra AI Life journey continues — keep building.

⭐ Capstone Quiz — Cross-Lesson Mastery (10 Questions)

1. Aishwarya (L01) fine-tuned Llama-3-8B with QLoRA on a free Colab T4. Karthik (L02) built hybrid RAG. Rohit (L08) served the model with vLLM. The correct end-to-end production pipeline order is:

a) Serve → Fine-tune → Retrieve

b) Fine-tune the base model on domain data (QLoRA) → wrap inference in a hybrid RAG pipeline so the model has up-to-date facts at query time → serve at scale with vLLM + Kubernetes for cost-efficient throughput. Each stage builds on the previous: fine-tuning gives base capability, RAG provides freshness, vLLM provides scale.

c) Retrieve → Serve → Fine-tune

d) The order doesn't matter as long as all three are used

2. Ishaan (L03) used PyTorch DDP. Sneha (L04) used PyG NeighborLoader. Both required mini-batching at scale. The shared engineering principle is:

a) Always use the largest batch size your GPU memory allows

b) Distributed training requires careful data partitioning — DistributedSampler ensures each GPU sees a different shard each epoch with no overlap; NeighborLoader samples each node's neighbourhood independently to fit graph batches in memory. In both cases, naive batching either duplicates work or runs out of memory; the right sampler is the unlock.

c) Data parallelism is always better than model parallelism

d) Mini-batching is only required when training on multiple GPUs

3. Tara (L05) added watermarks to her diffusion outputs. Manav (L06) refused a celebrity voice-clone offer. Both were applying the same principle. What is it?

a) Always charge premium prices for AI-generated content

b) Generative AI requires consent + provenance — synthesising someone's likeness or voice without permission violates Indian DPDPA 2023, IT Rules 2021/2023, and personality rights case law (Anil Kapoor v. Simply Life India). Watermarking and refusing unconsented clones are both expressions of "you do not synthesise or simulate a person without their consent and clear labelling."

c) Open-source models are safer than commercial ones

d) Watermarks reduce model accuracy and should only be added on request

4. Priya (L07) used two-tower with negative sampling. Why is the two-tower architecture (separate user and item encoders) preferred over a single combined model for production recommenders at scale?

a) Two towers always achieve higher accuracy than single models

b) At serving time, you can precompute and FAISS-index every item embedding once; for each query you only encode the user vector and do a fast nearest-neighbour search. A single combined model would require evaluating one network per (user, item) pair — billions of forward passes per query batch, which does not scale.

c) Two towers can be trained on different programming languages

d) FAISS only supports two-tower architectures

5. Rohit (L08) found self-hosting Llama-3-8B with vLLM costs ₹40K/month vs ₹3.2L for OpenAI at the same load. Devansh (L10) calculated A/B-test power before launch. The shared business principle is:

a) AI features always reduce engineering cost

b) Decisions about AI infrastructure and product launches require quantitative reasoning before commitment — Rohit calculated the crossover point (~5K DAU) where self-hosting wins; Devansh calculated the sample size needed for statistical significance. Without these calculations, both would have shipped on intuition and lost either money or evidence.

c) Self-hosting is always cheaper than using APIs

d) A/B tests should be skipped to ship faster

6. Anjali (L09) used the upper 80% confidence interval for perishable mithai but lower 60% for non-perishable packaging. This illustrates which fundamental principle of decision-making under uncertainty?

a) Always use the median forecast — outliers should be ignored

b) The right quantile of a forecast distribution depends on the asymmetric cost of being wrong in each direction. Stockout of mithai loses customers (high cost) so order optimistically; overstock of packaging ties up cash (lower cost) so order conservatively. Point forecasts ignore this asymmetry; quantile forecasts make it explicit.

c) Confidence intervals are only for academic papers, not production systems

d) 80% is always the right confidence level for inventory decisions

7. Sneha (L04) found UPI fraud was a graph problem. Manav (L06) found voice synthesis is fundamentally a consent problem. Devansh (L10) found AI features are PM problems first, model problems second. The cross-cutting lesson is:

a) Every problem in AI can be solved with a larger neural network

b) Framing the problem correctly matters more than the model architecture chosen. Treating fraud as tabular missed the relational structure; treating voice synthesis as a tech problem missed the consent failure mode; treating AI features as model problems missed the rollout-and-measurement failure mode. Wrong framing × good model = wrong answer.

c) AI engineers should specialise in only one domain

d) Indian AI problems require different algorithms than global ones

8. Karthik (L02) reranked retrieved candidates with bge-reranker-v2-m3. Priya (L07) reranked with MMR for diversity. Why is reranking a separate stage from initial retrieval in production systems?

a) Reranking is required by the OpenAI API specification

b) Initial retrieval optimises for recall over a huge candidate pool (billions of items, must be fast); reranking optimises for precision and diversity over a small candidate set (top-100, can be slow). Combining them collapses both budgets — fast retrieval over billions + accurate scoring over hundreds is how production search and recommendations actually scale.

c) Reranking improves model accuracy by 90% in all cases

d) Reranking is only used for English-language results

9. Aishwarya (L01) trained for Marathi students. Tara (L05) made art for her grandmother. Manav (L06) built audiobooks for visually-impaired students. Anjali (L09) helped her family's mithai shop. The shared philosophical principle is:

a) AI projects should target maximum revenue potential first

b) The best AI projects start from a real, named human in your immediate community whose specific problem you understand deeply. Distance-from-the-problem is the leading indicator of irrelevant AI work — every memorable project in this course began with someone the builder personally knew, not a market opportunity from a slide deck.

c) Indian AI must focus on rural problems exclusively

d) AI projects should be entirely funded by personal savings

10. You have completed Class 12 — Applied AI Mastery. Looking at all 11 characters from Aishwarya to Ananya, the one habit they all shared that determined their success was:

a) Owning expensive personal hardware (multi-GPU rigs)

b) Shipping working artefacts publicly. Aishwarya pushed her LoRA adapter to Hugging Face. Tara released her Warli posters with credits. Rohit open-sourced his vLLM Helm chart. Ananya pinned 3 flagships on GitHub. They all turned learning into shipped, dated, public work — which is the single most reliable signal college admissions and AI employers have ever had.

c) Coming from English-medium schools in metro cities

d) Having parents already in the AI industry

← Lesson 11: College & Career ← Back to Class 12 Hub

Capstone: My College AI Portfolio 🎓

Class 12 Lesson 12 - Capstone: My College AI Portfolio

LLM Fine-tuning (QLoRA)

Vector DBs & Hybrid RAG

Distributed Training (DDP)

Graph Neural Networks

Diffusion Models & SDXL

Speech AI & TTS Ethics

Recommender Systems

vLLM + Kubernetes

Time Series Forecasting

AI Product Management

College & AI Career

Capstone

🏆 Flagship Project #1

🏆 Flagship Project #2

🏆 Flagship Project #3

📝 College Application Essay (1 paragraph)

🎓 Certificate of Completion

JEE / BITSAT / UGEE

Pin 3 Flagships

Cold-Email Researchers

First Kaggle Medal

Open-Source PRs

MS Abroad (Year 4+)

⭐ Capstone Quiz — Cross-Lesson Mastery (10 Questions)