Capstone: My College AI Portfolio ๐ŸŽ“

Class 12Age 16โ€“17Lesson 12 of 12๐Ÿ†“ Freeโญ Capstone
Class 12 capstone โ€” all 11 characters: Aishwarya, Karthik, Ishaan, Sneha, Tara, Manav, Priya, Rohit, Anjali, Devansh, Ananya โ€” presenting their college-ready AI portfolios
Watch first - 2-3 minutes

Class 12 Lesson 12 - Capstone: My College AI Portfolio

No sign-in needed - English narration - Safe for all school ages

Recap
All 12 Lessons โ€” What You Built

Twelve lessons. Eleven characters. Applied AI mastery from LLM fine-tuning to college admissions. Click any card to revisit it.

Skills Checklist
What Can You Do Now?
Skills Mastered0 / 12
Portfolio Builder
Your 3-Flagship GitHub Portfolio

Fill in these four sections to create your college-application-ready portfolio summary. Save this page or copy the text into your README files.

๐Ÿ† Flagship Project #1

๐Ÿ† Flagship Project #2

๐Ÿ† Flagship Project #3

๐Ÿ“ College Application Essay (1 paragraph)

Wisdom
One Insight from Each Lesson
LessonCharacterKey Insight
L01Aishwarya, PuneQLoRA fine-tunes a 7B model in 4-bit on a free Colab T4 โ€” 99% of full-fine-tune quality at 1% of the cost. The barrier to LLM customisation is no longer compute.
L02Karthik, ChennaiHybrid retrieval (BM25 + dense + reranker) beats pure vector search on accuracy. Production RAG is three retrievers, not one.
L03Ishaan, BengaluruPyTorch DDP turns 14 hours of training into 2 hours with 12 lines of code. The linear scaling rule (LR ร— world_size + warmup) is the one rule you must remember.
L04Sneha, KolkataUPI fraud is a graph problem, not a tabular one. Same data + GNN architecture cut false positives from 32% to 9% โ€” features alone could not.
L05Tara, GoaDiffusion models are denoisers โ€” generate by starting from noise and removing it 50 times. ControlNet preserves your composition; safety controls prevent your tools from being weaponised.
L06Manav, LucknowVoice cloning without consent is illegal under DPDPA, the IT Rules, and personality rights case law. Refuse the offer, even when paid.
L07Priya, HyderabadTwo towers + negative sampling scales recommendations to billions of items. Engagement is not the only metric โ€” guardrails (kill-switch, opt-out, "why am I seeing this") matter long-term.
L08Rohit, MumbaivLLM + PagedAttention serves 50K daily users on a โ‚น40K/month A10 GPU โ€” 1/8th the OpenAI cost. Self-hosting wins above ~5K DAU; below that, the API price is worth it.
L09Anjali, DelhiProphet + Indian holidays + multiplicative seasonality + the right confidence-interval cuts โ‚น40K/month inventory waste. The forecast is one input โ€” the inventory decision is asymmetric-cost optimisation.
L10Devansh, AhmedabadAn 84%-accurate model shipped well outperforms a 92%-accurate model shipped badly. Shadow โ†’ canary โ†’ A/B โ†’ 100% with kill switches. The PM craft multiplies (or destroys) the value of any model.
L11Ananya, CoimbatoreThree flagship projects + a Kaggle bronze + one specific cold-email beats 30 generic ones. College brand opens the first door โ€” your work keeps it open.
Certificate
Class 12 Completion Certificate

๐ŸŽ“ Certificate of Completion

First pass the Capstone Quiz below (80% or higher). The Generate button unlocks once you pass.

This certifies that
has successfully completed
Class 12 โ€” Applied AI Mastery
12 lessons ยท 96 quiz questions ยท Mitra AI Life Education

Skills: LLM Fine-tuning ยท Hybrid RAG ยท Distributed Training ยท GNNs ยท Diffusion ยท Speech AI Ethics ยท Recommender Systems ยท vLLM at Scale ยท Time Series ยท AI Product Management ยท College Portfolio
What's Next
After Class 12 โ†’ Engineering College
๐Ÿ›๏ธ

JEE / BITSAT / UGEE

Crack the entrance test for IIT, IIIT-H, BITS Pilani, NIT, IIIT-D.

๐Ÿ“‚

Pin 3 Flagships

Polish READMEs, deploy demos, link from your college application form.

๐Ÿ“ง

Cold-Email Researchers

Get a summer research internship before Year 1 of college begins.

๐Ÿฅ‰

First Kaggle Medal

Compete in an Indic NLP or vision competition. A bronze is a real signal.

๐Ÿš€

Open-Source PRs

Contribute to AI4Bharat, vLLM, transformers. A merged PR opens doors.

๐ŸŒ

MS Abroad (Year 4+)

Funded MS at CMU/MIT/Stanford/NUS โ€” research assistantship pays tuition.

You are college-ready. Class 11 built your AI engineering foundation. Class 12 applied that foundation to research-grade techniques and shipped you a 3-flagship GitHub portfolio plus a college roadmap. From here, the next chapter is engineering college, internships, and your first AI role. The Mitra AI Life journey continues โ€” keep building.

โญ Capstone Quiz โ€” Cross-Lesson Mastery (10 Questions)

1. Aishwarya (L01) fine-tuned Llama-3-8B with QLoRA on a free Colab T4. Karthik (L02) built hybrid RAG. Rohit (L08) served the model with vLLM. The correct end-to-end production pipeline order is:
a) Serve โ†’ Fine-tune โ†’ Retrieve
b) Fine-tune the base model on domain data (QLoRA) โ†’ wrap inference in a hybrid RAG pipeline so the model has up-to-date facts at query time โ†’ serve at scale with vLLM + Kubernetes for cost-efficient throughput. Each stage builds on the previous: fine-tuning gives base capability, RAG provides freshness, vLLM provides scale.
c) Retrieve โ†’ Serve โ†’ Fine-tune
d) The order doesn't matter as long as all three are used
2. Ishaan (L03) used PyTorch DDP. Sneha (L04) used PyG NeighborLoader. Both required mini-batching at scale. The shared engineering principle is:
a) Always use the largest batch size your GPU memory allows
b) Distributed training requires careful data partitioning โ€” DistributedSampler ensures each GPU sees a different shard each epoch with no overlap; NeighborLoader samples each node's neighbourhood independently to fit graph batches in memory. In both cases, naive batching either duplicates work or runs out of memory; the right sampler is the unlock.
c) Data parallelism is always better than model parallelism
d) Mini-batching is only required when training on multiple GPUs
3. Tara (L05) added watermarks to her diffusion outputs. Manav (L06) refused a celebrity voice-clone offer. Both were applying the same principle. What is it?
a) Always charge premium prices for AI-generated content
b) Generative AI requires consent + provenance โ€” synthesising someone's likeness or voice without permission violates Indian DPDPA 2023, IT Rules 2021/2023, and personality rights case law (Anil Kapoor v. Simply Life India). Watermarking and refusing unconsented clones are both expressions of "you do not synthesise or simulate a person without their consent and clear labelling."
c) Open-source models are safer than commercial ones
d) Watermarks reduce model accuracy and should only be added on request
4. Priya (L07) used two-tower with negative sampling. Why is the two-tower architecture (separate user and item encoders) preferred over a single combined model for production recommenders at scale?
a) Two towers always achieve higher accuracy than single models
b) At serving time, you can precompute and FAISS-index every item embedding once; for each query you only encode the user vector and do a fast nearest-neighbour search. A single combined model would require evaluating one network per (user, item) pair โ€” billions of forward passes per query batch, which does not scale.
c) Two towers can be trained on different programming languages
d) FAISS only supports two-tower architectures
5. Rohit (L08) found self-hosting Llama-3-8B with vLLM costs โ‚น40K/month vs โ‚น3.2L for OpenAI at the same load. Devansh (L10) calculated A/B-test power before launch. The shared business principle is:
a) AI features always reduce engineering cost
b) Decisions about AI infrastructure and product launches require quantitative reasoning before commitment โ€” Rohit calculated the crossover point (~5K DAU) where self-hosting wins; Devansh calculated the sample size needed for statistical significance. Without these calculations, both would have shipped on intuition and lost either money or evidence.
c) Self-hosting is always cheaper than using APIs
d) A/B tests should be skipped to ship faster
6. Anjali (L09) used the upper 80% confidence interval for perishable mithai but lower 60% for non-perishable packaging. This illustrates which fundamental principle of decision-making under uncertainty?
a) Always use the median forecast โ€” outliers should be ignored
b) The right quantile of a forecast distribution depends on the asymmetric cost of being wrong in each direction. Stockout of mithai loses customers (high cost) so order optimistically; overstock of packaging ties up cash (lower cost) so order conservatively. Point forecasts ignore this asymmetry; quantile forecasts make it explicit.
c) Confidence intervals are only for academic papers, not production systems
d) 80% is always the right confidence level for inventory decisions
7. Sneha (L04) found UPI fraud was a graph problem. Manav (L06) found voice synthesis is fundamentally a consent problem. Devansh (L10) found AI features are PM problems first, model problems second. The cross-cutting lesson is:
a) Every problem in AI can be solved with a larger neural network
b) Framing the problem correctly matters more than the model architecture chosen. Treating fraud as tabular missed the relational structure; treating voice synthesis as a tech problem missed the consent failure mode; treating AI features as model problems missed the rollout-and-measurement failure mode. Wrong framing ร— good model = wrong answer.
c) AI engineers should specialise in only one domain
d) Indian AI problems require different algorithms than global ones
8. Karthik (L02) reranked retrieved candidates with bge-reranker-v2-m3. Priya (L07) reranked with MMR for diversity. Why is reranking a separate stage from initial retrieval in production systems?
a) Reranking is required by the OpenAI API specification
b) Initial retrieval optimises for recall over a huge candidate pool (billions of items, must be fast); reranking optimises for precision and diversity over a small candidate set (top-100, can be slow). Combining them collapses both budgets โ€” fast retrieval over billions + accurate scoring over hundreds is how production search and recommendations actually scale.
c) Reranking improves model accuracy by 90% in all cases
d) Reranking is only used for English-language results
9. Aishwarya (L01) trained for Marathi students. Tara (L05) made art for her grandmother. Manav (L06) built audiobooks for visually-impaired students. Anjali (L09) helped her family's mithai shop. The shared philosophical principle is:
a) AI projects should target maximum revenue potential first
b) The best AI projects start from a real, named human in your immediate community whose specific problem you understand deeply. Distance-from-the-problem is the leading indicator of irrelevant AI work โ€” every memorable project in this course began with someone the builder personally knew, not a market opportunity from a slide deck.
c) Indian AI must focus on rural problems exclusively
d) AI projects should be entirely funded by personal savings
10. You have completed Class 12 โ€” Applied AI Mastery. Looking at all 11 characters from Aishwarya to Ananya, the one habit they all shared that determined their success was:
a) Owning expensive personal hardware (multi-GPU rigs)
b) Shipping working artefacts publicly. Aishwarya pushed her LoRA adapter to Hugging Face. Tara released her Warli posters with credits. Rohit open-sourced his vLLM Helm chart. Ananya pinned 3 flagships on GitHub. They all turned learning into shipped, dated, public work โ€” which is the single most reliable signal college admissions and AI employers have ever had.
c) Coming from English-medium schools in metro cities
d) Having parents already in the AI industry
โ† Lesson 11: College & Career โ† Back to Class 12 Hub