On June 5, 2026, the residency focused on Prompt Engineering—the science of writing clear instructions to align AI outputs. We studied the RTACF (Role, Task, Audience, Context, Format) framework. Role prompting narrows down the search space in the model's parameters, ensuring appropriate tone and vocabulary. We compared unstructured prompts to highly structured ones, demonstrating how specificity drives output quality.
1. Prompt Engineering: Directing Generative Logic
We focused on role prompting and structured instructions. Organising queries using Role, Task, Audience, Context, and Format templates restricts the model's statistical parameters, shifting outputs from generic wiki descriptions to targeted execution payloads.
2. LLM Hyperparameters: Temperature and Token Mechanics
We analyzed core inference parameters: Temperature and Tokens. Temperature controls the randomness of next-token selection: low temperature (e.g., 0.2) results in deterministic, factual, and consistent responses (ideal for code and math), while high temperature (e.g., 1.0) increases creativity. Text is processed as Tokens—sub-word chunks determined by BPE tokenizers. We discussed context window constraints and token-based API pricing models.
3. Advanced Customization: Fine-Tuning & RAG Pipelines
Rashmi introduced advanced AI architecture patterns: Pre-training (costly training from scratch), Fine-Tuning (adapting weights on instruct datasets), and Vector Embeddings (semantic numerical projections). We then explored RAG (Retrieval-Augmented Generation), where document chunks are fetched from vector databases (like Qdrant) via Cosine Similarity to enrich prompt contexts dynamically, eliminating hallucinations and knowledge-cutoff limits.