Mastering Generative AI & LLM Engineering: From Foundations to Applications

Mastering Generative AI & LLM Engineering: From Foundations to Applications

Course Overview:
In today’s AI-driven world, Generative AI and Large Language Models (LLMs) are at the forefront of innovation, powering applications ranging from chatbots to recommendation engines, speech recognition, and intelligent automation. Python is the most widely used programming language in the world of Gen-AI and Data Science owing to its simplicity, versatility, and a vast ecosystem of powerful libraries. Therefore, to make the learners acquainted of these latest high-in-demand skills, we have designed this course that will build a strong foundation in the practical applications of Artificial Intelligence, Natural Language Processing (NLP), Large Language Models (LLMs), LangChain, Vector Databases, and Speech Recognition technologies with a hands-on approach. By completing this course, the learners will not only strengthen their technical skill-set but also gain the ability to build AI-powered applications, positioning themselves for exciting career opportunities in the rapidly evolving field of Generative AI and LLM Engineering.

Course Objectives:
• To introduce participants to core AI concepts, including Natural vs Artificial Intelligence, Machine Learning, Deep Learning, and Generative AI fundamentals.
• To enable learners to work with Python libraries (NumPy, Pandas etc.) for automation, data handling, and API integration (including OpenAI API).
• To introduce participants to Natural Language Processing (NLP) and equip them to build text-processing pipelines including tokenization, sentiment analysis, and custom text classifiers.
• To make learners proficient in working with Large Language Models (LLMs), covering transformer architecture, GPT, BERT, Hugging Face, and LangChain for chatbot development and text generation.
• To equip learners with the knowledge of LangChain framework, LangGraph, and Retrieval Augmented Generation (RAG) for building advanced conversational agents and memory-enabled systems.
• To familiarize learners with Vector Databases (e.g., Pinecone) and their applications in semantic search, recommendation engines, and biomedical research.
• To introduce learners to Speech Recognition and Speech-to-Text systems using traditional ML, Deep Learning, and transformer-based approaches such as Whisper AI.
• To prepare learners for LLM Engineering, including prompt engineering, hosting models vs APIs, cost optimization, scaling strategies, and deploying AI-powered applications with Streamlit.


Batch Details:
Class Timings: Saturday & Sunday (11:30 am-1:30 pm) Start Date: 20th June 2026
Duration: (56 Hours) End Date: 27nd Sep 2026
Mode: Online Certification: iHUB Divyasampark IIT Roorkee
Last Date to Register: 19th June 2026
Course Fee: Rs. 10,000/-
(Amounts inclusive of GST)

Link to Register: https://rzp.io/rzp/MasteringGenAICourse

Contact Person:
Dr. Subrat Kotoky
CTO, Ritvij Bharat Private Limited
Ph.D. in Mechanical Engineering (IIT Guwahati)
rbpl.edu@gmail.com; subrat.kotoky@ritvij.co.in
9085317465/8473874389

Expert Profile:

Mr. Shreyas Shukla
Professional Corporate Trainer & Microsoft Azure Certified Data Engineer
MTech-IIT Kharagpur & BE- The Aeronautical Society of India, New Delhi
4+ years of experience in leading online professional courses for different leading organizations

Has successfully conducted 25+ courses and trained 1500+ learners in the fields of Python Programming, Data Analytics, Machine Learning, Deep Learning, Computer Vision etc. till now.

Certifications:
1. DP-203: Microsoft Certified: Azure Data Engineer Associate
2. DP-900: Microsoft Certified: Azure Data Fundamentals
3. AZ-900: Microsoft Certified: Azure Fundamentals

Our Students Rate This Course

4.5
Trainer

RBPL

Program Fee

Rs 10,000/- (Amount Inclusive of GST)

Available Seats

100

Schedule

Saturday & Sunday (11:30 am-1:30 pm)

Only Few Seats Left

Reviews

Testimonials

Module 1

Module 1

Introduction to AI & Gen AI

·       Building an AI tool

·       Natural vs Artificial Intelligence, Brief history of AI, Weak vs Strong AI

·       AI vs Data Science vs Machine Learning vs Deep Learning

·       Data: Collection, Labelled vs Unlabeled, Structured vs Unstructured, Metadata

·       Overview of Machine Learning (Supervised, Unsupervised and Reinforcement Learning)

·       Overview of Deep Learning, Robotics, Computer Vision, Traditional ML, Generative AI

·       Generative AI: Introducing ChatGPT, Natural Language, Processing (NLP)

·       Large Language Models (LLMs): Training, N-Grams, RNNs, Transformers

·       Building LLMs: Prompt Engineering, Fine Tuning, RAG

·       Foundation Models vs Private Models

·       Inconsistency and Hallucination in Gen AI, Budgeting, Latency, Running out of Data

·       AI stack: Python, Working with APIs, Vector Databases, Hugging Face, LangChain

Module 2

Module 2

Working with Python Libraries

Introduction to Numerical Python (NumPy Library)

Introduction and working with Pandas

Series, DataFrames, working with various data types, Group-By operation

Selecting a single column, important series methods

Indexing & Sorting; loc & iloc with series; Inspecting DataFrames, filtering with conditional operators

Regex: Anchors and Groupings, Range Expressions, Non-Greedy Matching, Substitutions.


Module 3

Module 3

Natural Language Processing (NLP)

Introduction, Supervised vs Unsupervised NLP

Data preparation, Handling Stop words, Regular Expressions 

Tokenization, Stemming, Lemmatization, N-grams

Text tagging, Parts of Speech (POS) tagging, Name Entity Recognition (NER)

Sentiment Analysis, Rule-based Sentiment Analysis, Pre-trained Transformers

Numerical Representation of text, Bag of Words, TF-IDF

Topic Modelling, Latent Dirichlet Allocation (LDA), Latent Semantic Analysis (LSA)

Custom text classifier using various ML models     


Module 4

Module 4

Large Language Models (LLMs)

Understanding LLMs, General Purpose Models, Pre-training and Fine Tuning

Deep- Learning recap, Transformer Architecture, Input Embeddings

Multi-headed Attention, Feed-Forward Layer, Masked multihead Attention

Understanding GPT, Open AI API, Generating Text, Customizing GPT output

Key word text summarization, Coding a simple chatbot using LangChain in Python

Hugging Face, Transformer Pipeline, Pre-trained tokenizers, Special tokens

Q&A models : BERT architecture, Tokenizer, Embeddings, Calculating response

Creating QA bot, BERT, RoBERTa, DistilBERT, GPT vs BERT vs XLNET, XLNET Embeddings and Fine Tuning


Module 5

Module 5

LangChain

LangChain Introduction, Tokens and Models, Setting up Environment, Open API key

System, User and Assistant roles, Creating chatbot, Temperature, Max Tokens, Streaming

LangChain Framework, ChatOpenAI, System, AI and Human messages

Prompt Templates and Prompt Values, Few-shot chat message prompt templates

String output parser, Comma-separated list output parser, Datetime output parser

Piping a prompt, model and parser, Batching, Streaming, Runnable Sequence class

Piping chains and Runnable Passthrough, Runnable Parallel, Runnable Lambda

Retrieval Augmented Generation (RAG): Document Loading, Splitting and Embedding

Document storing, retrieval and generation, loading with PyPDFLoader, Docx2txtLoader

Splitting with character text splitter, markdown header text splitter

Text embedding with OpenAI, Chroma vectorstore (Inspecting and Managing docs)

Retrieval: Similarity search, Maximal Marginal Relevance (MMR) search, Vectorstore-backed retriever

Generation: Stuffing documents and Generating a response 


Module 6

Module 6

LangGraph

States, nodes and Edges, First graph: Importing relevant classes, Building graph

Conditional edges: Defining nodes, routing function, Building the graph

Annotated construct and reducer functions, MessagesState, RemoveMessages

Checkpointers and threads, Short-term memory with the InMemorySaver class

The StateSnapshot class, Long-term memory with SQLite    


Module 7

Module 7

Vector Databases

·       Database comparison: SQL, NoSQL and Vector, Understanding Vector databases,

·       Vector space: Introduction, Distance Metrics, Vector Embeddings

·       Vector database, comparison, Pinecone registration, walkthrough and creating index

·       Pinecone with Python: Connection, Pinecone Index, Upsetting data and using embedding

·       Vector database for Recommendation Engines, Biomedical research, Semantic search

Module 8

Module 8

Speech Recognition Module

Development and Evolution, Formants, Harmonics and Phonemes

Sound and Sound waves: Fundamentals and properties

Sample Rate, bit depth, bit rate, Audio signal processing for Machine Learning and AI

Audio Features: Time-domain, Frequency-domain, time-frequency-domain, Fourier transform

Acoustic and language modeling, Hidden Markov Models (HMMs)

Traditional Neural Networks: CNNs, RNNs and LSTMs

Advanced speech recognition systems: Transformers

Building a Speech Recognition Model

Audio file formats for speech recognition, Importing audio files in python

Google Web Speech API, Evaluation metrics: WER and CER

Dealing with background, Noise Creating spectrogram

Whisper AI: Transformer-based speech-to-text, Transcribing multiple audio files

Reversing the process: AI-powered text-to-speech


Module 9

Module 9

LLM Engineering

Hosting an LLM vs Using an API, Open-Source vs Closed-source, Tokens

Pricing: Hosting an LLM vs Pay-by-Token, Initial Prompt Development

Database Design and Schema Development, Activity Diagram

OpenAI Playground, Optimizing Temperature, Top P for Different Use Cases

Prompt Engineering for Software Lifecycle

Streamlit: Introduction, Pros and Cons, Text Methods, Chat Elements, Session State

Initializing an OpenAI Client, Implementing the Chat Functionality, Building the Setup Page

Enhancing Chatbot Interaction with Session State, Feedback Functionality, Deployment

Application Structure, Prompt Structure, Hallucinations, Prompt Injection

Counting Tokens, Cost reduction and Scaling


NEWS & UPDATES

Career Transitions

55% Average Salary Hike

$1,27,000 Highest Salary

800+ Career Transitions

300+ Hiring Partners

Who Can Apply for the Course?

  • Basic knowledge of Python programming is required for this course.
  • This course can be taken up by any undergraduate/postgraduate student of Basic & Applied Sciences, Engineering, and Computer Applications and also by Research Scholars/Faculties/Working Professionals who want to upskill themselves
  • Participants need to have a laptop/PC (with a minimum of 4 GB RAM, 100 GB HDD, Intel i3processor) and proper internet/Wi-Fi connection.
  • Optional API expenses to be borne by the learners.
Who can apply

About Program

At iHUB DivyaSampark, we are driven by the belief that young, innovative minds have immense potential to transform the world. Our core mission is to develop highly knowledgeable human resources with top-order, industry-relevant skills.
Whether you are looking for a career transition, a significant salary hike, or to master specialized knowledge, our programs provide the mentorship and practical exposure needed to achieve successful career outcomes and help you secure roles with our network of 300+ hiring partners.

Key Highlights

Industry-Relevant High-In-Demand Skills in Generative AI.
Hands-on based learning experience through practical projects.
Globally accepted certification from iHUB Divyasampark IIT Roorkee
Full-time access to recorded lectures/PPTs/PDFs/Study Materials.
Session on Resume Preparation/Interview Preparation.

Our Alumni Work At

Hiring Partners

What is included in this course?

  • Non-biased career guidance
  • Counselling based on your skills and preference
  • No repetitive calls, only as per convenience
  • Rigorous curriculum designed by industry experts
  • Complete this program while you work

I'm Interested in This Program