Hero Image

Generative AI & Large Language Model Engineering using Python

Course Overview:
In today’s AI-driven world, Generative AI and Large Language Models (LLMs) are at the forefront of innovation, powering applications ranging from chatbots to recommendation engines, speech recognition, and intelligent automation. Python is the most widely used programming language in the world of Gen-AI and Data Science owing to its simplicity, versatility, and a vast ecosystem of powerful libraries. Therefore, to make the learners acquainted of these latest high-in-demand skills, we have designed this course that will build a strong foundation in Python programming and its practical applications in Artificial Intelligence, Natural Language Processing (NLP), Large Language Models (LLMs), LangChain, Vector Databases, and Speech Recognition technologies with a hands-on approach. By completing this course, the learners will not only strengthen their technical skill-set but also gain the ability to build AI-powered applications, positioning themselves for exciting career opportunities in the rapidly evolving field of Generative AI and LLM Engineering.

Course Objectives:
• To make the participants learn Python programming from scratch with a focus on its applications in the fields of Artificial Intelligence (AI) & Generative AI.
• To introduce participants to core AI concepts, including Natural vs Artificial Intelligence, Machine Learning, Deep Learning, and Generative AI fundamentals.
• To provide a strong foundation in Python programming concepts such as objects, functions, methods, error handling, regex, and object-oriented programming with practical illustrations.
• To enable learners to work with Python libraries (OS, Requests, Pandas) for automation, data handling, and API integration (including OpenAI API).
• To introduce participants to Natural Language Processing (NLP) and equip them to build text-processing pipelines including tokenization, sentiment analysis, and custom text classifiers.
• To make learners proficient in working with Large Language Models (LLMs), covering transformer architecture, GPT, BERT, Hugging Face, and LangChain for chatbot development and text generation.
• To equip learners with the knowledge of LangChain framework, LangGraph, and Retrieval Augmented Generation (RAG) for building advanced conversational agents and memory-enabled systems.
• To familiarize learners with Vector Databases (e.g., Pinecone) and their applications in semantic search, recommendation engines, and biomedical research.
• To introduce learners to Speech Recognition and Speech-to-Text systems using traditional ML, Deep Learning, and transformer-based approaches such as Whisper AI.
• To prepare learners for LLM Engineering, including prompt engineering, hosting models vs APIs, cost optimization, scaling strategies, and deploying AI-powered applications with Streamlit.


Batch Details:
Class Timings: Saturday & Sunday (4 pm-6 pm) Start Date: 11th Oct 2025
Duration: (74 Hours) End Date: 28th Feb 2026
Mode: Online Certification: iHUB Divyasampark IIT Roorkee
Last Date to Register: 10th Oct 2025
Course Fee: Students/PhD Scholars/RA/JRF/SRF/Postdoc fellows: Rs. 13,000/-
Faculty/Working Professional: Rs. 15,000/-
(Amounts inclusive of GST)

Link to Register: https://rzp.io/rzp/GenerativeAICourse

Contact Person:
Dr. Subrat Kotoky
CTO, Ritvij Bharat Private Limited
Ph.D. in Mechanical Engineering (IIT Guwahati)
rbpl.edu@gmail.com; subrat.kotoky@ritvij.co.in
9085317465/8473874389

Expert Profile:

Mr. Shreyas Shukla
Professional Corporate Trainer & Microsoft Azure Certified Data Engineer
MTech-IIT Kharagpur & BE- The Aeronautical Society of India, New Delhi
4+ years of experience in leading online professional courses for different leading organizations

Has successfully conducted 25+ courses and trained 1500+ learners in the fields of Python Programming, Data Analytics, Machine Learning, Deep Learning, Computer Vision etc. till now.

Certifications:
1. DP-203: Microsoft Certified: Azure Data Engineer Associate
2. DP-900: Microsoft Certified: Azure Data Fundamentals
3. AZ-900: Microsoft Certified: Azure Fundamentals

Our Students Rate This Course

4.5
Program Fee

Rs 13,000/- & Rs. 15,000

Available Seats

100

Schedule

Saturday & Sunday (4 pm-6 pm)

Only Few Seats Left

Reviews

Testimonials

Module 1

Introduction to AI & Gen AI

·       Building an AI tool

·       Natural vs Artificial Intelligence, Brief history of AI, Weak vs Strong AI

·       AI vs Data Science vs Machine Learning vs Deep Learning

·       Data: Collection, Labelled vs Unlabeled, Structured vs Unstructured, Metadata

·       Overview of Machine Learning (Supervised, Unsupervised and Reinforcement Learning)

·       Overview of Deep Learning, Robotics, Computer Vision, Traditional ML, Generative AI

·       Generative AI: Introducing ChatGPT, Natural Language, Processing (NLP)

·       Large Language Models (LLMs): Training, N-Grams, RNNs, Transformers

·       Building LLMs: Prompt Engineering, Fine Tuning, RAG

·       Foundation Models vs Private Models

·       Inconsistency and Hallucination in Gen AI, Budgeting, Latency, Running out of Data

·       AI stack: Python, Working with APIs, Vector Databases, Hugging Face, LangChain

Module 2

Fundamentals of Python Programming

  • Basics of Python Language
  • Python objects with details of Numbers/Variables/Strings/Lists/Dictionaries/Tuples etc. 
  • Variable Assignment, Indexing & Slicing
  • Comparison & Logical operators
  • Range, List Comprehension, Functions, Lambda expressions.
  • Scope, with statement, Working with various Files types, Error Handling
  • Regex: Anchors and Groupings, Range Expressions, Non-Greedy Matching, Substitutions.
  • Object Oriented Programming, -class, Attributes, Inheritance, Polymorphism

Module 3

Working with Python Libraries: OS, Requests, and Pandas

·       Understanding Python Module: OS module, Import Techniques and Best Practices

·       Requests and HTTPZ Libraries: Getting Started, Handling Errors and Managing Authentication and Headers (Open AI API)

·       Introduction and working with Pandas

·       Series, DataFrames, working with various data types, Group-By operation

·       Selecting a single column, important series methods

·       Indexing & Sorting; loc & iloc with series; Inspecting DataFrames, filtering with conditional operators

Module 4

Natural Language Processing (NLP)

·       Introduction, Supervised vs Unsupervised NLP

·       Data preparation, Handling Stop words, Regular Expressions

·       Tokenization, Stemming, Lemmatization, N-grams

·       Text tagging, Parts of Speech (POS) tagging, Name Entity Recognition (NER)

·       Sentiment Analysis, Rule-based Sentiment Analysis, Pre-trained Transformers

·       Numerical Representation of text, Bag of Words, TF-IDF

·       Topic Modelling, Latent Dirichlet Allocation (LDA), Latent Semantic Analysis (LSA)

·       Custom text classifier using various ML models     

    

Module 5

Large Language Models (LLMs)

·       Understanding LLMs, General Purpose Models, Pre-training and Fine Tuning

·       Deep- Learning recap, Transformer Architecture, Input Embeddings

·       Multi-headed Attention, Feed-Forward Layer, Masked multihaed Attention

·       Understanding GPT, Open AI API, Generating Text, Customizing GPT output

·       Key word text summarization, Coding a simple chatbot using LangChain in Python

·       Hugging Face, Transformer Pipeline, Pre-trained tokenizers, Special tokens

·       Q&A models : BERT architecture, Tokenizer, Embeddings, Calculating response

·       Creating QA bot, BERT, RoBERTa, DistilBERT, GPT vs BERT vs XLNET, XLNET Embeddings and Fine Tuning

Module 6

LangChain

·       LangChain Introduction, Tokens and Models, Setting up Environment, Open API key

·       System, User and Assistant roles, Creating chatbot, Temperature, Max Tokens, Streaming

·       LangChain Framework, ChatOpenAI, System, AI and Human messages

·       Prompt Templates and Prompt Values, Few-shot chat message prompt templates

·       String output parser, Comma-separated list output parser, Datetime output parser

·       Piping a prompt, model and parser, Batching, Streaming, Runnable Sequence class

·       Piping chains and Runnable Passthrough, Runnable Parallel, Runnable Lambda

·       Retrieval Augmented Generation (RAG): Document Loading, Splitting and Embedding

·       Document storing, retrieval and generation, loading with PyPDFLoader, Docx2txtLoader

·       Splitting with character text splitter, markdown header text splitter

·       Text embedding with OpenAI, Chroma vectorstore (Inspecting and Managing docs)

·       Retrieval: Similarity search, Maximal Marginal Relevance (MMR) search, Vectorstore-backed retriever

·       Generation: Stuffing documents and Generating a response 

Module 7

LangGraph

·       States, nodes and Edges, First graph: Importing relevant classes, Building graph

·       Conditional edges: Defining nodes, routing function, Building the graph

·       Annotated construct and reducer functions, MessagesState, RemoveMessages

·       Checkpointers and threads, Short-term memory with the InMemorySaver class

·       The StateSnapshot class, Long-term memory with SQLite    

  

Module 8

Vector Databases

·       Database comparison: SQL, NoSQL and Vector, Understanding Vector databases,

·       Vector space: Introduction, Distance Metrics, Vector Embeddings

·       Vector database, comparison, Pinecone registration, walkthrough and creating index

·       Pinecone with Python: Connection, Pinecone Index, Upsetting data and using embedding

·       Vector database for Recommendation Engines, Biomedical research, Semantic search

Module 9

Speech Recognition Module

·       Development and Evolution, Formants, Harmonics and Phonemes

·       Sound and Sound waves: Fundamentals and properties

·       Sample Rate, bit depth, bit rate, Audio signal processing for Machine Learning and AI

·       Audio Features: Time-domain, Frequency-domain, time-frequency-domain, Fourier transform

·       Acoustic and language modeling, Hidden Markov Models (HMMs)

·       Traditional Neural Networks: CNNs, RNNs and LSTMs

·       Advanced speech recognition systems: Transformers

·       Building a Speech Recognition Model

·       Audio file formats for speech recognition, Importing audio files in python

·       Google Web Speech API, Evaluation metrics: WER and CER

·       Dealing with background, Noise Creating spectrogram

·       Whisper AI: Transformer-based speech-to-text, Transcribing multiple audio files 


Reversing the process: AI-powered text-to-speech




Module 10

LLM Engineering

·       Hosting an LLM vs Using an API, Open-Source vs Closed-source, Tokens

·       Pricing: Hosting an LLM vs Pay-by-Token, Initial Prompt Development

·       Database Design and Schema Development, Activity Diagram

·       OpenAI Playground, Optimizing Temperature, Top P for Different Use Cases

·       Prompt Engineering for Software Lifecycle

·       Streamlit: Introduction, Pros and Cons, Text Methods, Chat Elements, Session State

·       Initializing an OpenAI Client, Implementing the Chat Functionality, Building the Setup Page

·       Enhancing Chatbot Interaction with Session State, Feedback Functionality, Deployment

·       Application Structure, Prompt Structure, Hallucinations, Prompt Injection

  • Counting Tokens, Cost reduction and Scaling

NEWS & UPDATES

Career Transitions

55% Average Salary Hike

$1,27,000 Highest Salary

800+ Career Transitions

300+ Hiring Partners

Who Can Apply for the Course?

  • Basic knowledge of Python is useful but NOT Mandatory. We’ll start from scratch.
  • This course can be taken up by any undergraduate/postgraduate student of Basic & Applied Sciences, Engineering, and Computer Applications and also by Research Scholars/Faculties/Working Professionals who want to upskill themselves
  • Participants need to have a laptop/PC (with a minimum of 4 GB RAM, 100 GB HDD, Intel i3processor) and proper internet/Wi-Fi connection.
  • Optional API expenses to be borne by the learners.
Who can aaply

About Program

This program by iHub Divya Sampark, IIT Roorkee helps you gain the data analytics, machine learning, and artificial intelligence skills sought after by top employers.

Key Highlights

Industry-Relevant High-In-Demand Skills in Generative AI.
Hands-on based learning experience through practical projects.
Globally accepted certification from iHUB Divyasampark IIT Roorkee
Full-time access to recorded lectures/PPTs/PDFs/Study Materials.
Session on Resume Preparation/Interview Preparation.
750+ Hrs of Live Online Immersive Learning Journey
Exploring Cutting-edge Applications of Artificial Intelligence in VLSI – A Key Focus Area
70% Hands On , 30% Theory
Master Classes from IIT Faculty and Industry Experts
2 Days Campus Immersion
Business Communication (Aptitude & Soft Skill Training)
Career Support Services
24/7 EDA Tools Access (Cadence, Synopsis, Siemens )
24/7 Access to e-Learn cloud-based platform
Student Success Support

Our Alumni Work At

Master Client Desktop

What is included in this course?

  • Non-biased career guidance
  • Counselling based on your skills and preference
  • No repetitive calls, only as per convenience
  • Rigorous curriculum designed by industry experts
  • Complete this program while you work

I’m Interested in This Program