
Meet your instructor, Dr. Reza Moradinezhad—AI scientist and educator—here to guide you through building smart analytics with Generative AI. In this video, you'll get a clear overview of the course structure, key projects, and what skills you’ll build.
A walk-through of core Generative AI concepts, its distinction from discriminative AI, and its transformative role in data science workflows. You’ll learn how GenAI enhances data creation, insight generation, automated report generation, and natural language querying. By the end, you’ll understand its high-level value and be ready to start applying it for real-world data problems.
This video demonstrates how to visualize sentiment analysis results with matplotlib and seaborn and prepares the data for advanced generative AI tasks by selecting the top positive and negative reviews.
This video focuses on the final data preparation before generating the report. We will select a representative sample of reviews and initialize the summarization model, preparing all our data and tools for the core generative AI task.
This video brings all the prepared data together to generate a professional analytical report. We will construct a comprehensive prompt using all our insights and then use the Hugging Face summarization pipeline to command a Large Language Model to write a concise and actionable summary.
This video focuses on the final steps of our analytical pipeline: outputting and presenting the AI-generated report. We'll demonstrate how to display the final report within the notebook and save it to a file, ensuring our insights are ready to be shared with stakeholders.
This video is a high-level review of the entire Generative AI pipeline we have built. We'll recap how each lesson's code contributed to the final project, from initial data loading and synthetic enrichment to advanced sentiment analysis and automated report generation, consolidating our understanding of the project blueprint.
This video moves beyond a functional pipeline to make it production-ready. We'll explore strategies for optimizing LLM calls, such as batching to improve efficiency, and demonstrate how to use Jupyter’s %%time command to perform initial performance checks and identify bottlenecks.
This final video moves beyond technical skills to focus on the crucial topic of responsible AI. We'll discuss the importance of continuous qualitative review, addressing bias and privacy, and ensuring transparency in AI-generated content. The video concludes with a final review of the project and a look ahead to the future of Generative AI.
This video continues our discussion on optimization by focusing on the quality and scalability of your pipeline. We'll demonstrate strategies for refining AI output through prompt iteration and introduce modularity as a best practice for building a maintainable, production-ready solution.
Every day, businesses generate mountains of unstructured text data—from customer reviews to support tickets—and most of it goes underused. What if you could turn all that raw text into structured insights and automated reports in minutes, using Generative AI?
Welcome to Generative AI for Data Science: Build Smart Analytics with GenAI! I’m Prof. Reza (Dr. Reza Moradinezhad), an AI scientist and educator with over a decade of experience in machine learning, computer science, and human computer interaction—having collaborated with teams at MIT Media Lab, CMU, and Harvard.
In this course, I’ll guide you through a hands-on, project-based course where you’ll build an end-to-end Generative AI data pipeline using Python, Hugging Face Transformers, and real-world datasets.
You’ll learn how to analyze movie reviews, generate synthetic data, extract insights with LLMs, and automate analytical reporting—step by step, with practical code examples and guided walkthroughs. Unlike typical GenAI courses that stay theoretical, this one is all about real application.
We’ll work with the Rotten Tomatoes dataset, build in Jupyter Notebooks, and apply pre-trained models for sentiment analysis, text summarization, and report generation.
Every step you learn is transferable to your own data science projects, across domains. By the end, you won’t just understand how to use GenAI—you’ll have built a fully functional analytics tool powered by it. Whether you're a data scientist, AI engineer, or a curious analyst, you’ll leave with the skills and confidence to turn any text-heavy dataset into automated insights. Let’s dive in—and start transforming how we do data science, with Generative AI at the core.
1 Main Outcome:
By the end of this course, you will be able to design and implement a Python-based Generative AI pipeline for analyzing unstructured text data, generating insights, and creating automated analytical reports.
TOOLS USED
In this course, all the tools work in concert to create a comprehensive generative AI workflow for:
You'll leverage Python and its powerful Pandas library as your foundational environment for data handling, from initial loading and exploration of review data to the creation of simple, illustrative synthetic tabular data that enriches your analytical context. Hugging Face Transformers and Datasets will serve as your direct gateway to cutting-edge AI, providing seamless access to pre-trained models for automated sentiment analysis, text summarization, and key insight extraction, while also efficiently loading the large Rotten Tomatoes dataset. All these components will come together within Jupyter Notebooks, providing an interactive and flexible environment where you'll build, visualize, and refine your end-to-end Generative AI analytics pipeline step by step.
Python + Pandas – Core tools for data loading, exploration, manipulation, and generating simple synthetic tabular data (e.g., simulated movie attributes or tiers).
Hugging Face Transformers + Datasets – Provides access to pre-trained models for sentiment analysis, summarization, and keyword extraction, as well as efficient loading of the Rotten Tomatoes dataset.
Jupyter Notebooks – The interactive coding environment used to integrate all tools, visualize intermediate results, and build the complete GenAI analytics pipeline.