What you'll learn

Build agentic AI browser agents that can browse websites, click buttons, type into forms, extract information, and complete multi-step web tasks.
Use Python and Playwright to control a real browser and automate common web workflows.
Connect LLMs to browser automation so an AI agent can understand user instructions, plan actions, and make decisions during a workflow.
Extract structured data from websites and save results into clean formats such as tables and CSV files.
Build practical approval-based workflows where AI assists with form filling but keeps a human in the loop before final submission.
Create a Streamlit user interface for uploading files, running browser agents, reviewing results, and tracking status.
Understand safety, ethics, and limitations of browser AI agents, including CAPTCHA handling, prompt injection, and responsible automation.
Deploy an agentic browser automation project using Docker and AWS services such as S3, SQS, Lambda, DynamoDB, and Lightsail Containers.

Course content

7 sections • 16 lectures • 4h 1m total length

Introduction2:52

How AI Browser Agents Actually Work2:45
Learn how AI browser agents work behind the scenes using the agent loop: plan, act, observe, and repeat. In this lecture, we break down how an LLM connects with browser automation tools like Playwright to read web pages, click buttons, type into forms, extract data, and complete multi-step browser tasks.
Tools of the Trade for Building AI Browser Agents4:10
In this lecture, we introduce the core tools used throughout the course, including Python, Playwright, LLMs, Streamlit, Docker, and AWS. You will understand why each tool matters and how they work together to build real-world agentic AI browser automation systems.
Why Browser AI Agents Matter1:44
Discover why AI browser agents are becoming important for developers, automation engineers, and AI builders. This lecture explains practical use cases such as web research, product comparison, data extraction, form filling, workflow automation, and human-in-the-loop AI systems.

DOM API and Selectors 1019:58
Learn the basics of the DOM API and how browser automation tools understand web pages. In this lecture, we cover HTML structure, elements, buttons, inputs, forms, links, and selectors so you can confidently locate and interact with page elements using Playwright.
Project Setup and First Browser Launch15:57
Set up the project from scratch and launch your first automated browser using Python and Playwright. This lecture walks through the initial environment setup, dependency installation, browser launch, and the difference between headless and headful browser automation.
Reading and Extracting Data from Web Pages13:54
Learn how to read content from web pages and extract useful information using Playwright. In this lecture, we cover how to capture headings, paragraphs, product details, links, prices, ratings, and other structured data from websites.

Three Levels of Browser Agents10:08
Understand the three levels of browser agents: basic scripted automation, LLM-assisted browser workflows, and fully autonomous browser agents. This lecture helps you clearly see the difference between normal automation and true agentic AI browser behavior.
Build an AI Shopping Research Agent35:04
Build a practical AI shopping research agent that can search the web, analyze product results, and extract useful shopping information. This lecture shows how browser automation and LLM reasoning can work together to compare products, prices, ratings, and summaries.
Building a Fully Autonomous Browser Agent Loop28:30
Learn how to build a fully autonomous browser agent loop where the AI can plan, take browser actions, observe results, and decide the next step. This lecture introduces the core architecture behind agentic AI systems that can operate across multiple steps with guardrails.

Building the Human-in-the-Loop Browser Agent35:33
Build a safer AI browser agent by adding human review and approval before final actions are submitted. This lecture explains why human-in-the-loop design is important for browser automation, especially for forms, approvals, sensitive workflows, and real-world business use cases.
Uploading Files into the Agent System16:36
Learn how to upload CSV files into the AI browser agent system so multiple requests can be processed in a structured workflow. This lecture introduces file upload handling, batch processing concepts, and how uploaded data moves into the automation pipeline.
Processing Uploaded Files into Approval Requests21:35
Convert uploaded CSV data into approval requests that can be reviewed, tracked, and processed by the agent system. This lecture covers how to read uploaded records, create structured requests, manage statuses, and prepare the system for human approval workflows.

Deploying the Backend to AWS21:02
Deploy the backend of the AI browser agent system using AWS services such as S3, SQS, Lambda, and DynamoDB. This lecture explains the cloud architecture and shows how uploaded files can trigger backend processing in a serverless workflow.
Deploying the Streamlit UI to AWS14:59
Deploy the Streamlit user interface for the AI browser agent system using Docker and AWS Lightsail Containers. This lecture shows how to package the Playwright-based application, connect it with the backend, and run the browser agent UI in the cloud.

Requirements

No prior experience with browser automation is required
No prior experience with Playwright is required
No prior experience with AI agents is required
No advanced AI or machine learning background is required
Basic Python knowledge is helpful, but we will explain the code step by step
Basic understanding of websites, buttons, forms, and web pages is helpful
A code editor such as Visual Studio Code is recommended
An AWS account is optional and only needed for the deployment section
Everything will be built from scratch, step by step, including Python setup, Playwright basics, AI agent logic, UI, Docker, and deployment
An OpenAI API key or access to another LLM provider is recommended for the AI agent examples

Description

AI browser agents are one of the most practical use cases of agentic AI. Instead of only chatting with an AI model, you will learn how to build agents that can open a browser, read web pages, click buttons, type into forms, extract data, and complete multi-step workflows.

In this course, we will build everything from scratch using Python, Playwright, LLMs, Streamlit, Docker, and AWS. You will begin with the fundamentals of browser automation, including how web pages are structured, how DOM selectors work, how to interact with buttons and forms, and how to extract useful information from websites.

Then we will add LLM intelligence so the agent can understand user instructions, plan browser actions, make decisions, and continue working through an automation flow. You will build practical projects including an AI shopping research agent, a fully autonomous browser agent loop, and a human-in-the-loop approval workflow.

You will also learn how to upload files, process records, create approval requests, track statuses, handle errors, and build a simple Streamlit interface for managing the agent system. In the deployment section, we will move the project to the cloud using AWS services such as S3, SQS, Lambda, DynamoDB, Docker, and Lightsail Containers.

We will also cover important safety and ethics topics such as CAPTCHA handling, prompt injection, terms of service, rate limits, and responsible browser automation.

No prior experience with Playwright, browser agents, Streamlit, Docker, or AWS is required. We will build the project step by step from the ground up. By the end, you will have a complete project you can show in your portfolio, extend for your own workflows, and use as a foundation for building more advanced AI automation systems.

Who this course is for:

Beginner Python developers who want to build hands-on AI automation projects
Students who want to understand how AI agents can browse websites, extract data, and complete tasks
Automation engineers who want to add LLM-powered decision making to browser workflows
AI enthusiasts who want to move beyond simple chatbots and build agents that can take actions
Freelancers and builders who want to create practical automation tools for web tasks
Professionals who want to understand how agentic AI can be used for real-world business workflows
Learners who want a step-by-step project-based course instead of only theory

What you'll learn

Explore related topics

Course content

Introduction1 lecture • 3min

Foundations of AI Browser Agents3 lectures • 9min

Browser Automation Basics with Playwright3 lectures • 40min

Building Practical AI Browser Agents3 lectures • 1hr 14min

Human-in-the-Loop Agent Workflows3 lectures • 1hr 14min

Cloud Deployment2 lectures • 36min

Ethics, Safety, and Responsible Automation1 lecture • 7min

Requirements

Description

Who this course is for: