Non Functional Testing for LLM, Chatbots and AI Models

Name: Non Functional Testing for LLM, Chatbots and AI Models
Rating: 4.1 (20 reviews)

Learn essential AI testing techniques to ensure reliable, ethical, and human-like performance of advanced AI systems

Created byDan Andrei Bucureanu

Last updated 9/2025

English

What you'll learn

Understand how AI is working
Understand basic software testing
Understand how AI is tested compared to traditional software
Gain knowledge on testing for ethics
Demo on testing Chat GPT with automated Tools
Understand Adversarial Testing techniques
Understand how to test for a human like conversation
Create a framework for testing bias, toxicity and hate with PerspectiveAPI

Course content

11 sections • 73 lectures • 6h 47m total length

Introduction4:27
About your instructor2:00
History of AI7:16
Do not skip - AI for Testing the DOs and DONTs21:39
Understand what are the limitation, what you can and cannot do in all phases of a software testing project
What you will learn in this material2:04
Understand what you will learn and what you will not learn in this material - do no skip

What makes up AI3:47
Get a brief introduction on what are the main components of AI
Where do LLMs fit into AI1:50
Understand how LLMS and foundation models are part of the AI Science field.
Introduction to Natural Language Processing4:56
How NLP actually makes the AI more human.
Introduction to Machine Learning (ML)4:08
Understand what is machine learning and how algorithms make the core of AI
Machine Learning - Supervised ML4:01
Understand the basics concepts around supervised Machine Learning
Machine Learning - Unsupervised ML5:03
Gain basic understanding of Unsupervised ML and Clustering
Machine Learning - Reinforced ML3:20
In this lecture you will get a basic idea of how Reinforced Learning is working together with ML Algorithms
Neural Networks and Deep Learning5:26
Importance of Training Data4:09
In this material you will understand how critical good quality training data actually is.

Perspective Service - Bias Detection Service4:47
Understand what PerspectiveAPI can offer in matter in toxicity and censorship;
Get a Perspective API Key6:26
Step by step demo how to obtain a ChatGPT API Key
Demo - VS Code - Call Perspective API3:24
See a live demo how we will implement an API call to Perspective API
Github Code: calculator/src/test/Perspecttive_api at master · danteachqe/calculator · GitHub
Demo - Python - Test AI Response against Perspective APIs5:06
Understand how Perspective API And ChatGPT work together to create an automated testing framework for Toxic content.

Adversarial attacks for LLMS and Red Team5:54
In this lecture you will learn the basics of adversial testing as well as the red and blue security teams.
Prompt Injection Attack18:56
Learn how prompt injection works by understanding, direct and indirect prompt injections techniques as well as some examples and how to defend against them.
FUZZ Testing11:02
Fuzz testing, also known as fuzzing, is a software testing technique used to discover security vulnerabilities and bugs by inputting large amounts of random or semi-random data (often referred to as "fuzz") into a program or system.
Denial of Service Attacks12:50
Understand DOS attacks such as: Prompt flood, API Exploitation, Context Window Exploit and other
Adversarial Attack Examples6:22
Understand what is poisoning attack, how to mitigate it and some examples.

Links to material reference:
https://www.businessinsider.com/tesla-hackers-steer-into-oncoming-traffic-with-stickers-on-the-road-2019-4
https://bair.berkeley.edu/blog/2019/08/13/memorization/
https://chatgpt.com/share/456d092b-fb4e-4979-bea1-76d8d904031f
https://www.researchgate.net/figure/Adversarial-examples-using-PGD-with-and-with-noise-constraint-of-on_fig1_350132115
Poisoning attack8:34
A poison attack involves injecting malicious data into a training dataset, causing the model to learn incorrect or harmful behaviors. The attacker subtly alters training inputs so that the model produces incorrect outputs during real-world use.
Privacy Leakage Testing10:26
Evasion Attacks10:24

What is non functional Testing for LLMs7:04
Understand the lesser known aspects of non-functional Testing for Conversational AI
Disclaimer on Non Functional Testing1:53
Non functional Testing AI Models | LLMs - Ethical Alignment9:23
Description: How well does the model align with ethical guidelines and avoid generating harmful, biased, or inappropriate content?
Testing Aspects:
Does the model refrain from generating offensive or dangerous outputs (e.g., hate speech, misinformation)?
How well does it handle sensitive topics without being exploitative or biased?
Non functional Testing AI Models | LLMs - Explainability10:00
Description: The ability of the model to provide understandable reasoning for its outputs.
Testing Aspects:
Can the model explain why it provided a certain answer or the process it used to arrive at a decision?
Is there transparency in how it weighs different inputs?
Non functional Testing AI Models | LLMs - User Interaction Robustness10:15
Description: How well the model adapts to different types of user interactions, including ambiguous, contradictory, or incomplete queries.
Testing Aspects:
Can the model maintain coherent conversations across multiple queries or threads?
How does it handle user queries that contain slang, mixed languages, or unusual input formats?
Non functional Testing AI Models | LLMs - Context Preservation8:58
Description: The ability to retain and appropriately use context across long conversations or interactions.
Testing Aspects:
How effectively does the model remember previous inputs in a long interaction without getting confused or providing irrelevant responses?
Does the model avoid "forgetting" context, or improperly continuing it when it's no longer relevant?
Non functional Testing AI Models | LLMs - Creativity and Novelty5:14
Description: How well the model generates novel or creative content when prompted.
Testing Aspects:
Can the model create new stories, jokes, or analogies that are both original and coherent?
Does it avoid repetitiveness or overfitting common outputs when asked for creative work?

Requirements

basic experience with software testing
basic coding experience ( but not needed)
optional - GPT model 4 subscription (but not needed)
desire to learn the hottest skill on the market
desire to learn the hottest skill on the market

Description

Welcome to "Non Functional Testing for LLM, Chatbots and AI Models" your comprehensive guide to mastering the fundamentals of testing AI systems. Whether you're a developer, data scientist, or AI enthusiast, this course will provide you with the knowledge and skills needed to assess, improve, and ensure the reliability, performance, safety, and ethical integrity of AI technologies.

What You Will Learn:

Introduction to AI Testing: Understand the critical importance of testing AI systems, addressing both technical performance and ethical considerations. Learn about the potential impacts of AI failures and how responsible testing mitigates these risks.
Special Focus on Foundation Models and LLMs: Dive deep into the unique challenges of testing large language models and foundational AI systems, which are driving innovation across multiple industries.
AI System Evaluations: Learn how to design and implement effective testing frameworks for AI-based systems, utilizing both manual and automated tools to improve system performance and safety.
Adversarial AI Testing: Understand how to evaluate the robustness of AI models through adversarial testing techniques, assessing how well AI systems resist manipulation and errors when exposed to malicious inputs.
PerspectiveAPI for Ethical and Toxicity Testing: Learn how to integrate the PerspectiveAPI and other tools to test AI systems for ethical compliance and detect harmful or toxic outputs, ensuring AI systems uphold safety and ethical standards.
Humanness in AI: Explore the concept of evaluating the "humanness" of AI responses. Learn how to test whether AI systems generate outputs that are human-like, contextually aware, and empathetic in their interactions.
Ethical AI: Delve into the risks associated with AI and the ethical dimensions of AI development. Learn how to test AI systems for bias, fairness, and transparency, ensuring adherence to responsible AI practices.
Testing ChatGPT and Chatbots Using APIs in MLOps: Learn to test and evaluate conversational models like ChatGPT through APIs, and understand how to integrate these tests into MLOps pipelines for continuous AI improvement.
Case Studies: Review real-world examples of AI testing, learning from common pitfalls and best practices used in the field to ensure AI reliability and safety.

Who This Course Is For:

This course is designed for individuals seeking a comprehensive understanding of the techniques and practices required for testing AI systems. Whether you are starting a career in AI, enhancing your professional skills, or interested in the technical and ethical mechanisms behind AI system reliability, this course offers valuable insights.

Enroll now to start mastering the critical skill of testing AI systems, ensuring that you are equipped to contribute to the development of safe, reliable, and ethically sound AI technologies!

Who this course is for:

Citizen Developer
Software testers
Quality engineers
Social Engineers
Prompt Engineers
Product Managers
Engineering Directors

Non Functional Testing for LLM, Chatbots and AI Models

What you'll learn

Explore related topics

Course content

Introduction5 lectures • 37min

Setup Environment8 lectures • 36min

Introduction to Artificial Intelligence9 lectures • 37min

Introduction to LLM Basic Testing4 lectures • 17min

Performance Characteristics for AI Models5 lectures • 19min

Automated Testing Framework with Postman and ChatGPT7 lectures • 29min

Toxicity Testing Framework for LLMs4 lectures • 20min

Adversarial /Security Testing for LLMs8 lectures • 1hr 24min

Laboratory - Benchmark BIAS and Fairness of AI6 lectures • 28min

Non - Functional Testing - Human in AI7 lectures • 53min

Requirements

Description

Who this course is for: