
Understand what are the limitation, what you can and cannot do in all phases of a software testing project
Understand what you will learn and what you will not learn in this material - do no skip
Get a brief introduction on what are the main components of AI
Understand how LLMS and foundation models are part of the AI Science field.
How NLP actually makes the AI more human.
Understand what is machine learning and how algorithms make the core of AI
Understand the basics concepts around supervised Machine Learning
Gain basic understanding of Unsupervised ML and Clustering
In this lecture you will get a basic idea of how Reinforced Learning is working together with ML Algorithms
In this material you will understand how critical good quality training data actually is.
Understand what are the main areas in software testing
Understand what PerspectiveAPI can offer in matter in toxicity and censorship;
Step by step demo how to obtain a ChatGPT API Key
See a live demo how we will implement an API call to Perspective API
Github Code: calculator/src/test/Perspecttive_api at master · danteachqe/calculator · GitHub
Understand how Perspective API And ChatGPT work together to create an automated testing framework for Toxic content.
In this lecture you will learn the basics of adversial testing as well as the red and blue security teams.
Learn how prompt injection works by understanding, direct and indirect prompt injections techniques as well as some examples and how to defend against them.
Fuzz testing, also known as fuzzing, is a software testing technique used to discover security vulnerabilities and bugs by inputting large amounts of random or semi-random data (often referred to as "fuzz") into a program or system.
Understand DOS attacks such as: Prompt flood, API Exploitation, Context Window Exploit and other
Understand what is poisoning attack, how to mitigate it and some examples.
Links to material reference:
https://www.businessinsider.com/tesla-hackers-steer-into-oncoming-traffic-with-stickers-on-the-road-2019-4
https://bair.berkeley.edu/blog/2019/08/13/memorization/
https://chatgpt.com/share/456d092b-fb4e-4979-bea1-76d8d904031f
https://www.researchgate.net/figure/Adversarial-examples-using-PGD-with-and-with-noise-constraint-of-on_fig1_350132115
A poison attack involves injecting malicious data into a training dataset, causing the model to learn incorrect or harmful behaviors. The attacker subtly alters training inputs so that the model produces incorrect outputs during real-world use.
Understand the lesser known aspects of non-functional Testing for Conversational AI
Description: How well does the model align with ethical guidelines and avoid generating harmful, biased, or inappropriate content?
Testing Aspects:
Does the model refrain from generating offensive or dangerous outputs (e.g., hate speech, misinformation)?
How well does it handle sensitive topics without being exploitative or biased?
Description: The ability of the model to provide understandable reasoning for its outputs.
Testing Aspects:
Can the model explain why it provided a certain answer or the process it used to arrive at a decision?
Is there transparency in how it weighs different inputs?
Description: How well the model adapts to different types of user interactions, including ambiguous, contradictory, or incomplete queries.
Testing Aspects:
Can the model maintain coherent conversations across multiple queries or threads?
How does it handle user queries that contain slang, mixed languages, or unusual input formats?
Description: The ability to retain and appropriately use context across long conversations or interactions.
Testing Aspects:
How effectively does the model remember previous inputs in a long interaction without getting confused or providing irrelevant responses?
Does the model avoid "forgetting" context, or improperly continuing it when it's no longer relevant?
Description: How well the model generates novel or creative content when prompted.
Testing Aspects:
Can the model create new stories, jokes, or analogies that are both original and coherent?
Does it avoid repetitiveness or overfitting common outputs when asked for creative work?
Welcome to "Non Functional Testing for LLM, Chatbots and AI Models" your comprehensive guide to mastering the fundamentals of testing AI systems. Whether you're a developer, data scientist, or AI enthusiast, this course will provide you with the knowledge and skills needed to assess, improve, and ensure the reliability, performance, safety, and ethical integrity of AI technologies.
What You Will Learn:
Introduction to AI Testing: Understand the critical importance of testing AI systems, addressing both technical performance and ethical considerations. Learn about the potential impacts of AI failures and how responsible testing mitigates these risks.
Special Focus on Foundation Models and LLMs: Dive deep into the unique challenges of testing large language models and foundational AI systems, which are driving innovation across multiple industries.
AI System Evaluations: Learn how to design and implement effective testing frameworks for AI-based systems, utilizing both manual and automated tools to improve system performance and safety.
Adversarial AI Testing: Understand how to evaluate the robustness of AI models through adversarial testing techniques, assessing how well AI systems resist manipulation and errors when exposed to malicious inputs.
PerspectiveAPI for Ethical and Toxicity Testing: Learn how to integrate the PerspectiveAPI and other tools to test AI systems for ethical compliance and detect harmful or toxic outputs, ensuring AI systems uphold safety and ethical standards.
Humanness in AI: Explore the concept of evaluating the "humanness" of AI responses. Learn how to test whether AI systems generate outputs that are human-like, contextually aware, and empathetic in their interactions.
Ethical AI: Delve into the risks associated with AI and the ethical dimensions of AI development. Learn how to test AI systems for bias, fairness, and transparency, ensuring adherence to responsible AI practices.
Testing ChatGPT and Chatbots Using APIs in MLOps: Learn to test and evaluate conversational models like ChatGPT through APIs, and understand how to integrate these tests into MLOps pipelines for continuous AI improvement.
Case Studies: Review real-world examples of AI testing, learning from common pitfalls and best practices used in the field to ensure AI reliability and safety.
Who This Course Is For:
This course is designed for individuals seeking a comprehensive understanding of the techniques and practices required for testing AI systems. Whether you are starting a career in AI, enhancing your professional skills, or interested in the technical and ethical mechanisms behind AI system reliability, this course offers valuable insights.
Enroll now to start mastering the critical skill of testing AI systems, ensuring that you are equipped to contribute to the development of safe, reliable, and ethically sound AI technologies!