AI Application Boost with RAPIDS GPU Acceleration

Name: AI Application Boost with RAPIDS GPU Acceleration
Rating: 4.4 (130 reviews)

High-speed and high-performance GPU and CUDA computing! Build Data Science pipelines 50 times faster!

Created byJones Granatyr, Gabriel Alves, AI Expert Academy

Last updated 8/2025

English

What you'll learn

Understand the differences between processing data using CPU and GPU
Use cuDF as a replacement for pandas for GPU-accelerated processing
Implement codes using cuDF to manipulate DataFrames
Use cuPy as a replacement for numpy for GPU-accelerated processing
Use cuML as a replacement for scikit-learn for GPU-accelerated processing
Implement a complete machine learning project using cuDF and cuML
Compare the performance of classic Python libraries that run on the CPU with RAPIDS libraries that run on the GPU
Implement projects with DASK for parallel and distributed processing
Integrate DASK with cuDF and cuML for GPU performance

Course content

6 sections • 47 lectures • 6h 31m total length

Course content11:05
Master GPU-accelerated AI workflows with Nvidia rapids, cudf, cupy, and cuml, comparing performance to pandas and sklearn, and building end-to-end projects in Google Colab.
CPU vs GPU11:48
GPU and CUDA11:15
RAPIDS11:27
Course materials0:03

cuDF - intuition9:09
Installation9:11
Install rapids on Google Colab via a single command or a repository script, selecting GPU, CUDA, and Python versions. Save a copy of the notebook to Google Drive when customizing.
Pandas and cuDF5:44
Basic commands 19:25
Basic commands 27:08
Basic commands 310:37
Basic commands 412:30
Integration with cuPy6:35
Other data convertions5:45
User defined functions 116:52
User defined functions 210:26
Apply user defined functions to data frames with CUDA, using df.apply, apply rows, and apply chunks for GPU acceleration and missing-value handling. Create UDFs like add for row operations.
User defined functions 35:08
Performance comparison 18:45
Explore the performance comparison between rapids on GPU and pandas on CPU, testing value counts, concatenation, group by, merge, and string operations on datasets with 10 million rows.
Performance comparison 214:48
Performance comparison 37:31

DASK - intuition11:34
Creating a local cluster6:32
Arrays in distributed GPUs7:41
Distribute a 100,000 x 100 matrix across GPUs or CPUs using Dask array, with CuPy random state and chunking, then perform SVD and persist results to GPU memory.
DASK and cuDF9:20
Learn to integrate Dask and cuDF to partition data across GPUs and CPUs, compute across partitions, and export results to CSV.
DASK and cuML 114:54
DASK and cuML 28:53

Requirements

Programming logic
Basic Python programming
Machine learning: basic understanding of the algorithm training process, as well as classification and regression techniques

Description

This course is independently developed and is not affiliated with, endorsed, or sponsored by NVIDIA Corporation. RAPIDS is an open-source project originally developed by NVIDIA.

Data science and machine learning represent the largest computational sectors in the world, where modest improvements in the accuracy of analytical models can translate into billions of impact on the bottom line. Data scientists are constantly striving to train, evaluate, iterate, and optimize models to achieve highly accurate results and exceptional performance. With NVIDIA's powerful RAPIDS platform, what used to take days can now be accomplished in a matter of minutes, making the construction and deployment of high-value models easier and more agile. In data science, additional computational power means faster and more effective insights. RAPIDS harnesses the power of NVIDIA CUDA to accelerate the entire data science model training workflow, running it on graphics processing units (GPUs).

In this course, you will learn everything you need to take your machine learning applications to the next level! Check out some of the topics that will be covered below:

Utilizing the cuDF, cuPy, and cuML libraries instead of Pandas, Numpy, and scikit-learn; ensuring that data is processed and machine learning algorithms are executed with high performance on the GPU.
Comparing the performance of classic Python libraries with RAPIDS. In some experiments conducted during the classes, we achieved acceleration rates exceeding 900x. This indicates that with certain databases and algorithms, RAPIDS can be 900 times faster!
Creating a complete, step-by-step machine learning project using RAPIDS, from data loading to predictions.
Using DASK for task parallelism on multiple GPUs or CPUs; integrated with RAPIDS for superior performance.

Throughout the course, we will use the Python programming language and the online Google Colab. This way, you don't need to have a local GPU to follow the classes, as we will use the free hardware provided by Google.

Who this course is for:

Data scientists and artificial intelligence professionals looking to enhance the performance of their applications
Professionals currently working or aspiring to work in the field of data science, particularly those seeking to improve their skills in machine learning model training and data analysis
Anyone interested in learning about machine learning, especially with a focus on high-performance implementations using GPUs
Professionals involved in the development and implementation of machine learning models
Undergraduate and graduate students studying subjects related to artificial intelligence

AI Application Boost with RAPIDS GPU Acceleration

What you'll learn

Explore related topics

Course content

Introduction5 lectures • 46min

cuDF15 lectures • 2hr 20min

cuML9 lectures • 1hr 19min

Complete project10 lectures • 1hr 4min

DASK6 lectures • 59min

Final remarks2 lectures • 3min

Requirements

Description

Who this course is for: