Data Science/Machine Learning Principles for Natural Sciences

Name: Data Science/Machine Learning Principles for Natural Sciences
Rating: 4.3 (341 reviews)

Learn the basics and principles of data science and machine learning for scientific problems

Created by Guilherme Matos Passarini, PhD, Jones Granatyr, AI Expert Academy

Last updated 1/2025

English

What you'll learn

Understand the concepts of data science and machine learning and how they can be used in science
Know the main algorithms used in tasks of classification, regression, and clustering
Know the main architectures of neural networks
Understand how you can use algorithms/analyses in science projects/investigations/studies

Course content

7 sections • 50 lectures • 3h 54m total length

Welcome to the course (4:13)
This lecture aims to introduce the course to students
How to watch the course (1:25)
This lecture aims to explain some features of the platform used to watch the course

The objective of data science and machine learning (3:48)
This lecture aims to introduce the concept of machine learning and how it works
Different types of variables (5:44)
This lecture aims to introduce the main types of variables used in statistics and data science
Transforming categorical data into numerical data (4:01)
This lecture aims to explain the concepts of label encoding and one-hot encoding
Data normalization/scaling (5:06)
This lecture aims to introduce the concept of data scaling and its importance
Supervised/unsupervised learning (6:26)
This lecture aims to introduce the concepts of supervised and unsupervised learning
Some details about training (4:20)
This lecture aims to introduce the concept of training and how a dataset can be divided. It also explains the concepts of overfitting and underfitting
Visualizing data with graphs: part 1 (6:07)
This lecture aims to introduce the main types of graphs used to represent scientific data
Visualizing data with graphs: part 2 (6:48)
This lecture aims to introduce the main types of graphs used to represent scientific data: part 2
Dimensionality reduction and PCA (Principal Component Analysis) (5:57)
This lecture aims to introduce a dimensionality reduction technique called PCA (principal component analysis)
Real case study: PCA to analyze heavy metal pollution (1:19)
Questions about data

Introduction to classification (3:09)
This lecture aims to introduce the concept of classification and the main algorithms
Evaluation of classification algorithms (7:26)
This lecture aims to explain the evaluation of classification algorithms
The naive Bayes algorithm: introduction (5:41)
This lecture aims to explain the Naive Bayes algorithm
Naive Bayes: Laplacian correction (4:12)
This lecture aims to explain the Laplace correction
The naive Bayes algorithm: applications/examples in science (4:21)
This lecture aims to use a scientific example as a case study for Naive Bayes
Decision trees: part 1 (5:54)
This lecture aims to explain the decision tree algorithm
Decision trees: Entropy and gain of information (4:37)
This lecture aims to explain the concepts of entropy and information gain
Random forest: an enhancement over simple decision trees (2:59)
This lecture aims to explain the enhancement over decision trees called random forest
Random Forest: a scientific example: drug bioavailability (5:44)
This lecture aims to present a scientific case study for random forest
The KNN algorithm: introduction (3:09)
This lecture aims to explain the KNN algorithm
The KNN algorithm: distance calculation (3:46)
This lecture aims to explain distance calculation in KNN
KNN: scientific examples (7:13)
This lecture aims to present scientific examples of how the KNN algorithm could be used
Support Vector Machines (SVM): Introduction (5:39)
This lecture aims to explain the SVM (Support Vector Machines) algorithm
The margin in Support Vector Machines (3:04)
This lecture aims to explain the effect of the margin in SVM
Support Vector Machines (SVM): scientific examples (6:59)
This lecture aims to present a few scientific examples to explain SVM
Real case study: Random forest to evaluate Diabetes Mellitus diagnosis (1:00)
Classification questions

Introduction to basic neural networks and biological inspiration (3:15)
This lecture aims to introduce the concept of neural networks, their usefulness, and features
Examples of uses of neural networks in science (1:53)
Introduction to basic neural networks: part 2 (6:10)
This lecture aims to explain more details about neural networks
Introduction to basic neural networks: part 3 (6:45)
This lecture aims to explain more details about neural networks, like gradient descent and activation functions
Convolutional neural networks and their inspiration in the visual cortex (5:01)
This lecture aims to introduce the concept of convolutional neural networks and how they are inspired by the visual cortex
Convolutional neural networks: part 2 (4:35)
Recurrent neural networks: part 1 (7:00)
Recurrent neural networks: part 2 (6:47)
Hopfield networks as a model of the human memory: a brief overview (7:09)
Introduction to liquid neural networks and their improvements based on neurons (6:31)
Real case study: Convolutional neural networks for astronomical object analysis (1:07)
Questions about neural networks

Requirements

Basic math knowledge is desirable

Description

The course "Principles of Data Science and Machine Learning for Natural Sciences" is designed to connect traditional scientific disciplines with the rapidly growing fields of Data Science (DS) and Machine Learning (ML). As research increasingly depends on large datasets and advanced computational methods, it’s becoming essential for scientists to know how to leverage DS and ML techniques to improve their work.

This course offers a solid introduction to the key concepts of Data Science and Machine Learning, specifically aimed at scientists and researchers in areas like biology, chemistry, physics, and environmental science. Participants will learn the basics of data analysis, including data collection, cleaning, and visualization, before moving on to machine learning algorithms that can help identify patterns and make predictions from data.

The course doesn’t require any programming skills and focuses on fundamental theoretical concepts. It's structured into six main sections:

1. Introduction

We'll start by introducing the course, covering its main features, content, and how to follow along.

2. Core DS/ML Concepts

We’ll go over basic concepts like variables, data scaling, training, datasets, and data visualization.

3. Classification

In this section, we’ll discuss key classification algorithms such as decision trees, random forests, Naive Bayes, and KNN, with examples of how they can be applied in scientific research.

4. Regression

We’ll briefly cover linear and multiple linear regression, discussing the main ideas and providing examples relevant to science.

5. Clustering

This section will focus on standard and hierarchical clustering methods, along with practical examples for scientific applications.

6. Neural Networks

Finally, we’ll introduce neural networks, discussing their biological inspiration and common architectures like Feedforward Neural Networks (FNN), Convolutional Neural Networks (CNN), Recurrent Neural Networks (RNN), and Hopfield Networks.

Who this course is for:

The primary target audience is people from STEM fields interested in understanding and using DS/ML concepts
People from an IT/computer science background interested in learning how the algorithms can be used in science projects
People from a math background interested in understanding concepts of DS/ML and science

Course content

Course introduction: 2 lectures • 6min

Concepts about data, variables, and intro to ML: 10 lectures • 50min

Machine Learning: classification: 16 lectures • 1hr 15min

Machine Learning: regression: 4 lectures • 15min

Machine Learning: clustering: 6 lectures • 29min

Introduction to neural networks: 11 lectures • 56min

Conclusion: 1 lecture • 4min
