Udemy
  •  
  •  
  •  
  •  
  •  
  •  
  •  
  •  
  •  
  •  
  •  
  •  
  •  
Development
Web Development Data Science Mobile Development Programming Languages Game Development Database Design & Development Software Testing Software Engineering Software Development Tools No-Code Development
Business
Entrepreneurship Communication Management Sales Business Strategy Operations Project Management Business Law Business Analytics & Intelligence Human Resources Industry E-Commerce Media Real Estate Other Business
Finance & Accounting
Accounting & Bookkeeping Compliance Cryptocurrency & Blockchain Economics Finance Finance Cert & Exam Prep Financial Modeling & Analysis Investing & Trading Money Management Tools Taxes Other Finance & Accounting
IT & Software
IT Certifications Network & Security Hardware Operating Systems & Servers Other IT & Software
Office Productivity
Microsoft Apple Google SAP Oracle Other Office Productivity
Personal Development
Personal Transformation Personal Productivity Leadership Career Development Parenting & Relationships Happiness Esoteric Practices Religion & Spirituality Personal Brand Building Creativity Influence Self Esteem & Confidence Stress Management Memory & Study Skills Motivation Other Personal Development
Design
Web Design Graphic Design & Illustration Design Tools User Experience Design Game Design 3D & Animation Fashion Design Architectural Design Interior Design Other Design
Marketing
Digital Marketing Search Engine Optimization Social Media Marketing Branding Marketing Fundamentals Marketing Analytics & Automation Public Relations Paid Advertising Video & Mobile Marketing Content Marketing Growth Hacking Affiliate Marketing Product Marketing Other Marketing
Lifestyle
Arts & Crafts Beauty & Makeup Esoteric Practices Food & Beverage Gaming Home Improvement & Gardening Pet Care & Training Travel Other Lifestyle
Photography & Video
Digital Photography Photography Portrait Photography Photography Tools Commercial Photography Video Design Other Photography & Video
Health & Fitness
Fitness General Health Sports Nutrition & Diet Yoga Mental Health Martial Arts & Self Defense Safety & First Aid Dance Meditation Other Health & Fitness
Music
Instruments Music Production Music Fundamentals Vocal Music Techniques Music Software Other Music
Teaching & Academics
Engineering Humanities Math Science Online Education Social Science Language Learning Teacher Training Test Prep Other Teaching & Academics
Web Development JavaScript React Angular CSS Node.Js PHP HTML5 Vue JS
AWS Certification Microsoft Certification AWS Certified Solutions Architect - Associate AWS Certified Cloud Practitioner CompTIA A+ Amazon AWS Cisco CCNA Microsoft AZ-900 CompTIA Security+
Microsoft Power BI SQL Tableau Data Modeling Business Analysis Business Intelligence MySQL Qlik Sense Data Analysis
Unity Unreal Engine Game Development Fundamentals C# 3D Game Development C++ Unreal Engine Blueprints 2D Game Development Blender
Google Flutter iOS Development Android Development Swift React Native Dart (programming language) Kotlin Mobile App Development SwiftUI
Graphic Design Photoshop Adobe Illustrator Drawing Digital Painting Canva InDesign Character Design Procreate Digital Illustration App
Life Coach Training Personal Development Neuro-Linguistic Programming Personal Transformation Life Purpose Mindfulness Sound Therapy Meditation CBT Cognitive Behavioral Therapy
Business Fundamentals Entrepreneurship Fundamentals Freelancing Business Strategy Startup Business Plan Online Business Blogging Leadership
Digital Marketing Social Media Marketing Marketing Strategy Google Analytics Internet Marketing Email Marketing Copywriting YouTube Marketing Startup

IT & SoftwareOther IT & SoftwareMachine Learning

Create Your Own Datasets

with Google Colab and sklearn
Rating: 0.0 out of 50.0 (0 ratings)
2 students
Created by Tracy Renee
Last updated 1/2022
English
English [Auto]

What you'll learn

  • Students will learn how to create their own datasets using Google Colab and sklearn.
  • Students will be given an introduction to Google Colab, which they will use to write their own programs.
  • Students will be given an introduction to Python's machine learning library, sklearn, which they will need in order to create their own datasets.
  • Students will learn how to create the twenty datasets that are included in sklearn.

Requirements

  • The equipment needed for this course is a computer with an internet connection
  • A prerequisite to this course is the course I have created, "Use Google Colab to learn Python programming".

Description

In this course the student will learn how to use Google Colab and Python's machine learning library, sklearn, to create datasets and use them in machine learning enterprises.

The datasets will be created in sklearn and they are comprised of classifications and regressions, being twenty in total.

When the datasets have been created, machine learning techniques will be employed to make predictions on the labels. In addition, the concepts of supervised and unsupervised learning will be discussed. Although most of the examples will be of supervised learning, clustering will be brushed upon.

Some of the datasets introduce noise into the system, and this will decrease accuracy of predictions. The student will be shown how to tune the parameters of the appropriate datasets to reduce noise and thereby improve accuracy of the predictions. This proves that noise has an inverse relationship to accuracy of the model.

Some of the datasets will have outliers, so methods for reducing outliers in the dataset will also be discussed. When the outliers are removed, accuracy of the predictions are also likely to be increased. This proves that outliers have an inverse relationship to accuracy of the model.

The student will be taken through the following steps to create a dataset and write a program to make predictions on the labels:-

1. Import libraries.

2. Create dataset.

3. Plot a graph of the dataset so it can be seen in the computer's memory.

4. Analyse the label.

5. Remove outliers if necessary.

6. Normalise or standardise the independent variable if necessary.

7. Split the dataframe into training and validation sets.

8. Select the model.

9. Train and fit the training set into the model.

10. Make predictions on the validation set.

11. Check accuracy and / or error of the predictions.

12. Compare the predictions against the actual values.

13. Plot the predictions on a graph.

Who this course is for:

  • This course is designed for anyone who would like to create their own datasets using sklearn, which is Python's machine learning library.

Instructor

Tracy Renee
Data Scientist
Tracy Renee
  • 4.1 Instructor Rating
  • 37 Reviews
  • 3,189 Students
  • 10 Courses

I have almost five decades experience in work, to include United States Air Force, the corporate sector, and non profit sectors, and charities. I also have a BA in Computer Studies, a MSc in Finance, and have a Diploma in Accounting through the AAT. My hobbies include data science, creating content on social media, and writing.

Follow Tracyrenee on medium publications.

Subscribe to Coding with Crystal Hill on YouTube.

Top companies choose Udemy Business to build in-demand career skills.
NasdaqVolkswagenBoxNetAppEventbrite
  • Udemy Business
  • Teach on Udemy
  • Get the app
  • About us
  • Contact us
  • Careers
  • Blog
  • Help and Support
  • Affiliate
  • Investors
  • Impressum Kontakt
  • Terms
  • Privacy policy
  • Cookie settings
  • Sitemap
  • Accessibility statement
Udemy
© 2022 Udemy, Inc.