Udemy
    •  
    •  
    •  
    •  
    •  
    •  
    •  
    •  
Turn what you know into an opportunity and reach millions around the world.
Learn More
Your cart is empty.
Keep shopping
Data Acquisition and Manipulation with Python
Rating: 4.5 out of 5(5 ratings)
101 students

Data Acquisition and Manipulation with Python

Acquire and analyse data in different formats with the help of Python data analysis tools
Last updated 10/2017
English

What you'll learn

  • See how to acquire and save different formats of data
  • Find out how to connect to a database and add information to it
  • Combine and merge data sets, and manipulate strings using Python
  • Aggregate your data and employ group-wise operations and transformations
  • Know how to download and read a web page using the BeautifulSoup package
  • Extract useful information from websites using Selenium in Python
  • Program and employ a Scrapy spider for successful web scraping

Course content

6 sections25 lectures2h 39m total length
  • The Course Overview5:37

    This video gives an overview of entire course.

  • Getting Data in Its Different Formats9:05

    In this video, we will see how we load data into Python. Here we see, how to load data into a Pandas DataFrame from various sources, including CSV, Excel, XML, JSON, and via a web API.

  • Connecting to a Database4:47

    In this video, we will see how we can connect to a (MySQL) database. Here we set up such connection.

  • Adding Information to a Database3:31

    Once we have a database, we will see how we add data to it. Here, we create a table in Python with SQL commands, then use a Panda’s dataframe to add data to the table in the database.

  • Query a Database3:35

    In this, we will see how we extract data from a database. Here we use SQL and Pandas to load a table’s data into a DataFrame.

Requirements

  • • What will the customer need to install (core software) to follow along? Anaconda, Python, MySQL, Chrome or Firefox
  • • What Plugins will the customer need? Install packages as needed
  • • What versions of software should the customer use? Relatively recent, but not version sensitive
  • • Any special hardware needed? No
  • • Can you point to instruction to install anything that we will need to install? (Can be websites, other videos, written instructions) I detail installation in Unpacking NumPy and Pandas, an earlier course. Otherwise, check Chrome/Firefox's site, continuum .io for Anaconda, oracle .com for MySQL
  • • Anything that you would like the customers to watch out for like a precautionary measure or a tip? DO NOT SCRAPE THE WEB TOO FAST!!! Also, practice safe web practices.
  • • How are the Exercise Files laid out? Code Samples/ Starting State of the Examples/ Finished State of the Examples/ Finished State per video/ Finished State per Chapter/ Finished State per course Exercises are often linked within a section. Many need data files that I have provided. A set of exercises become a guided project that allow viewers to do tasks similar to those shown in the videos. They simply give viewers instructions on what they need to do to complete the exercise.
  • • What level user would be best served by this course? For example: "Beginner Django User but Advanced Web Designer"
  • • This course has been tested on the following system configuration: ● OS: Windows 10 ● Processor: AMD 64 ● Memory: 12GB ● Hard Disk Space: 300GB ● Video Card: NVidia 560

Description

Python, a multi-paradigm programming language, has become the language of choice for data scientists for data analysis, visualization, and machine learning.

In this course, you’ll start by learning how to acquire data from the web in its already “clean” format, such as in a .csv file, or a database. You’ll then learn to transform this data so it’s in its most useful format for analysis. After that, you’ll dive into data aggregation and grouping, where you’ll learn to group similar data for easier analysis purposes.  From there, you’ll be shown different methods of web scraping using Python. Finally, you’ll learn to extract large amounts of data using BeautifulSoup, as well as work with Selenium and Scrapy.

About the author

Curtis Miller is Associate Instructor at the University of Utah, and an MSTAT student. He is currently involved in research on data analysis from statistical and computer science perspectives. Curtis has published research on policy and economic issues.

Who this course is for:

  • If you are a budding Data Analyst and you want to learn how to acquire, manage, and manipulate large amounts of data from the web using the Python programming language, then this is the perfect video tutorial for you.