
Get started by setting up the development environment, installing open source tools, and combining Python and R for data processing, statistics, and data visualizations.
Learn the data mining process from business understanding to deployment using visualizations. Explore data preparation, modeling, and evaluation with techniques like regression and classification, and report results.
Download the iris CSB dataset and identify the high risk dataset on the drive to support data processing and visualizations in R.
Read a dataset in R, use the iris data to compute descriptive statistics, and generate a summary, then run analyses and inspect the data structure.
Learn to create bar charts in R to visualize data frequencies, customize colors, and label axes to interpret data during data processing.
Export a bar plot image to turn data into a clear bar chart. Use practical steps to generate and save bar charts for data visualization in R with data processing.
Explore creating horizontal bar charts in data visualizations using R with data processing, transforming datasets into clear visuals for insightful comparisons.
Explore how to create a camera histogram in R and customize color (green), borders, and histogram layers to reflect data processing.
Learn to build histograms in R with a density line to visualize data distribution, adjust frequency and probability, and apply color and layout options.
Learn to create line charts in R for data processing, connecting points with lines, customizing x and y aesthetics, and exporting visuals.
Explore creating a multiple line chart in R by configuring y values, line types, and colors to visualize and compare data across categories.
Create a pie chart in R using a labor data vector and labeled slices. Map labels 1–7 to the pie pieces and render the chart from the data.
Learn to build a 3D pie chart in R by installing and using a dedicated library, then create and customize the pie chart with the pie3D function.
Master data setup and processing workflows for scatterplot visuals by organizing data, copying datasets, and aligning x and y coordinates through practical programming steps.
Scatterplot matrix in R to study correlations among variables and set up metrics for data analysis.
Explore ggplot2 in R to craft advanced charts by building data visualizations with data, geometric objects, statistical transformations, scales, coordinate systems, and position adjustments for multivariate and categorical data.
Explore aesthetic mapping and geometric concepts in data visualizations using R, focusing on data processing techniques.
Learn to build a geometric point plot in R by mapping color to an asset variable and preparing data assets, such as GDP data, for visualization.
Explore data visualizations using R with data processing by applying and customizing themes, adjusting element properties, sizes, axes, and global styles to enhance charts.
Learn to create a bar chart in R using ggplot2, map data to aesthetics with aes, apply grey color, and add labels and a title.
Compare a dataset by building a histogram in R, selecting a variable, and adjusting gray color and axis labels to convey distribution clearly.
Explore density plots in r to visualize data density and normal distribution, using high and low density regions to interpret where data concentrates.
Explore how to create and interpret boxplots in R to visualize data distributions and identify outliers.
Learn how to save ggplot visualizations in R by specifying file paths and the initial file location, including drive choices like the D drive.
Learn how to sort data in R using order for ascending and descending sequences, including sorting by multiple keys and applying decreasing = true to arrange data frames.
Why learn Data Analysis and Data Science?
According to SAS, the five reasons are
1. Gain problem solving skills
The ability to think analytically and approach problems in the right way is a skill that is very useful in the professional world and everyday life.
2. High demand
Data Analysts and Data Scientists are valuable. With a looming skill shortage as more and more businesses and sectors work on data, the value is going to increase.
3. Analytics is everywhere
Data is everywhere. All company has data and need to get insights from the data. Many organizations want to capitalize on data to improve their processes. It's a hugely exciting time to start a career in analytics.
4. It's only becoming more important
With the abundance of data available for all of us today, the opportunity to find and get insights from data for companies to make decisions has never been greater. The value of data analysts will go up, creating even better job opportunities.
5. A range of related skills
The great thing about being an analyst is that the field encompasses many fields such as computer science, business, and maths. Data analysts and Data Scientists also need to know how to communicate complex information to those without expertise.
The Internet of Things is Data Science + Engineering. By learning data science, you can also go into the Internet of Things and Smart Cities.
This is the bite-size course to learn R Programming for Data visualizations. In CRISP-DM data mining process, Data Visualization is at the Data Understanding stage. This course also covers Data processing, which is at the Data Preparation Stage.
You will need to know some R programming, and you can learn R programming from my "Create Your Calculator: Learn R Programming Basics Fast" course. You will learn R Programming for applied statistics and you will be able
You can take the course as follows, and you can take an exam at EMHAcademy to get SVBook Certified Data Miner using the R certificate :
- Create Your Calculator: Learn R Programming Basics Fast (R Basics)
- Applied Statistics using R with Data Processing (Data Understanding and Data Preparation)
- Advanced Data Visualizations using R with Data Processing (Data Understanding and Data Preparation, in the future)
- Machine Learning with R (Modeling and Evaluation)
Content
Getting Started
Getting Started
Getting Started
Hello World Application
Data Mining Process
Download Dataset
Read Dataset
Bar Plot
Export Bar Chart as Image
Horizontal Bar Chart
Histogram
Histogram with Density Line
Line Chart
Multiple Line Chart
Pie Chart
3D Pie Chart
Scatterplot
Boxplot
Scatterplot Matrix
GGPlot 2
Aesthetic Mapping and Geometric
GEometrics
Labels and Titles
Themes
GGPlot2: Bar CHart
GGPlot2: HIstogram
GGPlot2: Density Plot
GGPlot2: Scatterplot
GGPLot2: Line Chart
GGPLot2: BoxpLot
Save GGPLot Image
Data Processing: Select Variables
Data Processing: Sort Data
Data Processing: Filter Data
Data Processing: Remove Duplicates and Missing Values
References:
This course is actually based on the Learn R for Applied Statistics book I have published at Apress.