
Explore open data science with Python, pandas, and Anaconda for accessing, analyzing, and visualizing tabular data, and building predictive models through cross-functional collaboration.
Learn the fundamentals of Jupyter basics, including executing code cells with shift enter, inserting and moving cells, and handling kernel interruptions with stop, restart, and clear output.
Learn to load and analyze data with Python pandas, using auto LPG dataset (1970–1982) to read, sort by fuel efficiency, extract makes from names, clean duplicates, and group by makes.
Visualize fuel efficiency trends from the 1970s using the bouquet interactive visualization library, with interactive scatterplots, hover details, and widgets to zoom and color by Ford, Volkswagen, and Honda.
Learn to load and split data, train linear regression and random forest models with scikit-learn, evaluate predictions against test data, and create interactive visualizations of deltas using pandas.
Discover how data science blends statistics, processing, machine learning, and visualization to extract insights, using open source tools and Python with Anaconda for collaborative production analytics.
Enable cross-functional collaboration in open data science by uniting statisticians, researchers, computational scientists, developers, and DevOps to move analytics from exploration to production with Anaconda.
Explore, model, predict, share, and deploy in the modern data science development workflow, from data exploration to production deployment and collaboration across teams.
Import pandas as pd, inspect data with describe and info, and review memory usage and data types. Visualize year vs fuel efficiency with Map Plot Lib in Jupyter.
Enrich data in pandas by in-place replacing origin codes with american, european, and asian, and derive make and model from vehicle names using split and list comprehension.
Explore boolean masks and indexing in pandas to filter, update, and visualize data frames by origin, cylinders, and names, using interactive, open data science techniques.
Learn to read a csv with pandas, load a 40,000-row, 83-column fuel economy data frame, and select columns via a colon and a column list.
Learn to subset and reorder a dataframe to focus on key fields, convert select columns to category types for memory efficiency, and apply masks and contains for filtering.
Learn how to group data with pandas by the fields make and model, explore the resulting groups, and visualize fuel efficiency trends across years with custom plotting.
Connect to a SQLite database using pandas read_sql, exploring the Chinook database to pull tracks and related tables via joins, and consolidate data into a normalized dataframe.
connect to a database and normalize data in pandas by renaming columns, then use SQL-like queries and Python counters to analyze track metadata and composer counts.
Explore time series data using sqlite and pandas, parse invoice dates, index by invoice id, group by year and month, and plot the month-end total.
Learn to read and write Excel files with pandas, using read_excel and to_excel, specify sheets, and export multi-sheet workbooks with ExcelWriter and xlsxwriter for data from SQL databases.
Save your notebook and publish it to Anaconda Cloud's analytics artifact repository by signing into Anaconda Cloud or Anaconda Enterprise account and using the upload button in the Jupiter window.
Explore the open data science ecosystem with the Anaconda platform, highlighting cross-platform distribution, hundreds of precompiled packages, and conda-based reproducible data workflows for Python and R.
Discover conda and the anaconda distribution, offering language- and platform-independent packages, dependency management, and reproducible environments through channels like conda forge, plus the capsule capability.
Explore how the Anaconda platform layers—from distribution to enterprise notebooks and Mosaic—enable collaborative, reproducible analytics across desktops, data labs, and production clusters.
Explore how the Anaconda desk library enables parallel data processing with task graphs on single or distributed machines, supporting terabyte-scale datasets and interactive visualizations with the book and data shader ecosystems.
Navigate data science workflows with the Anaconda platform, moving from single-machine analysis to collaboration, publish notebooks and environments to the artifact repository, and deploy reproducible models at scale.
Create a new project and invite teammates to collaborate in a centralized, secure environment. Work with per-project Conda environments and notebooks, and publish to the Anaconda Enterprise repository.
Learn to build high quality, interactive visualizations with the book data visualization library that uses grammar for graphics, requires no JavaScript, and renders via JSON in modern browsers.
Learn to plot pandas dataframes with bokeh charts, creating interactive histograms, scatterplots, bar charts, and heat maps using iris flowers and auto mpg datasets with petal length, petal width, displacement.
Learn to build high quality interactive visualizations with the book plotting library, creating figures and glyphs, and using a column data source to render per-point attributes as JSON.
Learn to manage plot construction in Bokeh using a column data source. Map species to colors, apply a hover tool, and render interactive visualizations from the iris data set.
Learn to add widgets to modify interactive Bokeh plots, link multiple figures with shared axes, arrange them in a grid, and enable linked brushing and tooltips.
Learn to build multi-dimensional interactive visualizations with widgets in Jupyter, linking plots via interactivity tools, updating data sources with NumPy, Python widgets, and custom JavaScript callbacks.
learn to generate standalone interactive web plots from bokeh by saving to static html files for web server assets, enabling client-side interactivity with hover tools and pan/zoom.
Learn to build data ops with a Python script server, render interactive plots including histograms from static and streaming data, and deploy via a web app and iframe.
Explore hollow views for composing interactive visualizations on top of bokeh. Optimize package management in an Anaconda environment by comparing conda and pip installations.
Learn to add channels in Navigator, including the r channel, to access conda’s large package ecosystem and install the essentials meta package with its dependencies.
Learn how to upgrade, downgrade, and remove packages in navigator using three example packages—book, dusk, and xarray—by selecting versions, applying changes, and managing downloads, package cache, and linking.
Create and switch conda environments in Anaconda Navigator. Start with a minimal python 3.5 core, then add the Anaconda meta package and leverage package cache with linking to save space.
Explore how to manage conda environments and Jupyter kernels, verify Python versions, switch kernels, and understand how Anaconda isolates packages to keep system Python safe.
Master conda from the command line across Windows, macOS, and Linux, enabling environment activation and package installation. Explore info, list, and channels to manage environments efficiently.
Master conda environment management, remove the base and dependent packages, create a Python 3.5 notebook with pandas and seaborn, and switch kernels in Jupyter.
Explain the differences between pip and conda, and explore continuum curated, community curated channels, and how to use pip within conda environments.
Install Peewee via pip in a conda environment and verify the correct kernel in the Jupyter notebook. Define Peewee models for person and pet, create sqlite tables, and run queries.
Conda and pip updates are essential for your data science workflow; learn how to update conda in the root environment and update pip per Python interpreter.
Learn how conda environments provide reproducible, encapsulated open data science work, and how to share them via YAML exports and Anaconda Navigator.
Learn to search for and install conda packages on Anaconda Cloud, focusing on Blaze and Data Shape across channels, using Navigator or conda, and manage channels for reproducible data science.
Learn to manage conda environments, install essentials and dev tools, and set up python and r data processing tooling for AR-enabled data visualization and notebook workflows.
Explore accessing and processing data with dplyr and tidyr, using the nyc flights dataset to arrange by carrier, origin, destination, and select key columns.
Chain select, arrange, group by, and summarize to compute mean delays by carrier; filter origin and destination; reshape data with gather and spread for tidy tables.
Learn to build visualizations with ggplot grammar by mapping carat to x and price to y, layering with geom_point, and exploring color, alpha, and log scales for the diamonds dataset.
Learn to refine ggplot visualizations by adjusting alpha and size, using facet wrap to split data, and interpreting trends with linear estimators and confidence intervals across diamonds and iris examples.
Build linear predictive models with the diamonds dataset for predictive analytics, split data into training and testing sets, fit price to carat and cut, and compare observed versus predicted plots.
Learn to build and compare linear predictive models using the LM framework, train with data, and visualize the fit to analyze model predictions in predictive analytics.
Learn to create interactive visualizations in R using bouquet plots and Shiny apps, leveraging the book and shiny packages to explore linear models, histograms with density overlays, and iris clustering.
Bridge Python and R using the RPI two module in Jupyter, using magics like percent R to pass data and build regression with a pandas data frame.
Install or update the necessary packages, then build a supervised learning model using linear regression to predict fuel efficiency, splitting data into training and testing sets.
Generate predictions from a trained predictive model, evaluate accuracy on a test set, and compare linear regression with classifiers like random forest, achieving 87% car origin accuracy.
score and evaluate linear regression models using 10-fold cross-validation, comparing training and test accuracy and mean squared error to gauge model quality and the benefits of parallel evaluation.
Visualize model performance with a scatterplot of actual fuel efficiency versus predicted values and a support vector machine classifier decision threshold to illustrate classifier behavior.
Anaconda Fusion connects Excel spreadsheets and the open data science world through Jupyter Notebooks, enabling analysts and data scientists to collaborate by accessing routines and algorithms from Excel.
Learn how to install and start Anaconda Fusion on Mac or Windows (Excel 2016+), configure the Fusion channel and environment, then connect Excel to the Fusion server for notebook workflows.
Connect a spreadsheet to a Fusion code sheet and run a transaction inspector to visualize sales data. Explore categories and entries to see how code sheets process datasets with Fusion.
Access distributed data across formats—from text files to HDFS—using Mosaic, build synthetic views with data connectors and APIs, and explore and visualize joined data in the data browser.
Learn how to install mosaic with Anaconda Enterprise, obtain a license, and launch it from the command line or the Anaconda Navigator app panel via the cloud channel.
Install mosaic via the Anaconda Mosaic channel, create a conda environment, start a web server, register data sources, and build data views that export to pandas data frames for Jupyter.
Unlock Your Future in Data Science & Machine Learning: From Novice to Pro
Transform Data into Decisions. Propel Your Career Forward.
Are you ready to master the critical skills that define today's data-driven world? Our expertly crafted Data Science and Machine Learning course is your launchpad into the tech industry's most dynamic field. Whether you're aiming for a career change or seeking to supercharge your existing expertise, this comprehensive program provides the advanced knowledge and practical techniques to make you an indispensable asset in any competitive job market.
Go Beyond the Buzzwords: Master Ethical, Interpretable, and Applied Data Science
In a world saturated with data, the ability to extract meaningful, actionable insights is paramount. This course doesn't just teach you algorithms; it instills a deep understanding of bias avoidance, model interpretability, and real-world problem-solving. You'll emerge with a robust toolkit, capable of transforming raw information into strategic business intelligence that drives impactful results.
Harness the Power of Anaconda: The Industry-Standard Data Science Ecosystem
Step into the professional realm with Anaconda, the all-in-one platform trusted by data scientists globally. With its rich ecosystem supporting over 100 powerful packages and languages like Python, R, and Scala, Anaconda will become your command center for streamlined workflows and unparalleled efficiency in all your data science operations.
Designed for Aspiring Data Professionals (Prerequisites: Basic Python & Data Science Fundamentals)
This course is meticulously structured for individuals with a foundational understanding of Python and core data science principles. If you're a data analyst, business analyst, or an aspiring data scientist eager to elevate your skills, this is your opportunity to build upon that knowledge and achieve mastery.
From Theory to Tangible Skills: Solve Real-World Challenges
Our curriculum is engineered for impact. You won't just learn theory; you'll get your hands dirty with the essential tools and techniques wielded by industry leaders. Dive deep into Python, Pandas, Scikit-learn, Keras, Prophet, Statsmodels, SciPy, and more. Develop a commanding grasp of statistics, probability, and data visualization, empowering you to not only uncover insights but also to communicate them with clarity and conviction.
Course Highlights: Your Pathway to Expertise
Comprehensive Data Science Immersion: Build an unshakeable understanding of the data science landscape and its transformative applications across diverse industries.
Master Core Data Science Concepts: Gain profound insights into the foundational pillars of data science, including sophisticated data cleaning, strategic feature engineering, and rigorous model evaluation.
Data Manipulation Mastery with Python Pandas: Achieve fluency in manipulating and analyzing complex datasets with Pandas, the cornerstone library for high-performance data analysis.
Dynamic Storytelling with Bokeh Visualizations: Craft compelling, interactive visualizations with Bokeh, enabling you to present data-driven narratives that captivate and inform.
Streamlined Package & Environment Management: Learn best practices for orchestrating the diverse libraries within Anaconda, ensuring seamless, reproducible, and efficient project workflows.
Unlock Machine Learning with Scikit-Learn: Journey into the heart of machine learning. Implement and optimize supervised and unsupervised learning algorithms for sophisticated classification, clustering, and regression tasks.
Leverage a Powerful Data Science Stack: Maximize your analytical capabilities by harnessing Anaconda’s comprehensive suite of tools, designed for cutting-edge data science.
Intensive Hands-On Data Analysis: From meticulous data cleaning to architecting complex dataset models, gain invaluable practical experience in sorting, classification, clustering, and regression.
Real-World Application Focus: Engage with authentic, real-world datasets throughout the course, ensuring the skills you forge are immediately transferable to your professional endeavors.
What You Will Achieve:
Foundational & Advanced Data Science Understanding: Articulate the scope, applications, and impact of data science in today's business environment.
Command of Core Data Science Principles: Internalize the fundamental concepts that empower every successful data scientist.
Expert-Level Data Wrangling with Pandas: Become highly proficient in data manipulation, transforming raw data into analysis-ready formats with ease.
Create Impactful Visualizations with Bokeh: Master the art of designing insightful, interactive dashboards and plots that clearly convey complex findings.
Efficient Project Management with Anaconda: Gain expertise in leveraging Anaconda’s ecosystem to manage dependencies and streamline your data science projects from inception to deployment.
Build & Optimize Machine Learning Models: Confidently develop, evaluate, and refine machine learning models using Scikit-learn to solve predictive challenges.
Maximize Your Analytical Power: Unlock the full potential of Anaconda and its integrated stack of tools for robust, insightful data analysis.
Practical, Job-Ready Skills: Through hands-on exercises, master data cleaning, feature engineering, classification, clustering, and regression modeling, preparing you for real-world scenarios.
Your Transformation Awaits.
By the conclusion of this immersive course, you will possess the knowledge, skills, and confidence to dissect complex data science problems and engineer machine learning models that deliver powerful, actionable insights. Whether your goal is to make a significant impact in your current role or to forge a new career path in the exhilarating field of data science, this course provides the definitive expertise you need to succeed.
Don't Just Learn Data Science – Become a Data Scientist.
This isn't merely about learning tools; it's about cultivating the critical thinking and practical abilities to solve tangible business challenges using the power of data. Seize this opportunity to unlock your true potential and advance your career in one of the 21st century's most vital and rewarding fields.
Enroll Now and Ignite Your Data Science Journey Today!