
This video provides an overview of the entire course.
Explain the general idea behind ensemble methods and discuss at a high level the intuition of the main ensemble methods - bagging, random forest and boosting.
Present with a practical example the procedure to build ensemble methods for regression tasks and compare the results of ensemble methods with other simpler methods.
Present with a practical example the procedure to build ensemble methods for classification tasks and compare the results of ensemble methods with other simpler methods.
Explain the main problem with hold out cross validation and explain the approach of K-fold cross-validation to solve this problem. Present how to do K-fold cross validation in scikit-learn
Show how k-fold cross-validation can be used to get a better assessment of the performance of the models and hence to make better model comparison.
Explain the need for hyper-parameter tuning when building predictive analytics models and show how we can combine k-fold cross-validation with grid search to do it.
Explain why we need different methods to distinguish between useful and useless features and explain the ones that will be used in the video: low variance, statistical tests, and RFE.
Explain the idea of dimensionality reduction and the intuition of the Principal Component Analysis method and show how to use this tool in scikit-learn.
Explain what is feature engineering and the different methods and approaches to create new features show with examples how to do it with the dataset used in the course.
Show the practical use of the techniques shown in the section and show how to improve one of the models built in the previous sections using feature engineering.
Give an overview of Artificial Neural Networks and its basic components: the perceptron, show intuitively how to construct networks of perceptrons.
Explain how to go from one single layer of perceptrons to a multi-layer model (deep model) then give a general overview of all the elements that need to be considered to train these types of models.
Explain what is TensorFlow, what is used for and how to install the CPU version of the library
Discuss the main concepts and objects used in TensorFlow, the goal is the viewer to have some familiarity with the way this library works.
Present and introductory example to show how a simple predictive model looks like in TensorFlow.
Show how to build a regression model using a Deep Neural Network and a real-world dataset, show the necessary steps for reading a dataset, train and evaluate the model.
Show how to build a classification model using a deep neural network and a real-world dataset, show the necessary steps for reading a dataset, train and evaluate the model.
Ensemble methods offer a powerful way to improve prediction accuracy by combining in a clever way predictions from many individual predictors. In this course, you will learn how to use ensemble methods to improve accuracy in classification and regression problems.
When using Predictive Analytics to solve actual problems, besides models and algorithms there are many other practical considerations that must be considered like which features should I use, how many features are enough, should I create new features, how to combine features to give the same underlying information, which hyper-parameters should I use? We explore topics that will help you answer such questions.
Artificial Neural Networks are models loosely based on how neural networks work in a living being. These models have a long history in the Artificial Intelligence community with ups and downs in popularity. Nowadays, because of the increase in computational power, improved methods, and software enhancements, they are popular again and are the basis for advanced approaches such as Deep Learning. This course introduces the use of Deep Learning models for Predictive Analytics using the powerful TensorFlow library.
About the Author :
Alvaro Fuentes is a Data Scientist with an M.S. in Quantitative Economics and a M.S. in Applied Mathematics with more than 10 years of experience in analytical roles. He worked in the Central Bank of Guatemala as an Economic Analyst, building models for economic and financial data. He founded Quant Company to provide consulting and training services in Data Science topics and has been a consultant for many projects in fields such as; Business, Education, Psychology and Mass Media. He also has taught many (online and in-site) courses to students from around the world in topics like Data Science, Mathematics, Statistics, R programming and Python.
Alvaro Fuentes is a big Python fan and has been working with Python for about 4 years and uses it routinely for analyzing data and producing predictions. He also has used it in a couple of software projects. He is also a big R fan, and doesn't like the controversy between what is the “best” R or Python, he uses them both. He is also very interested in the Spark approach to Big Data, and likes the way it simplifies complicated things. He is not a software engineer or a developer but is generally interested in web technologies.
He also has technical skills in R programming, Spark, SQL (PostgreSQL), MS Excel, machine learning, statistical analysis, econometrics, mathematical modeling.
Predictive Analytics is a topic in which he has both professional and teaching experience. Having solved practical problems in his consulting practice using the Python tools for predictive analytics and the topics of predictive analytics are part of a more general course on Data Science with Python that he teaches online.