How to Become a Data Scientist in 6 Weeks

A free video tutorial from Kirill Eremenko
DS & AI Instructor
Rating: 4.5 out of 5Instructor rating
50 courses
2,779,273 students
The 6-Week Data Scientist

Learn more from the full course

Intro to Data Science: QuickStart Guide + AI & ChatGPT Prize

Learn the critical elements of Data Science, from visualization to databases to Python and more, in just 6 weeks!

05:11:46 of on-demand video • Updated April 2024

The entire Data Science process
Cloud concepts & application in Data Science
Database concepts
Statistics fundamentals as needed in Data Science
Visualizations for data mining and presentation
An overview on Statistical Learning
The essentials of Machine Learning
More advanced Python to apply to Data Science
English [Auto]
Hello and welcome back to the intro to Data Science Course. And today we're going to talk about a brand new concept that we're introducing to the world called the six Week Data Scientist. We have seen lots of success stories. We've been through the journey ourselves in some cases several times. And we know what it takes to become a data scientist. And our statement here is that you can become a data scientist in six weeks. You can become a starting data science, not promising that you can become an A super successful best data science in the world. No, but you can become a data scientist that can add value to businesses, that can apply machine learning, visualization, understand statistical concepts and use Python and other programming languages and apply machine learning concepts within six weeks. And to prove that we've got a specific timeline set out for your specific path with a timeline attached to it. So we're going to go through it now and we'll show you exactly what you need to study and in what order and how much time you need to spend on each topic in order to get there. You ready? Let's dive straight into it. All right. So you start here on day zero. And the first thing that we suggest looking at is statistics and all of these topics that we're going to cover off here. There's going to be ten steps to get to the finish. We're going to talk about them in more detail in this section of the course, and you will get to know them a bit better. So don't worry if you feel a bit like you don't really understand why something is important or why something is necessary, what it constitutes, that's okay. Because first of all, we'll talk about it in this section of the course. And then moreover, we will cover off this whole map again in the conclusion of the course. Just to reiterate everything, once you have a different perspective, once you know what these topics are about. So statistics, we suggest spending six days on statistics. And the reason so we're going to mention in this tutorial, we're going to mention the why, the reason why we think these things are important and the reason the why for statistics is to avoid costly mistakes. I'm a firm believer that you don't really need to know all the statistics in the world to be a successful data scientist, but you do need a core foundation in order to avoid mistakes and in order to avoid misrepresenting results and misinterpreting insights. And in this course, we'll tell you exactly what statistics to focus on. Its should be sufficient for you to be able to grasp those concepts in six days. Next step is to learn the data science process. And don't worry, we've got you covered here. We will actually walk through the data science process in one of the tutorials in this course. So that part we've got covered for you and you can always reference back to it if you need a refresher. It's got five steps and the data science process is the process that pretty much any data science project goes through. And it's important to kind of distinguish what we're talking about here is your roadmap to becoming a data scientist, whereas the data science process is the process. A data science project goes through. It contains five steps and it's good to be aware of it so that you know the bigger picture. But once again, we've got you covered on this. And so let's proceed further. The next step is visualization, learning about visualization and picking a tool and learning how to apply that tool to create successful and insightful visualizations. It should be sufficient. Ten days should be sufficient to get you going on the way. There are some very powerful and yet simple to use tools that allow drag and drop, drag and drop interface. Some examples are Tableau, Click, View and Power BI. So those are just three tools I can name off the top of my head that it takes like 2 or 3 days to get your head around and be able to use, and in ten days you can create. By then you can create like five very powerful visualizations and you will know 60% of that tool. You will know how to use it in how to leverage its power for your work. So we're allocating ten days to it so that you can get to get to know it and actually practice. In that time, we actually expect that you fulfill the next checkpoint. So these orange or these brown boxes identify checkpoints and we expect that or we suggest in those ten days you attempt your first data science project. So what we're saying here is that those ten days are enough to learn visualization and practice some follow along kind of projects with instructors, whether you're taking these courses online in person, reading books or watching videos. But what we're saying with the checkpoint is like, try to attempt your first data science project using visualization techniques that you learn in those ten days and you will see. That it is actually not as difficult as you think. You might not be able to complete the project end to end. Not yet, but you'll already be able to get some insights and get some valuable information from the data. Next from there, the next step is databases. So the reason the why behind databases is so that you expand your reach of data so that you can have access to more data. So up until this point, you're probably looking at data well in this process. So in statistics and realization, you will be probably looking at data in CSV files or Excel files. Whereas if you learn the basics of SQL databases, you'll be able to get more data, access to more data and therefore perform more analytics. And again, we allocating five days to this simply because to get the basics of working with databases five days is well more than enough to understand simple concepts like a select statement in SQL that takes just a couple of days, and in five days you should be able to know the basics of how to extract some data from a database. The next step in our path is statistical learning. And so this is the tutorial in this section of the course where you will get to meet, learn. And if you haven't met him yet, then for the first time and learned upon this is the second instructor on the course and we've created lots and lots of courses together. And so he'll be telling you about statistical learning. The reason why we're allocating only two days to statistical learning is not because it's a simple concept, but because we want to make sure that you don't get bogged down in statistical learning or you don't get too carried away with it. Those are two extremes that are not that are going to create hurdles for you progressing to becoming a data scientist. In order to be a data scientist, you don't really need to know all the ins and outs of every single algorithm that you might need. If you want to do research in the space of data science. But some some basic statistical learning principles are going to be sufficient in order to just give you an example of what it's all about. So the the why here is in order to get a feel for what statistical learning is and to see that if you want to delve deeper into a certain machine learning algorithm, you can because of statistical learning. And then finally, next step and probably the largest step in our journey is machine learning. We're allocating 12 days to this because understanding how to apply machine learning algorithms and when to apply which one is very important. It is one of the most used concepts. In fact, it is the most used concept in the whole data science. You will see that a lot of business problems, a lot of industry problems, a lot of health problems, a lot of all sorts of types of problems are actually solved using machine learning. And that's why it's such an important concept and such an important space. And the idea here is that you take those 12 days to learn several algorithms between 3 and 5 different core machine learning algorithms that you would look into something from regression, something from classification, something from clustering. Those are just branches of machine learning. Each one of them has at least 3 to 7 different algorithms. So but if you learn at least one algorithm from every single one of those branches, one from regression, one from clustering, one from classification, and maybe 1 or 2 other ones, you just get to practice them. If you spend two days per algorithm, that should be totally enough to give you an understanding of, first of all, that you can do it. And second, the types of problems that they solve so that in the future, when you come across a problem, you know, Oh, I've seen this somewhere or it feels like I need to use this algorithm and I'm going to go do that. So that's machine learning. After that, we've got another checkpoint attempt and Advanced data Science project. In fact, that checkpoint is actually included in those 12 days. So as you spend the 12 days learning 3 to 5 different algorithms, give it a shot to take on an advanced data science challenge, and you'll be surprised at how far you can progress already. Then we will then we allocate five days to learning more about Python. So in this case we're suggesting to apply machine learning using Python and therefore learning more about Python is going to help you speed up your process. So the how behind learning or the why behind learning Python is to increase the agility with which you can solve these machine learning challenges. Five days should be enough to learn the basic principles of Python and get enough hands on practice so that you're more comfortable using Python. And finally, at the end of our journey, we're allocating two days to deep learning again. It's not because it's a simple concept. It's rather to get a feel for deep learning, to understand what is possible in the world of deep learning. What other. The opportunities it opens up. So you always know that there's this alternative to your visual analytics and to your machine learning analytics. There's something even more advanced, which is called deep learning. And then one day when you do decide to dive into it, you know that it's there and you know where approximately in which direction you need to go. So it's a good, good to have that overview. And so after that, ta da, 42 days you made it. If you follow this process and you diligently put in the hours every single one of these days that we specified here at the end, by day 42, you will be ready to go out there and to tell the world that a new data scientist has arrived and you can go on Kaggle and start looking at different competitions where you can participate, you can go on Upwork and create your data science profile and say that you're starting out data scientist and start bidding for jobs and getting hands on experience that way. And maybe if you're starting out your on your own, then you can approach your first client. It might be scary, but based on all the work you've done and all the hours that you've put in by this stage, you are ready to start into the real world. So that's that's our journey. And in fact, in this course, we've got two extra features for you. We'll give you a feel for what it's like to be a data scientist. We'll give you a feel for what it's like to apply visualization. We'll be talking about visualization for data mining. So we have a whole section on where you can get to play around with Tableau and solve a real world challenge step by step together with us. And also same thing for machine learning. We have a whole section where you'll actually solve two challenges together with us in machine learning using Python. So very excited about this course. We hope you're excited about this journey. As excited as we are, we've just come up with this concept and we've just introducing it to the world. So you are going to be our first subjects that are going to go through this, and we're very confident that if you follow this course, if you take this course to the end and you actually get get a feel for what data science is all about and you decide that it is what you want, it is what you're after, then then if you follow this map and you put in the hours and all of these days and you use credible sources for your knowledge, for your practice, then we're very confident that by day 42, you'll be out there and ready to take on data science challenges. All right. So I hope you're excited. Let's dive straight into the next tutorial and I'll see you there. Until then, enjoy data science.