
By the end of this task, you will be able to launch Google Colaboratory from your browser, get started by creating a brand new Python notebook to house your project codes in an appropriate runtime environment.
By the end of this task, you will be able to explain the approach to take to detect fake and real news using machine learning, get the data to use for training your fake and real news detection machine learning model and learn how to import and use the data in your Google Colaboratory project.
By the end of this task, you will be able to add a new cell, add comments, delete a cell, copy and cut cell content, save your work, execute your codes and restart the session if things go wrong without losing your written codes.
By the end of this task, you will be able to explore and visualise the contents of the two csv files in form of pandas dataframes, display the data in various charts, look for and fix missing values, duplicate values, class imbalance and all other relevant data cleaning tasks.
By the end of this task, you will be able to merge the two datasets, label the merged data by adding a label column and convert the data to numeric form.
By the end of this task, you will be able to create a machine learning model and train it on the fake and real news.
By the end of this task, you will be able to test and evaluate the performance of your model using the test data we held out just before training.
By the end of this task, you will learn how to save your trained model and download it for use in your application of choice.
By the end of this task, you will learn how to use your model in real life application.
This project is for anyone who wants to develop Data science and Machine learning projects but having limited resources on his computer and limited time. In less than 2 hours, you will learn how to develop and deploy a fake news detection data science project!
In essence, you will learn,
- how to design a real life data science project
- how to get data to train a machine learning model
- how to clean and preprocess your data
- how to create and train a model to learn from your data
- how to evaluate the performance of the trained model
- and finally, how to deploy the model in any real-life application of your choice.
According to wikipedia,
"Google Colaboratory (also known as Colab) is a free Jupyter notebook environment that runs in the cloud and stores its notebooks on Google Drive. Colab was originally an internal Google project; an attempt was made to open source all the code and work more directly upstream, leading to the development of the "Open in Colab" Google Chrome extension, but this eventually ended, and Colab development continued internally. As of October 2019, the Colaboratory UI only allows for the creation of notebooks with Python 2 and Python 3 kernels; however, an existing notebook whose kernelspec is IR or Swift will also work, since both R and Swift are installed in the container. Julia language can also work on Colab (with e.g. Python and GPUs; Google's tensor processing units also work with Julia on Colab."