
Explore building a fake news detection system using feature engineering, machine learning, logistic regression, and random forest. Examine accuracy, fairness audits, datasets, and NLP insights to mitigate bias.
Build a fake news detection system using machine learning and feature engineering with logistic regression and Python libraries, spanning ten chapters on tools, datasets from Kaggle, and bias mitigation.
Identify audiences—journalists and media professionals, data analysts and scientists, and policymakers and government officials—and learn to build a fake news detection system with machine learning and feature engineering in Python.
Build a fake news detection system with Python and libraries like pandas, matplotlib, numpy, and scikit-learn. Use Kaggle datasets and IDEs such as Google Colab, Jupyter Notebook, or VS Code.
Learn the basics of a fake news detection system using logistic regression, feature engineering, and random forest, while addressing data quality, imbalance datasets, evolving tactics, and ethical considerations.
Apply feature engineering to detect fake news by analyzing titles and sources, using keyword frequency and publisher credibility to compute a weighted prediction and classify real versus fake.
Explore how clickbait incentives, confirmation bias, political manipulation, social media echo chambers, sensationalism, and profit motives fuel the widespread of fake news and misinformation.
Set up Google Colab as a browser-based IDE, run Python code to compare real and fake news, and visualize results with a matplotlib pie chart.
Find and download a fake news classifications dataset from Kaggle, review columns like author, publish date, title, text, language, and real or fake label, then save the News articles CSV.
Create a new notebook in Google Colab, then upload the fake news dataset and read it with data = pd.read_csv('filename.csv'), using the data variable for later steps.
Explore the fake news dataset by inspecting the first and last rows, shape of 2096 rows and 12 columns, and data types, then prepare for cleaning missing values and duplicates.
Learn to clean a fake news dataset by identifying and removing missing values and duplicates, then save the cleaned data for machine learning workflows.
This lecture performs news source credibility analysis by calculating real versus fake news percentages per source and ranking the top ten most and least credible sources from a cleaned dataset.
Welcome to Detecting Fake News with Machine Learning course. This is a comprehensive project based course where you will learn step by step on how to build a fake news detection system using feature engineering, logistic regression, and other models. This course is a perfect combination between Python and machine learning, making it an ideal opportunity to enhance your data science skills. The course will be mainly focusing on three major aspects, the first one is data analysis where you will explore the fake news dataset from multiple angles, the second one is predictive modeling where you will learn how to build fake news detection system using big data, and the third one is to mitigate potential biases from the fake news detection models. In the introduction session, you will learn the basic fundamentals of fake news detection models, such as getting to know ethical considerations and common challenges. Then, in the next session, we are going to have a case study where you will learn how to implement feature engineering on a simple dataset to predict if a news is real or fake. In the case study you will specifically learn how to identify the presence of specific words which are frequently used in fake news and calculate the probability of a news article is fake based on the track record of the news publisher. Afterward, you will also learn about several factors that contribute to the widespread of fake news & misinformation, for examples like confirmation bias, social media echo chamber, and clickbait incentives. Once you have learnt all necessary knowledge about the fake news detection model, we will begin the project. Firstly you will be guided step by step on how to set up Google Colab IDE. In addition to that, you will also learn how to find and download fake news dataset from Kaggle, Once, everything is ready, we will enter the main section of the course which is the project section The project will be consisted of three main parts, the first part is the data analysis and visualization where you will explore the dataset from various angles, in the second part, you will learn step by step on how to build a fake news detection system using logistic regression and feature engineering, meanwhile, in the third part, you will learn how to evaluate the model’s accuracy. Lastly, at the end of the course, you will learn how to mitigate potential bias in fake news detection systems by diversifying training data and conducting fairness audits.
First of all, before getting into the course, we need to ask ourselves this question: why should we build fake news detection systems? Well, here is my answer. In the past couple of years, we have witnessed a significant increase in the number of people using social media and, consequently, an exponential growth in the volume of news and information shared online. While this presents incredible opportunities for communication, however, this surge in information sharing has come at a cost, the rapid spread of unverified, misleading, or completely fabricated news stories. These stories can sway public opinion, incite fear, and even have political and social consequences. In a world where information is power, the ability to distinguish between accurate reporting and deceptive content is very valuable. Last but not least, knowing how to build a complex machine learning model can potentially open a lot of opportunities.
Below are things that you can expect to learn from this course:
Learn the basic fundamentals of fake news detection model
Case study: applying feature engineering to predict if a news title is real or fake
Learn factors that contribute to the widespread of fake news & misinformation
Learn how to find and download datasets from Kaggle
Learn how to clean dataset by removing missing rows and duplicate values
Learn how to perform news source credibility
Learn how to detect keywords associated with fake news
Learn how to perform news title and length analysis
Learn how to detect sensationalism in fake news
Learn how to detect emotion in fake new with NLP
Learn how to build fake news detection model with feature engineering
Learn how to build fake news detection model with logistic regression
Learn how to build fake news detection model with Random Forest
Learn how to evaluate fake news detection model with confusion matrix
Learn how to perform fairness audit with demographic parity difference
Learn how to mitigate potential bias in fake news detection