IRJET- Fake News Detection using Logistic Regression & Multinomial Naive Bayes

International Research Journal of Engineering and Technology (IRJET)

e-ISSN: 2395-0056

Volume: 08 Issue: 04 | Apr 2021

p-ISSN: 2395-0072

www.irjet.net

FAKE NEWS DETECTION USING LOGISTIC REGRESSION & MULTINOMIAL NAIVE BAYES Abhishek Singh1, Aditya Ugale2, Niraj Shah3, Prof. Amruta Sankhe4 1,2,3Student,

Information Technology Department, Atharva college of engineering, Maharashtra, India 4Asst. Professor, Atharva College of Engineering, Maharashtra, India ---------------------------------------------------------------------***---------------------------------------------------------------------Abstract – In today’s worlds where people are more reliable on the news which are available online as it's convenient for them. As the use of the internet is increasing so thus the spread of fake news also. As the spread of such fake news can be intentional or unintentional but this affects society. Thus, an increasing number of fake news has to be controlled by using the computational tool which predicts such misleading information as if it is fake or real. In this article, we have focused on developing such computational tool to help classify news using two different algorithms. Which helps the model to be more trustworthy We will describe the pre-processing, feature extraction, classification and prediction process in detail. We’ve used Logistic Regression and Multinomial language processing techniques to classify fake news. The preprocessing functions perform some operations like tokenizing, lemmatization and exploratory data analysis like response variable distribution and data quality check (i.e., null or missing values). Simple Count Vectorization, TF-IDF is used as feature extraction techniques. The logistic regression and Multinomial model are used as a classifier for fake news detection with a probability of truth.

with these resources is that human expertise is required to identify articles/websites as fake. As human beings, when we read a sentence or a paragraph, we can interpret the words with the whole document and understand the context. In this project, we teach a system how to read and understand the differences between real news and the fake news using concepts like natural language processing, NLP and machine learning and prediction classifiers like the Logistic regression and multinomial Naïve bayes which will predict the truthfulness or fake-news of an article. We have also made a sentimental analysis of the news or article as to get as its positive or negative news.

2. LITERATURE REVIEWS In general, Fake news could be categorized into three groups. The first group is fake news, which is news that is completely fake and is made up by the writers of the articles. The second group is fake satire news, which is fake news whose main purpose is to provide humour to the readers. The third group is poorly written news articles, which have some degree of real news, but they are not entirely accurate. In short, it is news that uses, for example, quotes from political figures to report a fully fake story. Usually, this kind of news is designed to promote certain agenda or biased opinion.

Key Words: Fake news detection, Logistic regression, TFIDF, count vectorization, Multinomial Naïve Bayes, NLP, feature selection.

1. INTRODUCTION

“Fake News Detection” Akshay Jain, Amey 2018 IEEE. Information preciseness on Internet, especially on social media, is an increasingly important concern, but web-scale data hampers, ability to identify, evaluate and correct such data, or so called “fake news,” present in these platforms. In this paper, we propose a method for “fake news” detection and ways to apply it on Facebook, one of the most popular online social media platforms. This method uses Naive Bayes classification model to predict whether a post on Facebook will be labelled as REAL or FAKE. The results may be improved by applying several techniques that are discussed in the paper. Received results suggest, that fake news detection problem can be addressed with machine learning methods. Kasbe[1],

These days’ fake news is creating different issues from sarcastic articles to a fabricated news and plan government propaganda in some outlets. Fake news and lack of trust in the media are growing problems with huge ramifications in our society. Obviously, a purposely misleading story is “fake news” but lately blathering social media’s discourse is changing its definition. Some of them now use the term to dismiss the facts counter to their preferred viewpoints. Fortunately, there are a number of computational techniques that can be used to mark certain articles as fake on the basis of their textual content. Majority of these techniques use fact checking websites such as “PolitiFact” and “Snopes.” There are a number of repositories maintained by researchers that contain lists of websites that are identified as ambiguous and fake. However, the problem

|

Impact Factor value: 7.529

|

ISO 9001:2008 Certified Journal

|

Page 1705

Turn static files into dynamic content formats.

Create a flipbook