IRJET- Fake News Detector

Page 2

International Research Journal of Engineering and Technology (IRJET)

e-ISSN: 2395-0056

Volume: 07 Issue: 09 | Sep 2020

p-ISSN: 2395-0072

www.irjet.net

3. NATURAL LANGUAGE PROCESSING (NLP)

 Sentiment Analysis : Identifying the mood or subjective opinions within large amounts of text, including average sentiment and opinion mining.  Speech-to-text and text-to-speech Conversion : It Transforms voice commands into written text and vice versa.

NLP is a branch of artificial intelligence that helps computers understand, interpret, and manipulate human language. The NLP draws from many backgrounds to fill the void between human communication and computer comprehension, including computer science and computational language. Although natural language processing is not a new science, technology is evolving rapidly due to increasing interest in human-to-machine communications, as well as the availability of big data, powerful computing; and enhanced algorithms[2].As a human, you may speak and write in English, Spanish, or Chinese. But a computer’s native language known as machine code or machine language is largely understandable to most people. Communication occurs not with words but through millions of zeros and ones that produce logical actions.

 Document Summarization : It Automatically generates a summary of large bodies of text.

4. CHARACTERISTICS The contributions of this paper can be summarized as follows: We systematically characterize the Web sites and reputations of the publishers of the fake and real news articles on their registration patterns, Web site ages, domain rankings, domain popularity, and the probabilities of news disappearance from the Internet. We analyze the similarity and dissimilarity of the fake and real news on the most important terms of the news articles via the-IDF and LDA topic modeling[1].

3.1 Importance of NLP Large volumes of textual data Natural language processing helps computers communicate with humans in their language and scales other language-related tasks. For instance, NLP allows computers to read a text, hear speech, interpret it, calculate feelings, and decide which sections are significant. Today’s machines can analyze more language-based data than humans, without tiredness and in a consistent, unbiased way.

We explore document similarity between fake, real, or hybrid news articles via the Jaccard similarity coefficient to distinguish, classify, and predict fake and real news. The output of this paper is organized as follows.

Natural language processing includes many different techniques for interpreting human language, ranging from statistical and machine learning methods to rules-based and algorithmic approaches. Basic NLP tasks include tokenization and analyzing, part-of-speech tagging, language detection, and identification of logic.

Section 1: It describes the background of the fake news problem over online social media and also describes datasets. Section 2: It characterizes the Websites and reputations of the publishers of the fake and real news articles, Section 3: It focuses on analyzing the similarity and dissimilarity of the fake and real news on the most important terms of the news articles. Section 4: We show the promising direction of leveraging document similarity to distinguish fake and real news by measuring their document similarity. Section 5: Summarizes related work in detecting and analyzing fake news and highlights the difference between this effort with existing studies. Section 6: It concludes this paper and outlines our future work.

3.2 NLP Techniques  Content Categorization : A description of a linguistic text, including search and indexing, content warnings, and identification of duplicates.  Topic Discovery and Modeling : Accurately capture the meaning and themes in text collections, and apply advanced analytics to text, like optimization and predict.

5. ANALYTICS MODEL

 Contextual Extraction : Automatically convert structured information from textbased sources.

© 2020, IRJET

|

Impact Factor value: 7.529

This section describes the conceptual and mathematical ideas of the proposed analytical model developed for establishing the reliability of news articles. We start this section by describing the composition of complete and incomplete sentences. Next, we define events and topics extracted from complete sentences. Boolean-value

|

ISO 9001:2008 Certified Journal

|

Page 1410


Turn static files into dynamic content formats.

Create a flipbook
Issuu converts static files into: digital portfolios, online yearbooks, online catalogs, digital photo albums and more. Sign up and create your flipbook.
IRJET- Fake News Detector by IRJET Journal - Issuu