IRJET- Question-Answer Text Mining using Machine Learning

Page 1

International Research Journal of Engineering and Technology (IRJET)

e-ISSN: 2395-0056

Volume: 06 Issue: 03 | Mar 2019

p-ISSN: 2395-0072

www.irjet.net

Question-Answer Text Mining using Machine Learning Abhishek Kumar1, Kunal Thakur2, Pranav Nachnekar3, Nayana Vaity4 1,2,3BE 4Professor,

Student, Computer Engineering, Terna Engineering College, Nerul, Maharashtra, India

Dept. of Computer Engineering, Terna Engineering College, Nerul, Maharashtra, India

---------------------------------------------------------------------***---------------------------------------------------------------------

•Keywords extraction algorithms [1] like TF-IDF and RAKE (Rapid automatic keyword extraction) need large number of dataset to work properly.

Abstract - Questions and Answers platform are an

important part of today’s academic world. They enable users to provide, search and search knowledge. However, most such platforms today have a major problem, that of duplication of knowledge. The aim of this project is to avoid this duplication by applying machine learning algorithms to the question, when it is being submitted and help the user find a pre-existing solution to the problem if it already exists on the platform. This helps the user navigate the platform more efficiently and reduces the amount of redundant questions on the platform. We intend to match questions based on the keywords used.

•Multiple occurrences of similar type of questions which are having same meanings. •To remove similar questions from database more manpower is required. •Existing systems are not suggesting appropriate tags to the user, tags are used to categorize the questions.

3. SUGGESTED IMPROVEMENTS

Key Words: Recursive Neural Network, Sequence to sequence model, MongoDB, Encoder, Decoder

•We are using Sequence to sequence model for keyword extraction which can extract keywords even from a sentence.

1.INTRODUCTION

•No extra manpower is required for removing similar questions in our system.

The age of Internet has changed the way people express their views. It is now done through blog posts, online discussion forums, product review websites etc. People depend upon this user generated content to a great extent. When someone wants to buy a product, they will look up its reviews online before taking a decision. The amount of user generated content is too large for a normal user to analyze. so to automate this, various Machine Learning techniques are used. Symbolic techniques or Knowledge base approach and Machine learning techniques are the two main techniques used in sentiment analysis. Knowledge base approach requires a large database of predefined emotions and an efficient knowledge representation for identifying sentiments. Machine learning approach makes use of a training set to develop a machine learning model that classifies sentiments. Since a predefined database of entire emotions is not required for machine learning approach, it is rather simpler than Knowledge base approach. Main aim of this project is to implement machine learning based sentiment analysis in a Q&A platform with the following features: •Easy posting of question and answers •Rating of questions and answers •Ranking the questions and answers according to the rating •Anonymous usage of the platform

•Appropriate tags are suggesting by analyzing the question by our system

4. METHODOLOGY AND IMPLEMENTATION

2. DRAWBACKS OF EXISTING SYSTEM

Fig – 1: Architecture of system

The current & previous Question & Answers platforms are having some drawbacks which are as follows:

© 2019, IRJET

|

Impact Factor value: 7.211

|

ISO 9001:2008 Certified Journal

|

Page 3975


Turn static files into dynamic content formats.

Create a flipbook
Issuu converts static files into: digital portfolios, online yearbooks, online catalogs, digital photo albums and more. Sign up and create your flipbook.