International Research Journal of Engineering and Technology (IRJET)
e-ISSN: 2395-0056
Volume: 06 Issue: 03 | Mar 2019
p-ISSN: 2395-0072
www.irjet.net
Question-Answer Text Mining using Machine Learning Abhishek Kumar1, Kunal Thakur2, Pranav Nachnekar3, Nayana Vaity4 1,2,3BE 4Professor,
Student, Computer Engineering, Terna Engineering College, Nerul, Maharashtra, India
Dept. of Computer Engineering, Terna Engineering College, Nerul, Maharashtra, India
---------------------------------------------------------------------***---------------------------------------------------------------------
•Keywords extraction algorithms [1] like TF-IDF and RAKE (Rapid automatic keyword extraction) need large number of dataset to work properly.
Abstract - Questions and Answers platform are an
important part of today’s academic world. They enable users to provide, search and search knowledge. However, most such platforms today have a major problem, that of duplication of knowledge. The aim of this project is to avoid this duplication by applying machine learning algorithms to the question, when it is being submitted and help the user find a pre-existing solution to the problem if it already exists on the platform. This helps the user navigate the platform more efficiently and reduces the amount of redundant questions on the platform. We intend to match questions based on the keywords used.
•Multiple occurrences of similar type of questions which are having same meanings. •To remove similar questions from database more manpower is required. •Existing systems are not suggesting appropriate tags to the user, tags are used to categorize the questions.
3. SUGGESTED IMPROVEMENTS
Key Words: Recursive Neural Network, Sequence to sequence model, MongoDB, Encoder, Decoder
•We are using Sequence to sequence model for keyword extraction which can extract keywords even from a sentence.
1.INTRODUCTION
•No extra manpower is required for removing similar questions in our system.
The age of Internet has changed the way people express their views. It is now done through blog posts, online discussion forums, product review websites etc. People depend upon this user generated content to a great extent. When someone wants to buy a product, they will look up its reviews online before taking a decision. The amount of user generated content is too large for a normal user to analyze. so to automate this, various Machine Learning techniques are used. Symbolic techniques or Knowledge base approach and Machine learning techniques are the two main techniques used in sentiment analysis. Knowledge base approach requires a large database of predefined emotions and an efficient knowledge representation for identifying sentiments. Machine learning approach makes use of a training set to develop a machine learning model that classifies sentiments. Since a predefined database of entire emotions is not required for machine learning approach, it is rather simpler than Knowledge base approach. Main aim of this project is to implement machine learning based sentiment analysis in a Q&A platform with the following features: •Easy posting of question and answers •Rating of questions and answers •Ranking the questions and answers according to the rating •Anonymous usage of the platform
•Appropriate tags are suggesting by analyzing the question by our system
4. METHODOLOGY AND IMPLEMENTATION
2. DRAWBACKS OF EXISTING SYSTEM
Fig – 1: Architecture of system
The current & previous Question & Answers platforms are having some drawbacks which are as follows:
© 2019, IRJET
|
Impact Factor value: 7.211
|
ISO 9001:2008 Certified Journal
|
Page 3975