INTERNATIONAL RESEARCH JOURNAL OF ENGINEERING AND TECHNOLOGY (IRJET) VOLUME: 06 ISSUE: 05 | MAY 2018
WWW.IRJET.NET
E-ISSN: 2395-0056 P-ISSN: 2395-0072
Opinion Mining of Twitter Data for Hotel Review Analysis SRIKANTH M S, KAMAD BHATT, HARSH RAJ, MD. ADNAN AHMED Assistant Professor, Dept. of Computer science and Engineering, Sapthagiri college of Engineering (VTU)Bengaluru, India, srikanthms@sapthagiri.edu.in Student, Dept. of Computer science and Engineering, Sapthagiri college of Engineering (VTU), Bengaluru, India, kamadbhatt96@gmail.com Student, Dept. of Computer science and Engineering, Sapthagiri college of Engineering (VTU), Bengaluru,India, rajharsh1997@gmail.com Student, Dept. of Computer science and Engineering, Sapthagiri college of Engineering (VTU), Bengaluru, India, adnan.workoffice@gmail.com
Abstract— The rapid increase in mountains of
Keywords-sentiment analysis, text mining, association
unstructured textual data accompanied by proliferation of
rule, bag of words, opinion mining
tools to analyse them has opened up great opportunities
I. INTRODUCTION
and challenges for text mining research. The automatic
In recent years, the world has experienced a
labelling of text data is hard because people often express
tremendous rise in the volume of textual data especially
opinions in complex ways that are sometimes difficult to
for the unstructured data generated from people who
comprehend. The labelling process involves huge amount
express opinions through various web and social media
of efforts and mislabelled datasets usually lead to
platforms for different reasons. Mountains of these
incorrect decisions. In this paper, we design a frame work
textual data, initially could be equated to garbage which
for sentiment analysis with opinion mining for the case of
would need to be disposed from time to time. However,
hotel customer feedback. Most available datasets of hotel
with the advancement in storage capacity accompanied
reviews are not labelled which presents a lot of works for
by the increasing sophistication in data mining tools,
researchers as fares text data pre-processing task is
opportunities and challenges have been created for
concerned. Moreover, sentiment datasets are often highly
analysing and deriving useful insights from these
domain sensitive and hard to create because sentiments
mountains of data.
are feelings such as emotions, attitudes and opinions that are
commonly
rife
with
idioms,
onomatopoeias,
In this paper, we have chosen textual data in the form of
homophones, phonemes, alliterations and acronyms. The
hotel reviews for sentiment analysis with opinion mining
proposed framework is termed sentiment polarity that
from customer perspectives. Sentiment analysis uses the
automatically prepares a sentiment dataset for training
techniques
and testing to extract unbiased opinions of hotel services
computational linguistics to automate the classification
from reviews to discover a suitable machine learning
of sentiments generated from reviews. Hotels provide
algorithm for the classification component of the
satisfaction, security, comfort, luxury and lodging
framework.
services for travellers and people on vacation. Mining
of
natural
language
processing
and
hotel reviews is desirable to gain deeper knowledge of
Š 2019, IRJET
|
Impact Factor value: 7.211
|
ISO 9001:2008 Certified Journal
|
Page 4483