International Research Journal of Engineering and Technology (IRJET)
e-ISSN: 2395 -0056
Volume: 03 Issue: 02 | Feb-2016
p-ISSN: 2395-0072
www.irjet.net
SOCIAL MEDIA MARKET TRENDER WITH DACHE MANAGER USING HADOOP AND VISUALIZATION TOOLS Raveena Rana [1], Tejas Shah [2], Jitanksha Shrimanker[3] 1-3BE
Student, Information Technology Department
K. J. Somaiya Institute of Engineering & Information Technology, Mumbai, Maharashtra, India ---------------------------------------------------------------------***--------------------------------------------------------------------Abstract- The modern business and advertising strategies
Twitter generate huge amounts of data on a daily basis.
all involve market analysis and study of the trends going on
This data is directly from the end user or consumer and as
in the world. Social media is the main advertising and
a result is of great value to the organizations and
analysis market in today’s world. The current Analytics tools
companies. [1]
and models that are available in the market are very costly,
This data is still scattered and not in a suitable
unable to handle Big Data and less secure. The traditional
format that may yield any results. Also the data is so huge
Analytics systems takes a long time to come up with results,
that the buzz-word Big Data is the only one that can
so it is not beneficial to use for Real Time Analytics. So, the
represent such an amount of data. Our Project is based on
proposed work resolves all these problems by combining the
Hadoop framework that can
Apache Open Source platform which solves the issues of Real
manage and handle Big Data efficiently. Main reasons for
Time Analytics using HADOOP. It also provides scalability
choosing Hadoop are:
and reduced cost over analytics by using open Source 1.
Software. [1]
The data generated by Social sites are too large for traditional databases and analysis tools.
2.
Key Words: Map Reduce (MR), Hadoop, Java , Flume ,
Hadoop being a framework gives more flexibility in implementing ideas.
Sqoop, Caching. 3.
Scalability is a major advantage of using Hadoop over other platforms.
I. INTRODUCTION The world is getting smaller and smaller day by day
Application developers specify the computation in
through internet and usage of social sites. Data is growing
terms of a map and a reduce function, and the underlying
exponentially as the number of users and activity over the
MR Job scheduling system automatically parallelizes the
web is increasing rapidly. The consumer market is
computation across a cluster of machines. MR gains
becoming increasingly competitive and the need to keep a
popularity for its simple programming interface and
tab on trends and consumers’ opinions is a must. [3]
excellent performance when implementing a large
Social Media is being extensively used by people
spectrum of applications. Since most such applications
belonging to all age groups’ for expressing their reviews
take a large amount of input data, they are nicknamed
and suggestions whenever they buy a new product or a
“Big-data
new
for
implementation of the Google MR programming model.
recommendations to friends and family of a particular
Hadoop consists of the Hadoop Common, which provides
product or service. Social media sites like Facebook and
access to the file systems supported by Hadoop. MR
product
is
launched.
Also
it
is
used
applications”.
Hadoop is
an open-source
provides a standardized framework for implementing
© 2016, IRJET
|
Impact Factor value: 4.45
|
ISO 9001:2008 Certified Journal
|
Page 1320