International Research Journal of Engineering and Technology (IRJET)
e-ISSN: 2395 -0056
Volume: 04 Issue: 04 | Apr -2017
p-ISSN: 2395-0072
www.irjet.net
Comparative Study on Multi- Modal Topic Modeling with Dense block Detection Prajakta Sonone1, Prof. A.V.Deorankar2 1P.G.
Scholar, Department of Computer Science and Engineering, Government college of Engineering, Amravati, Maharashtra, India
2Associate
Professor, Department of Information Technology Government college of Engineering, Amravati, Maharashtra, India
---------------------------------------------------------------------***-------------------------------------------------------------------Abstract— There has been incredible growth of events 1. Introduction over the internet in recent years. Google has become With the immense growth of internet, more the giant source of knowledge for any event which has amounts of multimedia contents are generated over happened or happening over the internet. Some the internet. The social media sites like Face book, networking sites such as facebook, micro blogging sites Flickr, Google News are the most popular sites among such as twitter are evolved with time and became the the users of internet. Micro blogging sites like Twitter highly used sites over the internet. Various E-commerce and E-commerce webites contains trending topics, websites such as Amazon, Ebay, Flipkart etc are the retweeting for given hashtag and ads, posts related to widely used sites for online shopping. Above mentioned particular product respectively. While visiting these sites generates large amount of text data. In association websites user may get various posts, various tweets with text data some images are also uploaded over the from twitter, news related articles having associated internet on these sites. The images associated with images. So it is difficult for them to analyse and track particular topic plays vital role for understanding the those events with detailed summary. While modelling semantic relationship between the textual content and images and text data semantic relationship have less visual content which are related to images. To model focus in previous studies. So Multi-modal event topic this huge amount of data having both textual and visual model helps to analyse, summarize and track those contents multi-modal topic model is suggested in this events and studies semantic relationships effectively. paper. While dealing with multi-modality, study of [1] semantic relationship between the images and text data For studying suspicious behaviours in multimodal is crucial part. This model also helps to study semantic data,In case of E-commerce sites, financial activities of relationship between them effectively. Topics which are “who-trades-what stocks”, EBay’s “who-buys-fromtrending, popular over the world can be seen on Social whom” graph can be taken into consideration. Reviews, sites as well as micro blogging sites. In online shopping sites fake reviews, advertises, spam spreading advertises can be misused for spreading wrong information is posted. So to study this suspicious information or post fake advertises. So if we detect the behavior dense suspicious block is also detected in this dense block of information in given document then it model. In this paper some and methods are presented becomes easier to study the lockstep behaviors because and compared. The focus is to effectively model the for example, one tweet with multiple retweet or textual and visual contents in multi-modal dataset multiple tweets with multiple retweets the difference having dense block for examining lockstep behaviors. can be clearly detected in case of suspicious behavior. Any behavioral factors which can spread the ads which Keywords—suspicious behavior, topic model, multiare fake and which can spread the spam. So Many modality, Event tracking, event detection. commercial products and approaches are attracted towards the behavior analysis of the problem so that this fraud containing information or any spam and advertising with URL hijacking can be detected. For this © 2017, IRJET
|
Impact Factor value: 5.181
|
ISO 9001:2008 Certified Journal
| Page 2475