Pattern Analysis and Intelligent Systems
Mehrnaz Mirhasani; Reza Ravanmehr
Volume 6, Issue 4 , November 2020, , Pages 251-264
Abstract
The movie recommendation systems are always faced with the new movie cold start problem. Nowadays, social media platform such as Twitter is considered as a rich source of information in various domains, like movies, motivated us to exploit Twitter's content to tackle the movie cold start problem. In ...
Read More
The movie recommendation systems are always faced with the new movie cold start problem. Nowadays, social media platform such as Twitter is considered as a rich source of information in various domains, like movies, motivated us to exploit Twitter's content to tackle the movie cold start problem. In this study, we propose a hybrid movie recommendation method utilizing microblogs, movie features, and sentiment lexicon to reduce the effect of data sparsity. For this purpose, first, the movie features are extracted from the Internet Movie Database (IMDB), and the average IMDB score is calculated during the 7-days opening of the movie. Second, the related tweets of the movie and the cast are retrieved by the Twitter API. Third, the polarity of tweets and the public’s feeling towards the target movie is extracted using sentiment lexicon analysis. Finally, the results of the three previous steps are integrated, and the prediction is obtained. Our results are compared with the sales volume of the target movie in 7-days opening, which is available in the Mojo Box office. In addition to the real-world benchmarking, we performed extensive experiments to demonstrate the accuracy and effectiveness of our proposed approach in comparison with the other state-of-the-art methods.
Computer Networks and Distributed Systems
Azam Seilsepour; Reza Ravanmehr; Hamid Reza Sima
Volume 5, Issue 3 , August 2019, , Pages 143-160
Abstract
Big data analytics is one of the most important subjects in computer science. Today, due to the increasing expansion of Web technology, a large amount of data is available to researchers. Extracting information from these data is one of the requirements for many organizations and business centers. In ...
Read More
Big data analytics is one of the most important subjects in computer science. Today, due to the increasing expansion of Web technology, a large amount of data is available to researchers. Extracting information from these data is one of the requirements for many organizations and business centers. In recent years, the massive amount of Twitter's social networking data has become a platform for data mining research to discover facts, trends, events, and even predictions of some incidents. In this paper, a new framework for clustering and extraction of information is presented to analyze the sentiments from the big data. The proposed method is based on the keywords and the polarity determination which employs seven emotional signal groups. The dataset used is 2077610 tweets in both English and Persian. We utilize the Hive tool in the Hadoop environment to cluster the data, and the Wordnet and SentiWordnet 3.0 tools to analyze the sentiments of fans of Iranian athletes. The results of the 2016 Olympic and Paralympic events in a one-month period show a high degree of precision and recall of this approach compared to other keyword-based methods for sentiment analysis. Moreover, utilizing the big data processing tools such as Hive and Pig shows that these tools have a shorter response time than the traditional data processing methods for pre-processing, classifications and sentiment analysis of collected tweets.