Movie topics- Efficient features for movie recommendation systems

Efficient Features for Movie Recommendation

Systems

Project presentation

Suvir Bhargav

Outline

● Motivation and Why movie reviews● Problem statement● How? or the overall system ● Text preprocessing approaches● Postprocessing: movie topics from a reviews

corpus● Similarity● Experimental setup and results

Thanks to Sean Lind, source: http://www.silveroakcasino.com/blog/posts/netflix/what-to-watch-on-netflix.html

Motivation

● movie genres are not enough.● classify movies

○ keywords○ moods○ imdb ratings○ micro genres

micro genres

source: http://www.theatlantic.com/technology/archive/2014/01/how-netflix-reverse-engineered-hollywood/282679/

Why movie reviews?

Source: a sample user written movie review from imdb

Problem statement

● Feature extraction from user reviews of movies

● Use extracted features to find similar movies.

The overall system

Movie reviews corpus● preprocessing

○ tokenization, stopwords, lemmatized.

● post processing○ topic modeling: Movie topics from a reviews corpus

● similarity measure○ return movies with similar topics distribution

tokenization, stopwords, lemmatized.

Simple information extraction

Text preprocessing

Figure credit to nltk book.

Post processing

Document representation: Vector Space Model (VSM)

Picture credit: pyevolve

Post processing: generative model

source: David blei’s slide

Post processing: LDA

For each document in the collection, the words can be generated in two stage process1) Randomly choose a distribution over topics.2) For each word in the document

a) Randomly choose a topic from the distribution over topics in step 1.

b) Randomly choose a word from the corresponding distribution over the vocabulary

Documents exhibit multiple topics

Movie topics from a reviews corpus

Similarity Measure

● Cosine Similarity● KL divergence● Hellinger distance

Cosine Similarity

Similarity Measure

Hellinger Distance

Similarity Measure

The overall system: implementation

Movie reviews corpus● preprocessing

○ nltk and gensim’s simple preprocessing.

● post processing○ gensim python wrapper to MALLET○ index topic distribution of query movies, q and 1k

movies corpus, C.

● similarity measure○ python numpy implementation○ apply distance metric on indexed q and C.○ sort and pick top 5 movies.

Experimental setup

Movie reviews corpus of 1k movies

reviews data source: imdb

Evaluation criteria

Experimental setup

Conclusion

● Movie topics as efficient features for RS○ represents movies by underlying semantic patterns

○ useful for capturing movie genre and mood.

○ but not so well with plot.

○ user written movie reviews are useful movie meta-data.

● The developed prototype○ easy to add more movie meta-data

○ python allows scalability.

○ Topics as an explanation needs further tuning.

Future directions

● Movie review preprocessing○ bigram, trigrams.○ create multi-word movie keywords or language

construction

● Building complex topic models○ Hierarchical LDA○ author-topic model

■ include authorship information.■ similarity between authors

Questions ?

Thank You

Image src: http://www.brinvy.biz/177215/batman-catching-a-ride-on-supermans-back-funny-hd-wallpaper-x.html

Extra slides

List of extra slides and notes● Original LDA paper● introduction to probabilistic topic modeling● and A. Huang’s Similarity measures for text document

clustering● Another good LDA description● Integrating out multinomial parameters in LDA● language construction in micro genres

Movie topics- Efficient features for movie recommendation systems

Technology

Recommendation Movie - Computer Scienceark/654/team/5/presentation3.pdf · Introduction Recommendation System: A technique which predicts what the user may like based ... Streaming

A Non-Intrusive Movie Recommendation System

APPLYING NEURAL NETWORKS TO MOVIE RECOMMENDATION · APPLYING NEURAL NETWORKS TO MOVIE RECOMMENDATION UTKARSH KAJARIA PROBLEM MODEL TRAINING RESULTS Idea: Use deep learning to incorporate

A hybrid approach for movie recommendation · A hybrid approach for movie recommendation George Lekakos & Petros Caravelas Published online: 21 December 2006 # Springer Science +

Matrix Factorization+ for Movie Recommendation Factorization+ for Movie Recommendation Lili Zhao,† Zhongqi Lu,† Sinno Jialin Pan, Qiang Yang† †Hong Kong University of Science

An Improved Collaborative Movie Recommendation System Using Computational Intelligence-2

A Social Media-Based Movie Recommendation System

Indian Regional Movie Dataset for Recommender …available for testing and bench-marking recommendation systems. We present an Indian regional movie dataset on similar lines. India

zMovie: The Movie Recommendation Engine I. ABSTRACTcse400/CSE400_2006_2007/ChenJatia/Writeup.pdf · zMovie CSE 401: Senior Design Jing Chen, jingchen@seas.upenn.edu Faculty Advisor:

Developing a Movie recommendation Engine with Spark

Personalized Movie Recommendation System Combining · in improving the exactness. Therefore, in this paper, the personalized movie recommendation system that combines data mining

[WI 2017] Affective Prediction By Collaborative Chains In Movie Recommendation

Movie Recommendation with DBpedia - IIR 2012

MOVIE RECOMMENDATION WITH K-MEANS ......2.1 K-means Clustering In this section, we briefly describe the K-means algorithm (Alpaydin, 2004) and its utilization for recommendation. K-means

Folksonomies, the Semantic Web, and Movie Recommendation

· .Founding Netflix.com to offer online movie rentals. .0ffering selling and subscription services. Events .Launching a personalized movie recommendation service (CineMatch) .lssuing

An Efficient Content Collaborative – Based and Hybrid Approach for Movie Recommendation Engine

Survey on Kernel Optimization based Enhanced Preference Learning for Online Movie Recommendation

zMovie: The Movie Recommendation Engine I. ABSTRACTcse400/CSE400_2006_2007/ChenJatia/Writeup.… · zMovie CSE 401: Senior Design Jing Chen, jingchen@seas.upenn.edu Faculty Advisor:

MOVIE RECOMMENDATION EXPERT SYSTEMmengu/Projects_files/movie-expert-readme.pdf · MOVIE RECOMMENDATION EXPERT SYSTEM Introduction Our movie recommendation expert system is composed