Presentation by: Benjamin Schwehn, Ioan Stanculescu
Restricted Boltzmann Machines for Collaborative Filtering
Authors: Ruslan Salakhutdinov, Andriy Mnih, Geoffrey Hinton
Overview
The Netflix prize problem
Introduction to (Restricted) Boltzmann Machines
Applying RBMs to the Netflix problem
Probabilistic model
Learning
The Conditional RBM
Results
The Netflix Prize
Automated Recommender System (ARS)
Predict how a user would rate a movie
Recommend movies likely to be rated high
USD 1,000,000 prize for a 10% improvement
User ID    Movie ID   Movie Name        Release   Rating (1-5)   Rating Date
1488844    1          Dinosaur Planet   2003      3              2005-09-06
822109     1          Dinosaur Planet   2003      5              2005-05-13
2578149    1904       Dodge City        1939      1              2003-09-02
The Netflix Data Set
100,480,507 dated ratings by N=480,189 users for M=17,770 movies
Qualifying data: 2,817,131 entries w/o rating
Teams were informed of the RMSE over a random half of the qualifying data set, to make optimizing on the test set more difficult
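The competition metric is plain RMSE over (prediction, rating) pairs. A minimal sketch, assuming NumPy; the helper name `rmse` is illustrative, not part of any Netflix tooling:

```python
import numpy as np

def rmse(predicted, actual):
    """Root mean squared error, the Netflix Prize evaluation metric."""
    predicted = np.asarray(predicted, dtype=float)
    actual = np.asarray(actual, dtype=float)
    return np.sqrt(np.mean((predicted - actual) ** 2))

# Toy example: three predictions vs. true 1-5 star ratings
print(rmse([3.5, 4.0, 2.0], [4, 4, 3]))  # ≈ 0.6455
```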
Restricted Boltzmann Machine (RBM)
Markov Random Field with no connections within either the visible or the hidden layer
Given the other layer, the units within a layer are conditionally independent
[Figure: bipartite graph of the visible layer V and hidden layer h, connected by weights W]
Binary Stochastic Neurons
Nodes in an RBM have state in {0,1}
p(s_i = 1) is given by a logistic function of the input from the neurons in the “other” layer and a bias:

p(s_i = 1) = 1 / (1 + exp(-(b_i + Σ_j s_j w_ij)))

where b_i is the bias, w_ij the connection weight, and s_j the state of node j in the “other” layer
Graph © G.Hinton
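The activation rule above can be sketched in a few lines of NumPy; the function name `p_on` and the toy numbers are illustrative, not from the slides:

```python
import numpy as np

def p_on(states_other_layer, weights, bias):
    """Probability that a binary stochastic unit turns on:
    sigmoid(bias + sum_j s_j * w_j), driven by the other layer."""
    total_input = bias + np.dot(states_other_layer, weights)
    return 1.0 / (1.0 + np.exp(-total_input))

# Hypothetical 3-unit visible layer driving one hidden unit
v = np.array([1, 0, 1])
w = np.array([0.5, -0.2, 0.1])
print(p_on(v, w, bias=-0.3))  # sigmoid(0.3) ≈ 0.574
```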
Equilibrium
If nodes are updated sequentially, eventually an equilibrium is reached
Define the energy of a joint configuration (v, h) as:

E(v, h) = - Σ_i a_i v_i - Σ_j b_j h_j - Σ_i,j v_i h_j w_ij

where a_i are the biases of the visible nodes, b_j the biases of the hidden nodes, and w_ij the connection between visible node i and hidden node j
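The energy function is a direct translation into code. A minimal NumPy sketch (the names `energy`, `a`, `b`, `W` and the toy values are assumptions for illustration):

```python
import numpy as np

def energy(v, h, a, b, W):
    """E(v, h) = - a.v - b.h - v^T W h, with visible biases a,
    hidden biases b, and weight matrix W (visible x hidden)."""
    return -np.dot(a, v) - np.dot(b, h) - v @ W @ h

# Tiny example: 2 visible units, 1 hidden unit
v = np.array([1.0, 0.0])
h = np.array([1.0])
a = np.array([0.1, 0.2])     # visible biases
b = np.array([0.3])          # hidden bias
W = np.array([[0.5], [0.4]])
print(energy(v, h, a, b, W))  # -0.1 - 0.3 - 0.5 = -0.9
```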
Joint probability

p(v, h) = exp(-E(v, h)) / Z

where the partition function Z = Σ_v,h exp(-E(v, h)) normalizes the distribution
Marginal distribution over the visible nodes:

p(v) = (1/Z) Σ_h exp(-E(v, h))
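For a toy-sized RBM the partition function and the marginal over the visibles can be computed by brute-force enumeration, which makes the definitions concrete (this is a sketch; the exponential cost is exactly why real learning needs sampling):

```python
import itertools
import numpy as np

def energy(v, h, a, b, W):
    return -np.dot(a, v) - np.dot(b, h) - v @ W @ h

def states(n):
    """All 2^n binary configurations of n units."""
    return [np.array(s, dtype=float) for s in itertools.product([0, 1], repeat=n)]

def partition_function(a, b, W):
    """Z = sum over all joint configurations of exp(-E);
    the sum has 2^(n_v + n_h) terms, so tractable only for toy RBMs."""
    return sum(np.exp(-energy(v, h, a, b, W))
               for v in states(len(a)) for h in states(len(b)))

def p_v(v, a, b, W):
    """Marginal over the visibles: p(v) = (1/Z) sum_h exp(-E(v, h))."""
    Z = partition_function(a, b, W)
    return sum(np.exp(-energy(v, h, a, b, W)) for h in states(len(b))) / Z

a = np.array([0.1, -0.2])
b = np.array([0.3])
W = np.array([[0.5], [-0.4]])
print(sum(p_v(v, a, b, W) for v in states(2)))  # marginals sum to 1
```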
Learning in RBMs
∂ log p(v) / ∂w_ij = <v_i h_j>_data - <v_i h_j>_model

<v_i h_j>_data: expectation in equilibrium when V is clamped (“clamped phase”)
<v_i h_j>_model: expectation in equilibrium without the clamp (“free-running phase”)
(PMR Week 9)
Learning
Training Data V
1. Clamped Phase
2. Free Phase
RBMs for Collaborative Filtering
FACT: The number of movies each user has rated is far less than the total number of movies M.
KEY IDEA #1: For each user, build a different RBM. Every RBM has the same number of hidden nodes, but visible units only for the movies rated by that user.
KEY IDEA #2: If 2 users have rated the same movie, their 2 RBMs must use the same weights between the hidden layer and the movie's visible unit. To get the shared parameters, just average their updates over all N users.
A single user-specific RBM
The user has rated m movies. Ratings are modeled as 1-of-K (one-hot) vectors. Consequently, V is a K×m matrix. F is the number of hidden units.
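The one-hot encoding of a user's ratings can be sketched as follows (`ratings_to_onehot` is a hypothetical helper; the paper itself only defines the K×m matrix V):

```python
import numpy as np

def ratings_to_onehot(ratings, K=5):
    """Encode a user's m ratings (integer values 1..K) as a K x m binary
    matrix V: column i is the 1-of-K vector for the i-th rated movie."""
    m = len(ratings)
    V = np.zeros((K, m), dtype=int)
    for i, r in enumerate(ratings):
        V[r - 1, i] = 1
    return V

# A user who rated three movies 3, 5 and 1 stars
print(ratings_to_onehot([3, 5, 1]))
```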
Learning – Netflix RBM (1)
The goal is to maximize the likelihood of the visible units with respect to the weights, W, and biases, b.
The “energy” term is analogous to the binary RBM case, with a separate weight W_ij^k for each rating value k:

E(V, h) = - Σ_i,j,k W_ij^k h_j v_i^k - Σ_i,k v_i^k b_i^k - Σ_j h_j b_j (up to a per-movie normalization term)
In the following we only discuss learning the weights. Biases are trained in the same fashion.
Learning – Netflix RBM (2)
Gradient ascent update rule (maximizing the log-likelihood):

ΔW_ij^k = ε ( <v_i^k h_j>_data - <v_i^k h_j>_model )

<v_i^k h_j>_data: easy to compute. To be read as “the frequency with which movie i with rating k and feature j are on together when the features are being driven by the observed user-rating data”.
<v_i^k h_j>_model: impossible to compute in less than exponential time.
KEY IDEA: Performing T steps of Gibbs sampling is enough to get a good approximation <v_i^k h_j>_T of <v_i^k h_j>_model
Contrastive Divergence
Remember that the gradient updates are obtained by averaging the gradient updates of each of the N user-specific RBMs!
Graph adapted from © Hinton
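A single CD-1 weight update for a plain binary RBM can be sketched as below. This is a simplification of the slides' setting (biases omitted, binary rather than softmax visibles); the function name `cd1_update` is an assumption:

```python
import numpy as np

rng = np.random.default_rng(0)
sigmoid = lambda x: 1.0 / (1.0 + np.exp(-x))

def cd1_update(v0, W, eps=0.1):
    """One contrastive-divergence (T=1) update for a binary RBM.
    v0: observed visible vector; W: visible x hidden weight matrix."""
    # Clamped (positive) phase: drive the hidden units from the data
    ph0 = sigmoid(v0 @ W)
    h0 = (rng.random(ph0.shape) < ph0).astype(float)
    # One Gibbs step: reconstruct the visibles, then re-infer the hiddens
    pv1 = sigmoid(h0 @ W.T)
    v1 = (rng.random(pv1.shape) < pv1).astype(float)
    ph1 = sigmoid(v1 @ W)
    # Approximate gradient: <v h>_data - <v h>_T, here with T = 1
    return W + eps * (np.outer(v0, ph0) - np.outer(v1, ph1))

W = cd1_update(np.array([1.0, 0.0, 1.0, 0.0]), np.zeros((4, 3)))
```

In the paper's scheme this per-user update would be averaged over all N user-specific RBMs before being applied to the shared weights.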
Making predictions
There are 2 ways of making predictions in the trained model:
1. By directly conditioning on the observed matrix V
2. By performing one iteration of the mean-field updates to get a probability distribution over the ratings of a movie
Although the first approach is more accurate, the second one is preferred in the experiments because of its higher speed.
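The faster, mean-field approach can be sketched as one deterministic pass. This sketch assumes a K×M×F weight tensor with all-zero columns in V for unrated movies, and omits the biases; `predict_rating` is a hypothetical name:

```python
import numpy as np

sigmoid = lambda x: 1.0 / (1.0 + np.exp(-x))

def predict_rating(V, W, movie):
    """One mean-field pass: infer hidden probabilities from the observed
    K x M one-hot matrix V, then softmax over the K possible ratings of
    the queried movie and return the expected rating (biases omitted)."""
    # Mean-field hidden probabilities given the observed ratings
    ph = sigmoid(np.einsum('km,kmf->f', V, W))
    # Unnormalized log-probabilities of each rating k for the queried movie
    logits = W[:, movie, :] @ ph          # shape (K,)
    probs = np.exp(logits - logits.max())
    probs /= probs.sum()
    # Expected rating under the distribution over ratings 1..K
    return np.dot(np.arange(1, len(probs) + 1), probs)
```

With all-zero weights the distribution over ratings is uniform and the expected rating is 3.0, a quick sanity check.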
Conditional RBMs
FACT: The user/movie pairs in the test set are known as well.
KEY IDEA: Build a binary vector r indicating all the movies a user has rated (known + unknown ratings). This vector will affect the states of the hidden units.
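In the conditional RBM the indicator vector r contributes an extra linear term to the hidden-unit inputs via a learned M×F matrix, here called D (the names in this sketch are assumptions; biases and shapes follow the earlier sketches):

```python
import numpy as np

sigmoid = lambda x: 1.0 / (1.0 + np.exp(-x))

def conditional_hidden_probs(V, r, W, D, b_h):
    """Hidden probabilities in a conditional RBM sketch: the usual input
    from the one-hot ratings V (K x M, via weights W of shape K x M x F)
    plus an extra term r @ D from the rated-movie indicator r (length M,
    D of shape M x F) and hidden biases b_h (length F)."""
    return sigmoid(b_h + np.einsum('km,kmf->f', V, W) + r @ D)
```

With all parameters at zero every hidden probability is 0.5, as for an unconditioned unit.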
Experiments
RBM with F=100 hidden units
RBM with F=100 Gaussian hidden units: the only difference is that the conditional Bernoulli distribution over the hidden variables is replaced by a Gaussian
Conditional RBM with F=500 hidden units
Conditional factored RBM with F=500 hidden units and C=30: each of the K weight matrices W^k (size M×F) factorizes into a product of low-rank matrices A^k (size M×C) and B^k (size C×F)
SVD with C=40
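The point of the factorization is parameter count: with the sizes stated in the deck (M=17,770, F=500, C=30, K=5), the arithmetic works out to roughly a 16x reduction:

```python
# Parameter counts for the factored model: each of the K weight matrices
# W^k (M x F) is replaced by A^k (M x C) times B^k (C x F).
M, F, C, K = 17770, 500, 30, 5
full = K * M * F              # 44,425,000 parameters
factored = K * (M * C + C * F)  # 2,740,500 parameters
print(full, factored, round(full / factored, 1))
```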
Results
Conclusions
Gaussian hidden units perform worse.
Conditional RBMs outperform the simple RBMs, but also make use of information in the test data.
Factorization speeds up convergence at early stages. This might also help avoid over-fitting.
SVDs and RBMs make different errors → combine them linearly. 6% improvement over Netflix baseline score.
Questions?