29
Recommending Jobs You May Be Interested In Anuj Goyal Recommendations at LinkedIn Anuj

Big data innovation_summit_2014

Embed Size (px)

DESCRIPTION

The talk I gave at the Big Data Innovation Summit.

Citation preview

Page 1: Big data innovation_summit_2014

Recruiting Solutions Recruiting Solutions Recruiting Solutions

Recommending Jobs You May Be Interested In

Anuj Goyal Recommendations at LinkedIn

Anuj

Page 2: Big data innovation_summit_2014

§  About LinkedIn §  Search vs. Recommendations §  Recommendation Opportunities §  Evaluating Job Recommendations §  Job Recommendation Algorithm §  Challenges §  Summary

2

Overview

Page 3: Big data innovation_summit_2014

270+ M

Company Pages

>3M *

Professional searches in 2012

~5.7B

90% Fortune 100 Companies use LinkedIn to hire

*

*as of March 31, 2014

New Members joining

~2/sec

2004

2005

2006

2007

2008

2009

2010

2011

2012

2013

3

World’s largest professional network Over 65% of members are now international

Page 4: Big data innovation_summit_2014

4

Recommendation Versus Search

Explicit Implicit

Page 5: Big data innovation_summit_2014

The Recommendations Opportunity

5

Page 6: Big data innovation_summit_2014

6

Page 7: Big data innovation_summit_2014

50% 7

Value of Recommendations

Page 8: Big data innovation_summit_2014

Job Recommendations

8

Page 9: Big data innovation_summit_2014

9

Jobs You May be Interested In

Page 10: Big data innovation_summit_2014

Evaluation

§  Upside metrics –  Are users getting relevant jobs?

§  Downside metrics –  Are users getting offending jobs?

10

Page 11: Big data innovation_summit_2014

Evaluation - Upside

§  Total Job Views (Clicks) §  Total Applications §  Total Viewers §  Total Applicants

11

Page 12: Big data innovation_summit_2014

Evaluation - Downside

§  Applications per Click §  Clicks per Impression §  Applications per Impression §  Expert Judgments

12

Page 13: Big data innovation_summit_2014

User Features & Recommendation Algorithm

Page 14: Big data innovation_summit_2014

14

Positions Education

Summary

Experience

Skills

User Features

Page 15: Big data innovation_summit_2014

Corpus Stats

Candidate Jobs

User Base

title geo company

industry description functional area

Candidate

General expertise specialties education headline geo experience

Current Position title summary tenure length industry functional area …

Similarity (candidate expertise, job description)

0.56 Similarity

(candidate specialties, job description)

0.2 Transition probability

(candidate industry, job industry)

0.43

Title Similarity

0.8

Similarity (headline, title)

0.7 . . .

derived

Matching Binary Exact matches: geo, industry, … Soft transition probabilities, similarity, … Text

Recommendation Algorithm

Transition probabilities Connectivity yrs of experience to reach title education needed for this title …

15

Job Collection

Page 16: Big data innovation_summit_2014

Challenges & Feature Engineering

Page 17: Big data innovation_summit_2014

Challenge: Entity Resolution

17

Page 18: Big data innovation_summit_2014

‘IBM’ has 13000+ variations -  ibm – ireland -  ibm research -  T J Watson Labs -  International Bus. Machines

Are All Companies The Same?

18

Page 19: Big data innovation_summit_2014

-  Software Engineer -  Technical Yahoo -  Member Technical Staff -  Software Development Engineer -  SDE

Are All Titles The Same?

19

Page 20: Big data innovation_summit_2014

Same company with different name

Same name but different companies

Name Variations for IBM? “Orion” refers to 20 diff. companies

large scale: 100M+ members, 2M+ company entities

IBM: Intl Brotherhood of Magicians ~ 13000

Challenges – Entity Resolution

20

Page 21: Big data innovation_summit_2014

§  Binary classifier (LR), not ranker §  P({position, company

entity} is a match) §  Features

§  Content §  Social §  Behavior

§  Company candidate set leveraged from Social graph and cosine similarity 97% Precision

at 50% Coverage Asonam’11, KDD’11

Challenges – Entity Resolution

21

Prec

isio

n Coverage

Page 22: Big data innovation_summit_2014

Challenges – Geo Location

22

Page 23: Big data innovation_summit_2014

§  Zip code mapped to Regions §  How sticky are those locations?

Feature Engineering – Sticky locations

23

Page 24: Big data innovation_summit_2014

§  Open to relocation ? §  Region similarity based on profiles or network §  Region transition probability

§  Predict individuals propensity to migrate and most likely migration target

Feature Engineering – Sticky locations

24

Page 25: Big data innovation_summit_2014

Feature Engineering – The Network effect

25

Page 26: Big data innovation_summit_2014

Hybrid Recommendation Title : Research Engineer Company : Yahoo! Location : CA,USA Skills : Stats, ML, Java

Title : Data Scientist Company : Samsung Location : PA,USA Skills : Stats, R

Title : Analyst Company : Microsoft Location : CA, USA Skills : R, ML

Title : Research Engineer <1>, Data Scientist <1>, Analyst <1> Company : Yahoo<1>, Samsung<1>, Microsoft<1>

Location : CA,USA <2>, PA,USA<1> Skills : Stats<2>, ML<2>, R<2>, Java<1>

Applicant Features

Distribution

Data Scientist / Senior Data Scientist San Jose

26

Page 27: Big data innovation_summit_2014

Information Gain

Pick Top K overrepresented features from the applicants distribution

A representative projection of the job in the member feature space

27

Hybrid Recommendation

Page 28: Big data innovation_summit_2014

§  Why Jobs Recommendations are Different §  Recommendation Algorithm §  Challenges

–  Entity Resolution –  Location Resolution

28

Summary

Page 29: Big data innovation_summit_2014

Questions?

Contact:

[email protected]

We’re Hiring!

http://data.linkedin.com/

Thank You!

29