Social Search in a Professional Context

Preview:

DESCRIPTION

Keynote at CIKM 2013 Workshop on Data-driven User Behavioral Modelling and Mining from Social Media Social Search in a Professional Context Daniel Tunkelang (LinkedIn) Social networks bring a new dimension to search. Instead of looking for web pages or text documents, LinkedIn members search a world of entities connected by a rich graph of relationships. Search is a fundamental part of the LinkedIn ecosystem, as it helps our members find and be found. Unlike most search applications, LinkedIn's search experience is highly personalized: two LinkedIn members performing the same search query are likely to see completely different results. Delivering the right results to the right person depends on our ability to leverage our each member's unique professional identity and network. In this talk, I'll describe the kinds of search behavior we see on LinkedIn, and some of the approaches we've taken to help our members address their information needs.

Citation preview

Recruiting Solutions Recruiting Solutions Recruiting Solutions

Social Search in a Professional Context

Daniel Tunkelang LinkedIn, Head of Query Understanding

1

Daniel

Workshop on Data-driven User Behavioral Modeling and Mining from Social Media

LinkedIn connects talent to opportunity.

2

Search enables the participants in the economic graph to find and be found.

Overview

Why do people search in a professional context?

How do we help people search in

a professional context?

Next play?

3

4

Scenario 1: Pleased to meet you!

5

People search isn’t the same as web search.

6

LinkedIn works hard to make it effortless.

7

Even harder to reduce user effort to a few chars.

8

Searchers use what they know to find people.

9

Not all navigational queries are name searches.

10

Scenario 2: Looking for new opportunities.

11

Lots of jobs in DC for data scientists.

12

My connections can help me get a $100k+ job.

13

Apply, contact the recruiter, or seek a referral.

14

Scenario 3: I know what I want when I see it.

15

Another year, another CIKM industry event.

16

We’ll need student volunteers, too.

17

And some sponsors!

18

19

LinkedIn’s focus: entity-oriented search.

20

Company

Employees

Jobs

Name Search

Query tagging: key to query understanding.

§  Using human judgments to evaluate tag precision. –  Extremely accurate (> 99%) for identifying person names. –  Harder to distinguish company vs. title vs. skill (e.g., oracle dba).

§  Comparing CTR for tag matches vs. non-matches. –  Difference can be large enough to suggest filtering vs. ranking:

21

Query Tagging: An Example

22

Detecting navigational vs. exploratory queries.

Pre-retrieval §  Sequence of query tags.

Post-retrieval §  Distribution of scores / features.

23

Click behavior §  Title searches >50x more

likely to get 2+ clicks than name searches.

Navigation vs. Exploration: Behavior Patterns

§  Exploratory searches leads to ~5x more clicks per search than navigational searches.

§  Clicks on 2nd-degree connection more than 2x as likely to lead to invitation from exploratory vs. navigational search.

§  For navigational queries, 1st degree > 2nd degree > …

§  For exploratory queries, 2nd and 3rd degree > 1st degree.

24

Query expansion for exploratory queries.

25

software patent lawyer

Query expansions derived from reformulations.

e.g., lawyer -> attorney

LinkedIn search is personalized.

26

kevin scott

But global factors matter.

27

Relevant results can be in or out of network.

28

§  Searcher’s network matters for relevance. –  Within network results have higher CTR.

§  But the network is not enough. –  About two thirds of search clicks come from out of

network results.

Personalized machine-learned ranking.

29

§  Data point is a triple (searcher, query, document). –  Searcher features are important!

§  Labels: Is this document relevant to the query and the user? –  Depends on the user’s network, location, etc. –  Too much to ask random person to judge.

§  Training data has to be collected from search logs.

How to train your model.

30

§  Train simple models to resemble complex ones. –  Build Additive Groves model [Sorokina et al, ECML ’07],

which is good at detecting interactions. §  Build tree with logistic regression leaves.

§  By restricting tree to user and query features, only regression model evaluated for each document.

β0 +β1T (x1)+...+βn xn

α0 +α1P(x1)+...+αnQ(xn )

X2=?

X10< 0.1234 ?

γ0 +γ1R(x1)+...+γnQ(xn )

31

32

Make search truly entity-centric.

33

results

results

Use the search box to surface task intent.

34

I am…

looking for a job… at LinkedIn in Fiji trying to hire…

software engineers web developers

interested in learning about… Hadoop NoSQL

It takes two to connect talent to opportunity.

35

LinkedIn: connecting talent to opportunity.

36

Search: enabling the participants in the economic graph to find and be found.

Thank you!

37

238,

Want to learn more?

§  Check out http://data.linkedin.com/search.

§  Contact me: dtunkelang@linkedin.com

http://linkedin.com/in/dtunkelang

§  Did I mention that we’re hiring? J

38

Recommended