Learning to Rank in
Vector Spaces and Social Networks
(WWW 2007 Tutorial)
Soumen Chakrabarti
IIT Bombay
∼http://www.cse.iitb.ac.in/ soumen
Soumen Chakrabarti Learning to Rank in Vector Spaces and Social Networks (WWW 2007 Tutorial) 1Motivation: Web search
I User query q, Web pages{v}
I (q,v) can be represented with a rich feature vector
I Text match score with title, anchor text, headings, bold
text, body text, ..., of v as a hypertext document
I Pagerank, topic-specific Pageranks, personalized
Pageranks of v as a node in the Web graph
I Estimated location of user, commercial intent, ...
I Must we guess the relative importance of these features?
I How to combine these into a single scoring function on
(q,v) so as to induce a ranking on{v}?
Soumen Chakrabarti Learning to Rank in Vector Spaces and Social Networks (WWW 2007 Tutorial) 2Motivation: Ad and link placement
I Here, the “query” is the surfer’s contextual information
I More noisy than queries, which are noisy enough!
I Plus page and site contents
I A response is an ad to place, or a link to insert
I Must rank and select from a large pool of available ads or
links
I (In this tutorial we will ignore issues of bidding and
visibility pricing)
Soumen Chakrabarti Learning to Rank in Vector Spaces and Social Networks (WWW 2007 Tutorial) 3Motivation: Desktop search
I The Web has only a few kinds of hyperlinks: same-host
subdirectory, same-host superdirectory, same-host
across-path, different-host same-domain, ...
Voir