Ramakrishnan Srikant Sugato Basu Ni Wang Daryl Pregibon 1.

Ramakrishnan Srikant

Sugato Basu

Ni Wang

Daryl Pregibon

Estimating Relevance of Search Engine ResultsUse CTR (click-through rate) data.Pr(click) = Pr(examination) x Pr(click | examination)

Need user browsing models to estimate Pr(examination)

Relevance

NotationΦ(i) : result at position i

Examination event:

Click event:

otherwise 0,

(i)on clickeduser theif ,1 iC

otherwise 0,

(i) examineduser theif ,1 iE

Examination HypothesisRichardson et al, WWW 2007:

Pr(Ci = 1) = Pr(Ei = 1) Pr(Ci = 1 | Ei = 1)

αi : position bias

Depends solely on position.Can be estimated by looking at CTR of the same result in

different positions.

Using Prior Clicks

Clicks

Pr(E5 | C1,C3) = 0.5 Pr(E5 | C1) = 0.3

ClicksR1

Examination depends on prior clicksCascade modelDependent click model (DCM)User browsing model (UBM) [Dupret & Piwowarski,

SIGIR 2008]More general and more accurate than Cascade, DCM.Conditions Pr(examination) on closest prior click.

Bayesian browsing model (BBM) [Liu et al, KDD 2009]Same user behavior model as UBM.Uses Bayesian paradigm for relevance.

Use position of closest prior click to predict Pr(examination).

Pr(Ei = 1 | C1:i-1) = αi β i,p(i)

Pr(Ci = 1 | C1:i-1) = Pr(Ei = 1 | C1:i-1) Pr(Ci = 1 | Ei = 1)

User browsing model (UBM)

position bias

p(i) = position of closest prior click

Prior clicks don’t affect relevance.

Other Related WorkExamination depends on prior clicks and prior relevance

Click chain model (CCM)General click model (GCM)

Post-click modelsDynamic Bayesian modelSession utility model

Constant Relevance AssumptionCascade model, DCM, UBM, BBM, CCM, GCM all

implicitly assume:Relevance is independent of prior clicks.Relevance is constant across query instances.

Query = “Canon S90”Aggregate relevance: Relevance to a query.

Query instance = “Canon S90” for a specific user at a specific point in time.Instance relevance: Relevance to a query

instance.

User intent Query string does not fully capture user intent.

Prior clicks signal relevance.... not just Pr(examination).

Testing Constant RelevanceIf we know that Pr(examination) ≈ 1:

Relevance ≈ CTRTest whether relevance is independent of prior clicks.

When is Pr(examination) ≈ 1?Users scan from top to bottom.If there is a click below position i, then Pr(Ei) ≈ 1.

For a specific position i:

If relevance is independent of prior clicks, we expect

Lift = LHS / RHS ≈ 1

All query instances

Statistical Power of the Test

)clicks(S Predicted

)Clicks(S

)clicks(S Predicted

)Clicks(S

Pr(examination) ≈ 1

S0: No clicks above i

S1: Click above i

The data speaks…• Over all configs:

Lift = 2.69 +/- 0.05 (99% conf. Interval)

•Graph shows config with 3 top ads, 8 rhs ads. • T2 = 2nd top ad, R3 = 3rd rhs ad, etc.

Pure Relevance

Max Examination

Joint Relevance Examination (JRE)

Pure RelevanceAny change in Pr(Ci = 1) when conditioned on other

clicks is solely due to change in instance relevance.Number of clicks on other results used as signal of

instance relevance.Does not use position of other clicks, only the count.

Yields identical aggregate relevance estimates as the baseline model (which does not use co-click information).

Pr(Ci = 1 | C≠i , Ei = 1) = rΦ(i) δn(i)

n(i) = number of click in other positions

Max ExaminationLike UBM/BBM, but also use information about clicks

below position i.Pr(examination) ≈ 1 if there is a click below i

UBM/BBM: Pr(Ei = 1 | C1:i-1) = αi βi,p(i)

Max-examination: Pr(Ei = 1 | C ≠i) = αi βi,e(i)

p(i) = position of closest prior click

i belowclick a is thereif1,i

iposition belowclick no ifp(i),e(i)

Joint Relevance Examination (JRE)Combines the features of the pure relevance and max-

examination models.Allows CTR changes to be caused by both changes in

examination and changes in instance relevance.

Pr(Ei = 1 | C≠i) = αi βi,e(i)

Pr(Ci = 1 | C≠i , Ei = 1) = rΦ(i) δn(i)

Pr(Ci = 1 | C≠i) = Pr(Ei = 1 | C≠i) Pr(Ci = 1 | Ei = 1, C≠i)

Predicting CTRModels:

Baseline: Google's production system for predicting relevance of sponsored results. Does not use co-click information.

Compare to UBM/BBM, max examination, pure relevance, and JRE.

Data: 10% sample of a week of data.50-50 split between training and testing.

Absolute Error

Baseline: Google’s production system.

Log likelihood

Squared Error

Predicting Relevance vs. Predicting CTRIf model A is more accurate than model B at predicting

CTR, wouldn’t A also be better at predicting (aggregate) relevance?

Counter-ExampleCTR:

pure-relevance >>

max-examination >>

baseline

Relevance: Eitherpure-relevance == baseline

max-examination

ORmax-examination

pure-relevance == baseline

IntuitionPredicting CTR:

Get the product, Pr(examination) x Relevance, right.Predicting Relevance:

Need to correctly assign credit between examination and relevance.

Incorrectly assigning credit can improve CTR prediction, while making relevance estimates less accurate.

Predicting relevance.Run an experiment on live traffic.

Sponsored results are ranked by bid x relevance.More accurate relevance estimates should result in higher

CTR and revenue. Will place results with higher relevance in positions with higher

Pr(examination).

Baseline/pure-relevance had better revenue and CTR than max-examination.Results were statistically significant results.

ConclusionsChanges in CTR when conditioned on other clicks are

also due to instance relevance, not just examination.New user browsing models that incorporate this insight

are more accurate.Evaluating user browsing models solely using offline

analysis of CTR prediction can be problematic.Use human ratings or live experiments.

Future WorkWhat about organic search results?Quantitatively assigning credit between instance

relevance and examination.Features are correlated.

Generalize pure-relevance and JRE to incorporate information about the relevance of prior results, or the satisfaction of the user with the prior clicked results.

Scan order

Ramakrishnan Srikant Sugato Basu Ni Wang Daryl Pregibon 1.

Documents

Transcript of Ramakrishnan Srikant Sugato Basu Ni Wang Daryl Pregibon 1.

Carbon Nanotube Enhanced Composite Materialsstatic1.squarespace.com/static/51b61725e4b056f3a70da3c3/t/51dda04... · CARBON NANOTUBE ENHANCED COMPOSITE MATERIALS Weijun Wang, Fred

, Liuwei Gao, Hua Zhang, Daowei Wang, Meng Wang, Jianquan ...Aug 27, 2013 · 1 S. uccinate dehydrogenase. 5 (SDH5) regulates (GSK)-3β-β-catenin-mediated lung cancer metastasis

Image-Based Aspect Ratio Selectiongraphics.uni-konstanz.de/publikationen/Wang2018ImageBasedAspec… · Image-Based Aspect Ratio Selection Yunhai Wang, Zeyu Wang, Chi-Wing Fu, Hansjorg

Supplementary Information Radical α-Addition Involved … · 2020. 4. 1. · Zhao, Zhang and Wang, Supporting Information S1 Supplementary Information Radical α-Addition Involved

Glycopeptides Derived from Glucosaminic Acid · 1 Glycopeptides Derived from Glucosaminic Acid Ester Abtew+, Abraham J Domb*, and Arijit Basu* Institute of Drug Research, School of

Spectroscopic Properties of Colloidal Indium Phosphide Quantum Wires …/67531/metadc902346/... · Spectroscopic Properties of Colloidal Indium Phosphide Quantum Wires . Fudong Wang,§‡

Table of Contents · Effect of Deposition Pressure on the Properties of Silicon Thin Films J.W. Chen, L. Zhao, H.W. Diao, S. Zhou, G. Wang and W.J. Wang 70 Effect of Nano Fibre Arrays

Research Paper Overexpression of Platelet-Derived Growth ...jcancer.org/v11p4614.pdfCorresponding authors: Hai-Chen Wang, MD, Associate Professor, Department of Cardiovascular Surgery,

S. Huang, W. Wang, M. Brambley, S. Goyal, W. Zuo 2018. An ...

Electrostatics, statistical mechanics, and dynamics of … & Lifshitz, Statistical Physics, 1958] Radial distribution function [Chirikjian & Wang, PRE 62 (2000) 880-892] Distribution

Maximum-likelihood estimation of admixture proportions from genetic data Jinliang Wang.

Landau-Lifshitz-Maxwell equation in dimension threewang2482/Ding-Liu-Wang-LLM.pdf · Landau-Lifshitz-Maxwell equation in dimension three Shijin Ding Xiangao Liu y Changyou Wang z

D. Ding, J.-B. Wang, S. R. Johnson, S.-Q. Yu, Y.-H. Zhang

Contentsmahan/hypperc.pdf · 2019. 9. 11. · 2 RIDDHIPRATIM BASU AND MAHAN MJ investigated thoroughly, the literature on other background geometries is sparse. In the special case

ON PROJECTIONS OF SEMI-ALGEBRAIC SETS …sbasu/proj_quad.pdfON PROJECTIONS OF SEMI-ALGEBRAIC SETS DEFINED BY FEW QUADRATIC INEQUALITIES SAUGATA BASU AND THIERRY ZELL Abstract. Let

Xudong Wang - CMU · Microsoft PowerPoint - 13-us-korea forum.pptx Author: Dante Created Date: 10/17/2013 11:12:10 AM ...

Magic clusters PtMg2,3H5‐ facilitated by local σ‐aromaticity PDF/299.pdf · COMMUNICATION 1 Magic clusters PtMg 2,3 H 5-facilitated by local σ-aromaticity Wei Wang, Jie Wang,

Tao Wang, Huili Liang,* Zuyin Han, Yanxin Sui, and Zengxia ...

Peiling Wang, PhD Associate Professor peilingw@utk.edu Institutional Repositories and Open Access Βιβλιοθήκη Αλεξάνδρειου Τεχνολογικού Εκπαιδευτικού

Entanglement cost of quantum state preparation and channel ... · Entanglement cost of quantum state preparation and channel simulation Xin Wang QuICS,UniversityofMaryland JointworkwithMark

Glycopeptides Derived from Glucosaminic Acid · 1 Glycopeptides Derived from Glucosaminic Acid Ester Abtew+, Abraham J Domb, and Arijit Basu Institute of Drug Research, School of