Documentos de Académico
Documentos de Profesional
Documentos de Cultura
BEHAVIOR:
INFERRING CAUSALITY USING QUASI-
EXPERIMENTAL DESIGNS
Ramesh K. Sitaraman
UMass, Amherst & Akamai
ramesh@cs.umass.edu
http://www.cs.umass.edu/~ramesh
CDN
Improved
$ Performance
Content
Users
Provider
“Improved” User
Behavior
Ramesh Sitaraman: ramesh@cs.umass.edu 7
CDN
Improved
Performance
$
CDN
1. Availability: Viewers
$ Improved download video without
Performance failure.
2. Startup Delay: Video
Content starts without much
Users
Provider delay.
3. Rebuffers: Video plays
“Improved” User without freezes.
Behavior
Media & Entertainment
(Ad-supported or Subscription)
1. Abandonment: Reduce viewers who abandon without viewing
the video.
2. Engagement: Viewers watch videos longer.
3. Repeat Viewership: Viewers keep coming back to site to
watch more videos.
Ramesh Sitaraman: ramesh@cs.umass.edu 9
2. Engagement:
2. Startup Delay: Viewers watch
Video starts videos longer.
without much
delay.
3. Repeat Viewership:
Viewers keep
3. Rebuffers: coming back to site
Video plays to watch more
without freezes. videos.
Ramesh Sitaraman: ramesh@cs.umass.edu 10
The Data
Ramesh Sitaraman: ramesh@cs.umass.edu 11
Akamai’s
CDN
Video
Beacon
Globally-deployed Akamai plugin that runs inside the media
player and reports viewer actions and performance metrics via
``beacons’’ from millions of actual end-users around the world.
Ramesh Sitaraman: ramesh@cs.umass.edu 12
The Techniques
Ramesh Sitaraman: ramesh@cs.umass.edu 14
Correlation
Correlation
Threat to causality: Users who are better off can afford better
network connectivity, resulting in less rebuffering. They can also
afford access to more interesting content.
Ramesh Sitaraman: ramesh@cs.umass.edu 16
Correlation ≠ Causality
Correlation: A and B “move together”.
versus
Randomized Experiments
Idea: Equalize the impact of confounding variables using
randomness. (R.A. Fisher 1937)
Hypothesis:
Performance Outcome +1: supports hypothesis
èBehavior -1: rejects hypothesis
0: Neither
Ramesh Sitaraman: ramesh@cs.umass.edu 19
Viewer Engagement
Ramesh Sitaraman: ramesh@cs.umass.edu 20
Hypothesis:
More Rebuffers Outcome For each pair, outcome =
èSmaller Play time playtime(untreated) –
playtime(treated)
Ramesh Sitaraman: ramesh@cs.umass.edu 22
Results of Quasi-Experiment
Normalized Rebuffer Delay Net Outcome
(γ%)
1 5.0%
2 5.5%
3 5.7%
4 6.7%
5 6.3%
6 7.4%
7 7.5%
Viewer Abandonment
Ramesh Sitaraman: ramesh@cs.umass.edu 24
Short: < 30 mins (e,g, news clip). Median Duration: 1.8 mins
Long: ≥ 30 mins(e.g, movie). Median Duration: 43.2 mins
Ramesh Sitaraman: ramesh@cs.umass.edu 27
Repeat Viewership
Ramesh Sitaraman: ramesh@cs.umass.edu 30
Hypothesis:
Failed visit è Outcome For each pair, outcome =
Viewer returning to +1, if treated returns but
site not untreated
-1, if untreated returns but
not treated
0, otherwise
Ramesh Sitaraman: ramesh@cs.umass.edu 32
Our Contributions
Questions?