Yuta Saito
Long-term Off-Policy Evaluation and Learning
Short- and long-term outcomes of an algorithm often differ, with damaging downstream effects. A known example is a click-bait …
Yuta Saito, Himan Abdollahpouri, Jesse Anderton, Ben Carterette, Mounia Lalmas
Cite
Off-Policy Evaluation of Slate Bandit Policies via Optimizing Abstraction
We study off-policy evaluation (OPE) in slate contextual bandits where a policy selects multi-dimensional actions known as slates. This …
Haruka Kiyohara, Masahiro Nomura, Yuta Saito
Cite
Scalable and Provably Fair Exposure Control for Large-Scale Recommender Systems
Typical recommendation and ranking methods aim to optimize the satisfaction of users, but they are often oblivious to their impact on …
Riku Togashi, Kenshi Abe, Yuta Saito
Cite
Towards Assessing and Benchmarking Risk-Return Tradeoff of Off-Policy Evaluation
Off-Policy Evaluation (OPE) aims to assess the effectiveness of counterfactual policies using only offline logged data and is often …
Haruka Kiyohara, Ren Kishimoto, Kosuke Kawakami, Ken Kobayashi, Kazuhide Nakata, Yuta Saito
Cite
Code
arXiv
Off-Policy Evaluation of Ranking Policies under Diverse User Behavior
Ranking interfaces are everywhere in online platforms. There is thus an ever growing interest in their Off-Policy Evaluation (OPE), …
Haruka Kiyohara, Tatsuya Matsuhiro, Yusuke Narita, Nobuyuki Shimizu, Yasuo Yamamoto, Yuta Saito
Cite
Code
Poster
Slides
arXiv
Proceedings
Off-Policy Evaluation for Large Action Spaces via Conjunct Effect Modeling
We study off-policy evaluation (OPE) of contextual bandit policies for large discrete action spaces where conventional …
Yuta Saito, Qingyang Ren, Thorsten Joachims
Cite
Code
Poster
arXiv
Proceedings
Policy-Adaptive Estimator Selection for Off-Policy Evaluation
Off-policy evaluation (OPE) aims to accurately evaluate the performance of counterfactual policies using only offline logged data. …
Takuma Udagawa, Haruka Kiyohara, Yusuke Narita, Yuta Saito, Kei Tateno
Cite
Code
Slides
arXiv
Fair Ranking as Fair Division: Impact-Based Individual Fairness in Ranking
Rankings have become the primary interface of many two-sided markets. Many have noted that the rankings not only affect the …
Yuta Saito, Thorsten Joachims
Cite
Code
Video
Slides
arXiv
Proceedings
Off-Policy Evaluation for Large Action Spaces via Embeddings
Off-policy evaluation (OPE) in contextual bandits has seen rapid adoption in real-world systems, since it enables offline evaluation of …
Yuta Saito, Thorsten Joachims
Cite
Code
Video
Slides
arXiv
Proceedings
Towards Resolving Propensity Contradiction in Offline Recommender Learning
We study offline recommender learning from explicit rating feedback in the presence of selection bias. A current promising solution for …
Yuta Saito, Masahiro Nomura
Cite
Proceedings