Publications

(2024). Effective Off-Policy Evaluation and Learning in Contextual Combinatorial Bandits. In Proceedings of the 18th ACM Conference on Recommender Systems (RecSys).

Cite

(2024). Hyperparameter Optimization Can Even be Harmful in Off-Policy Learning and How to Deal with It. In Proceedings of the 33rd International Joint Conference on Artificial Intelligence (IJCAI) (Acceptance rate=15%).

Cite Proceedings

(2024). Scalable and Provably Fair Exposure Control for Large-Scale Recommender Systems. In Proceedings of the ACM Web Conference 2024 (TheWebConf) (Acceptance Rate=20.2%).

Cite Video arXiv Proceedings

(2024). Off-Policy Evaluation of Slate Bandit Policies via Optimizing Abstraction. In Proceedings of the ACM Web Conference 2024 (TheWebConf) (Acceptance Rate=20.2%).

Cite Code arXiv Proceedings

(2024). Long-term Off-Policy Evaluation and Learning. In Proceedings of the ACM Web Conference 2024 (TheWebConf) (Acceptance Rate=20.2%).

Cite Code arXiv Proceedings

(2024). Towards Assessing and Benchmarking Risk-Return Tradeoff of Off-Policy Evaluation. In Proceedings of the Twelfth International Conference on Learning Representations (ICLR) (Acceptance Rate=31%).

Cite Code arXiv

(2023). Off-Policy Evaluation of Ranking Policies under Diverse User Behavior. In Proceedings of the 29th SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD) (Acceptance rate=22.3%).

Cite Code Poster Slides arXiv Proceedings

(2023). Off-Policy Evaluation for Large Action Spaces via Conjunct Effect Modeling. In Proceedings of the 40th International Conference on Machine Learning (ICML) (Acceptance rate=27.9%).

Cite Code Poster arXiv Proceedings

(2022). Policy-Adaptive Estimator Selection for Off-Policy Evaluation. In Proceedings of the 37th AAAI Conference on Artificial Intelligence (AAAI) (Acceptance rate=19.6%).

Cite Code Slides arXiv

(2022). Counterfactual Evaluation and Learning for Interactive Systems. In Proceedings of the 28th SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD).

Cite Code Proceedings Website

(2022). Fair Ranking as Fair Division: Impact-Based Individual Fairness in Ranking. In Proceedings of the 28th SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD) (Acceptance rate=14.9%).

Cite Code Video Slides arXiv Proceedings

(2022). Off-Policy Evaluation for Large Action Spaces via Embeddings. In Proceedings of 39th International Conference on Machine Learning (ICML) (Acceptance rate=21.9%).

Cite Code Video Slides arXiv Proceedings

(2022). Towards Resolving Propensity Contradiction in Offline Recommender Learning. In Proceedings of the 31st International Joint Conference on Artificial Intelligence (IJCAI) (Acceptance rate=15%, Long Talk (top 4% of submissions)).

Cite Proceedings

(2022). Unbiased Recommender Learning from Biased Graded Implicit Feedback. WSDM 2022 Workshop on Decision Making for Modern Information Retrieval System.

Cite PDF

(2022). Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model. In Proceedings of the 15th International Conference on Web Search and Data Mining (WSDM) (Acceptance rate=20.2%, Best Paper Runner-Up Award).

Cite Code arXiv Proceedings

(2021). A Real-World Implementation of Unbiased Lift-based Bidding System. In Proceedings of the 2021 IEEE International Conference on Big Data (IEEE Big Data).

Cite

(2021). Open Bandit Dataset and Pipeline: Towards Realistic and Reproducible Off-Policy Evaluation. In Proceedings of the Neural Information Processing Systems (NeurIPS) Track on Datasets and Benchmarks.

Cite Code Dataset Proceedings arXiv

(2021). Efficient Hyperparameter Optimization under Multi-Source Covariate Shift. In Proceedings of the 30th ACM International Conference on Information and Knowledge Management (CIKM) (Acceptance rate=21.7%).

Cite Code Proceedings arXiv

(2021). Evaluating the Robustness of Off-Policy Evaluation. In Proceedings of the 15th ACM Conference on Recommender Systems (RecSys) (Acceptance rate=18.4%).

Cite Code Slides Proceedings arXiv

(2021). Counterfactual Learning and Evaluation for Recommender Systems. In Proceedings of the 15th ACM Conference on Recommender Systems (RecSys).

Cite Code Video Proceedings Website

(2021). Accelerating Offline Reinforcement Learning Application in Real-Time Bidding and Recommendation: Potential Use of Simulation. RecSys 2021 Workshop on Simulation Methods for Recommender Systems.

Cite arXiv

(2021). Optimal Off-Policy Evaluation from Multiple Logging Policies. In Proceedings of 38th International Conference on Machine Learning (ICML) (Acceptance rate=21.5%).

Cite Code Proceedings arXiv

(2020). Doubly Robust Estimator for Ranking Metrics with Post-Click Conversions. In Proceedings of the 14th ACM Conference on Recommender Systems (RecSys) (Acceptance rate=17.9%).

Cite Code Slides Proceedings

(2020). Data-Driven Off-Policy Estimator Selection: An Application in User Marketing on An Online Content Delivery Service. RecSys 2020 Workshop on Bandit and Reinforcement Learning from User Interactions.

Cite arXiv

(2020). Unbiased Pairwise Learning from Biased Implicit Feedback. In Proceedings of the 2020 ACM SIGIR on International Conference on Theory of Information Retrieval (ICTIR) (Acceptance rate=40.5%).

Cite Code Proceedings

(2020). Unbiased lift-based bidding system. AdKDD & TargetAd 2020 Workshop (held in conjunction with KDD2020).

Cite arXiv

(2020). Dual Learning Algorithm for Delayed Conversions. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR) (Acceptance rate=30.0%).

Cite Slides Proceedings

(2020). Counterfactual Cross-Validation: Stable Model Selection Procedure for Causal Inference Models. In Proceedings of 37th International Conference on Machine Learning (ICML) (Acceptance rate=21.8%).

Cite Code Slides Proceedings

(2020). Asymmetric Tri-training for Debiasing Missing-Not-At-Random Explicit Feedback. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR) (Acceptance rate=26.5%).

Cite Code Slides Proceedings

(2020). Cost-Effective and Stable Policy Optimization Algorithm for Uplift Modeling with Multiple Treatments. In Proceedings of the 2020 SIAM International Conference on Data Mining (SDM) (Acceptance rate=24.0%).

Cite Proceedings

(2020). Unbiased Recommender Learning from Missing-Not-At-Random Implicit Feedback. In Proceedings of the 13th International Conference on Web Search and Data Mining (WSDM) (Acceptance rate=14.8%).

Cite Code Slides Proceedings

(2019). Doubly Robust Prediction and Evaluation Methods Improve Uplift Modeling for Observational Data. In Proceedings of the 2019 SIAM International Conference on Data Mining (SDM) (Acceptance rate=22.7%).

Cite Proceedings