Research Papers

Journal Articles:

Regimes of No Gain in Multi-class Active Learning. Gan Yuan, Yunfan Zhao, and Samory Kpotufe. Journal of Machine Learning Research, 2024.

Perfect domination ratios of Archimedean lattices. Yunfan Zhao, John C. Wierman, and Thomas G. Marge. Electronic Journal of Combinatorics, 2022.

Estimate-Then-Optimize Versus Integrated-Estimation-Optimization: A Stochastic Dominance Perspective. Adam N. Elmachtoub, Henry Lam, Haofeng Zhang, and Yunfan Zhao (alphabetical order). Under Revision, Operations Research.

Conference Papers:

The Bandit Whisperer: Communication Learning for Restless Bandits. Yunfan Zhao, Tonghan Wang, Dheeraj Nagaraj, Aparna Taneja, Milind Tambe. Under Review.

Optimizing Vital Sign Monitoring in Resource-Constrained Maternal Care: An RL-Based Restless Bandit Approach. Niclas Boehmer, Yunfan Zhao, Guojun Xiong, Paula Rodriguez-Diaz, Paola Del Cueto Cibrian, Joseph Ngonzi, Adeline Boatin, Milind Tambe. IAAI 2025.

A Decision-Language Model (DLM) for Dynamic Restless Multi-Armed Bandit Tasks in Public Health. Nikhil Behari, Edwin Zhang, Yunfan Zhao, Dheeraj Nagaraj, Aparna Taneja, and Milind Tambe. NeurIPS 2024.

Towards a Pretrained Model for Restless Bandits via Multi-arm Generalization . Yunfan Zhao, Nikhil Behari, Edward Hughes, Edwin Zhang, Dheeraj Nagaraj, Karl Tuyls, Aparna Taneja, and Milind Tambe. IJCAI 2024.

Group Fairness in Predict-Then-Optimize Settings for Restless Bandits. Shresth Verma, Yunfan Zhao, Sanket Shah, Niclas Boehmer, Aparna Taneja, and Milind Tambe. UAI 2024 (oral presentation).

Scalable Neural Network Kernels Arijit Sehanobish, Krzysztof Choromanski, Yunfan Zhao, Avinava Dubey, and Valerii Likhosherstov. ICLR 2024.

Implicit Two-Tower Policies. Yunfan Zhao, Qingkai Pan, Krzysztof Choromanski, Deepali Jain, and Vikas Sindhwani. ICLR 2024 PML4LRS Workshop.

Improving the prediction of individual engagement in recommendations using cognitive models. Roderick Seow, Yunfan Zhao, Duncan Wood, Milind Tambe, Cleotilde Gonzalez. ACM RecSys 2024 Workshop on Health Recommender Systems.

Estimate-Then-Optimize Versus Integrated-Estimation-Optimization: A Stochastic Dominance Perspective. Adam N. Elmachtoub, Henry Lam, Haofeng Zhang, and Yunfan Zhao (alphabetical order). IOS 2024.

Efficient Graph Field Integrators Meet Point Clouds. Krzysztof Choromanski, Arijit Sehanobish, Han Lin, Yunfan Zhao, et al. ICML, 2023.

Balanced Off-Policy Evaluation for Personalized Pricing. Adam N. Elmachtoub, Vishal Gupta, and Yunfan Zhao (alphabetical order). AISTATS, 2023.

Nuances in Margin Conditions Determine Gains in Active Learning. Samory Kpotufe, Gan Yuan, and Yunfan Zhao (alphabetical order). AISTATS, 2022.

Efficient and non-efficient domination of Archimedean lattices. Thomas G. Marge, Yunfan Zhao, and John C. Wierman. Congressus Numerantium, 2018.