闲情居|机器学习领域顶会ICML20精选论文分享(13)


Breaking the Curse of Many Agents: Provable Mean Embedding $Q$-Iteration for Mean-Field Reinforcement Learning Lingxiao Wang, Zhuoran Yang, Zhaoran Wang
Learning with Bounded Instance- and Label-dependent Label Noise Jiacheng Cheng, Tongliang Liu, Kotagiri Ramamohanarao, Dacheng Tao
Transparency Promotion with Model-Agnostic Linear Competitors Hassan Rafique, Tong Wang, Qihang Lin, Arshia Singhani
Learning Mixtures of Graphs from Epidemic Cascades Jessica Hoffmann, Soumya Basu, Surbhi Goel, Constantine Caramanis
Implicit differentiation of Lasso-type models for hyperparameter optimization Quentin Bertrand, Quentin Klopfenstein, Mathieu Blondel, Samuel Vaiter, Alexandre Gramfort, Joseph Salmon
Latent Space Factorisation and Manipulation via Matrix Subspace Projection Xiao Li, Chenghua Lin, Ruizhe Li, Chaozheng Wang, Frank Guerin
Active World Model Learning in Agent-rich Environments with Progress Curiosity Kuno Kim, Megumi Sano, Julian De Freitas, Nick Haber, Daniel Yamins
SDE-Net: Equipping Deep Neural Networks with Uncertainty Estimates Lingkai Kong, Jimeng Sun, Chao Zhang
GANs May Have No Nash Equilibria Farzan Farnia, Asuman Ozdaglar
Gradient Temporal-Difference Learning with Regularized Corrections Sina Ghiassian, Andrew Patterson, Shivam Garg, Dhawal Gutpa, Adam White, Martha White
Online mirror descent and dual averaging: keeping pace in the dynamic case Huang Fang, Victor Sanches Portella, Nick Harvey, Michael Friedlander
Choice Set Optimization Under Discrete Choice Models of Group Decisions Kiran Tomlinson, Austin Benson
Complexity of Finding Stationary Points of Nonconvex Nonsmooth Functions Jingzhao Zhang, Hongzhou Lin, Stefanie Jegelka, Suvrit Sra, Ali Jadbabaie
Multi-Agent Routing Value Iteration Network Quinlan Sykora, Mengye Ren, Raquel Urtasun
Adversarial Attacks on Copyright Detection Systems Parsa Saadatpanah, Ali Shafahi, Tom Goldstein
Differentiating through the Fréchet Mean Aaron Lou, Isay Katsman, Qingxuan Jiang, Serge Belongie, Ser Nam Lim, Christopher De Sa
Online Learning for Active Cache Synchronization Andrey Kolobov, Sebastien Bubeck, Julian Zimmert
PoKED: A Semi-Supervised System for Word Sense Disambiguation Feng Wei
A Finite-Time Analysis of Q-Learning with Neural Network Function Approximation Pan Xu, Quanquan Gu
Understanding and Stabilizing GANs' Training Dynamics Using Control Theory Kun Xu, Chongxuan Li, Jun Zhu, Bo Zhang
Scalable Nearest Neighbor Search for Optimal Transport Arturs Backurs, Yihe Dong, Piotr Indyk, Ilya Razenshteyn, Tal Wagner
Supervised learning: no loss no cry Richard Nock, Aditya Menon
Label-Noise Robust Domain Adaptation Xiyu Yu, Tongliang Liu, Mingming Gong, Kun Zhang, Kayhan Batmanghelich, Dacheng Tao
Description Based Text Classification with Reinforcement Learning Wei Wu, Duo Chai, Qinghong Han, Fei Wu, Jiwei Li
Bandits for BMO Functions Tianyu Wang, Cynthia Rudin
Cost-effectively Identifying Causal Effect When Only Response Variable Observable Tian-Zuo Wang, Xi-Zhu Wu, Sheng-Jun Huang, Zhi-Hua Zhou


推荐阅读