闲情居 | Selected Papers from ICML 2020, a Top Machine Learning Conference (Part 5)


Minimax Weight and Q-Function Learning for Off-Policy Evaluation Masatoshi Uehara, Jiawei Huang, Nan Jiang
Tensor denoising and completion based on ordinal observations Chanwoo Lee, Miaoyan Wang
Learning Human Objectives by Evaluating Hypothetical Behavior Siddharth Reddy, Anca Dragan, Sergey Levine, Shane Legg, Jan Leike
Counterfactual Cross-Validation: Stable Model Selection Procedure for Causal Inference Models Yuta Saito, Shota Yasui
Learning Efficient Multi-agent Communication: An Information Bottleneck Approach Rundong Wang, Xu He, Runsheng Yu, Wei Qiu, Bo An, Zinovi Rabinovich
MoNet3D: Towards Accurate Monocular 3D Object Localization in Real Time Xichuan Zhou, Yicong Peng, Chunqiao Long, Fengbo Ren, Cong Shi
SIGUA: Forgetting May Make Learning with Noisy Labels More Robust Bo Han, Gang Niu, Xingrui Yu, Quanming Yao, Miao Xu, Ivor Tsang, Masashi Sugiyama
Multinomial Logit Bandit with Low Switching Cost Kefan Dong, Yingkai Li, Qin Zhang, Yuan Zhou
Deep Reasoning Networks for Unsupervised Pattern De-mixing with Constraint Reasoning Di Chen, Yiwei Bai, Wenting Zhao, Sebastian Ament, John Gregoire, Carla Gomes
Uncertainty-Aware Lookahead Factor Models for Improved Quantitative Investing Lakshay Chauhan, John Alberg, Zachary Lipton
On the Unreasonable Effectiveness of the Greedy Algorithm: Greedy Adapts to Sharpness Sebastian Pokutta, Mohit Singh, Alfredo Torrico
Stronger and Faster Wasserstein Adversarial Attacks Kaiwen Wu, Allen Wang, Yaoliang Yu
Optimizing Multiagent Cooperation via Policy Evolution and Shared Experiences Somdeb Majumdar, Shauharda Khadka, Santiago Miret, Stephen McAleer, Kagan Tumer
Why Are Learned Indexes So Effective? Paolo Ferragina, Fabrizio Lillo, Giorgio Vinciguerra
Fast OSCAR and OWL with Safe Screening Rules Runxue Bao, Bin Gu, Heng Huang
Which Tasks Should Be Learned Together in Multi-task Learning? Trevor Standley, Amir Zamir, Dawn Chen, Leonidas Guibas, Jitendra Malik, Silvio Savarese
Inertial Block Proximal Methods for Non-Convex Non-Smooth Optimization Hien Le, Nicolas Gillis, Panagiotis Patrinos
Adversarial Neural Pruning with Latent Vulnerability Suppression Divyam Madaan, Jinwoo Shin, Sung Ju Hwang
Lifted Disjoint Paths with Application in Multiple Object Tracking Andrea Hornakova, Roberto Henschel, Bodo Rosenhahn, Paul Swoboda
Being Bayesian, Even Just a Bit, Fixes Overconfidence in ReLU Networks Agustinus Kristiadi, Matthias Hein, Philipp Hennig
SCAFFOLD: Stochastic Controlled Averaging for Federated Learning Sai Praneeth Reddy Karimireddy, Satyen Kale, Mehryar Mohri, Sashank Jakkam Reddi, Sebastian Stich, Ananda Theertha Suresh
Statistically Preconditioned Accelerated Gradient Method for Distributed Optimization Hadrien Hendrikx, Lin Xiao, Sebastien Bubeck, Francis Bach, Laurent Massoulié
Pretrained Generalized Autoregressive Model with Adaptive Probabilistic Label Cluster for Extreme Multi-label Text Classification Hui Ye, Zhiyu Chen, Da-Han Wang, Brian Davison

