闲情居|机器学习领域顶会ICML20精选论文分享( 八 )


Prediction-Guided Multi-Objective Reinforcement Learning for Continuous Robot Control Jie Xu, Yunsheng Tian, Pingchuan Ma, Daniela Rus, Shinjiro Sueda, Wojciech Matusik
Goodness-of-Fit Tests for Inhomogeneous Random Graphs Soham Dan, Bhaswar B. Bhattacharya
Few-shot Domain Adaptation by Causal Mechanism Transfer Takeshi Teshima, Issei Sato, Masashi Sugiyama
Adaptive Adversarial Multi-task Representation Learning YUREN MAO, Weiwei Liu, Xuemin Lin
Streaming Submodular Maximization under a k-Set System Constraint Ran Haba, Ehsan Kazemi, Moran Feldman, Amin Karbasi
A Generic First-Order Algorithmic Framework for Bi-Level Programming Beyond Lower-Level Singleton Risheng Liu, Pan Mu, Xiaoming Yuan, Shangzhi Zeng, Jin Zhang
Optimal approximation for unconstrained non-submodular minimization Marwa El Halabi, Stefanie Jegelka
Generating Programmatic Referring Expressions via Program Synthesis Jiani Huang, Calvin Smith, Osbert Bastani, Rishabh Singh, Aws Albarghouthi, Mayur Naik
Nearly Linear Row Sampling Algorithm for Quantile Regression Yi Li, Ruosong Wang, Lin Yang, Hanrui Zhang
On Leveraging Pretrained GANs for Generation with Limited Data Miaoyun Zhao, Yulai Cong, Lawrence Carin
More Data Can Expand The Generalization Gap Between Adversarially Robust and Standard Models Lin Chen, Yifei Min, Mingrui Zhang, Amin Karbasi
Double Reinforcement Learning for Efficient and Robust Off-Policy Evaluation Nathan Kallus, Masatoshi Uehara
Statistically Efficient Off-Policy Policy Gradients Nathan Kallus, Masatoshi Uehara
Self-PU: Self Boosted and Calibrated Positive-Unlabeled Training Xuxi Chen, Wuyang Chen, Tianlong Chen, Ye Yuan, Chen Gong, Kewei Chen, Zhangyang Wang
When Does Self-Supervision Help Graph Convolutional Networks? Yuning You, Tianlong Chen, Zhangyang Wang, Yang Shen
On Differentially Private Stochastic Convex Optimization with Heavy-tailed Data Di Wang, Hanshen Xiao, Srinivas Devadas, Jinhui Xu
Variance Reduced Coordinate Descent with Acceleration: New Method With a Surprising Application to Finite-Sum Problems Filip Hanzely, Dmitry Kovalev, Peter Richtarik
Stochastic Subspace Cubic Newton Method Filip Hanzely, Nikita Doikov, Yurii Nesterov, Peter Richtarik
Ready Policy One: World Building Through Active Learning Philip Ball, Jack Parker-Holder, Aldo Pacchiano, Krzysztof Choromanski, Stephen Roberts
Structural Language Models of Code Uri Alon, Roy Sadaka, Omer Levy, Eran Yahav
PEGASUS: Pre-training with Extracted Gap-sentences for Abstractive Summarization Jingqing Zhang, Yao Zhao, Mohammad Saleh, Peter Liu
Aggregation of Multiple Knockoffs Tuan-Binh Nguyen, Jerome-Alexis Chevalier, Thirion Bertrand, Sylvain Arlot
Off-Policy Actor-Critic with Shared Experience Replay Simon Schmitt, Matteo Hessel, Karen Simonyan
Graph-based Nearest Neighbor Search: From Practice to Theory Liudmila Prokhorenkova, Aleksandr Shekhovtsov
Policy Teaching via Environment Poisoning: Training-time Adversarial Attacks against Reinforcement Learning Amin Rakhsha, Goran Radanovic, Rati Devidze, Jerry Zhu, Adish Singla


推荐阅读