Ruoyu Sun

Follow

Generating author description...

All published works
Action Title Year Authors
+ PDF Chat When GNNs meet symmetry in ILPs: an orbit-based feature augmentation approach 2025 Qian Chen
Lei Li
Qian Li
Jianghua Wu
Akang Wang
Ruoyu Sun
Xiaodong Luo
Tsung‐Hui Chang
Qingjiang Shi
+ PDF Chat An Efficient Unsupervised Framework for Convex Quadratic Programs via Deep Unrolling 2024 Linxin Yang
Bingheng Li
Tian Ding
Jianghua Wu
Akang Wang
Yuyi Wang
Jiliang Tang
Ruoyu Sun
Xiaodong Luo
+ PDF Chat On Representing Convex Quadratically Constrained Quadratic Programs via Graph Neural Networks 2024 Chenyang Wu
Qian Chen
Akang Wang
Tian Ding
Ruoyu Sun
Wenguo Yang
Qingjiang Shi
+ PDF Chat Provable Adaptivity of Adam under Non-uniform Smoothness 2024 B. Wang
Y. Q. Zhang
Huishuai Zhang
Qi Meng
Ruoyu Sun
Zhi-Ming Ma
Tie‐Yan Liu
Zhi‐Quan Luo
Wei Chen
+ PDF Chat On the Convergence of Adam under Non-uniform Smoothness: Separability from SGDM and Beyond 2024 Bohan Wang
Huishuai Zhang
Meng Qi
Ruoyu Sun
Zhi-Ming Ma
Wei Chen
+ PDF Chat Bridging the Gap: Rademacher Complexity in Robust and Standard Generalization. 2024 Jiancong Xiao
Ruoyu Sun
Qi Long
Weijie Su
+ PDF Chat AceGPT, Localizing Large Language Models in Arabic 2024 Huang Huang
Fei Yu
Jianqing Zhu
Xuening Sun
Hao Cheng
Song Dingjie
Zhihong Chen
Mosen Alharthi
Bang An
Juncai He
+ Restricted Generative Projection for One-Class Classification and Anomaly Detection 2023 Feng Xiao
Ruoyu Sun
Jicong Fan
+ AceGPT, Localizing Large Language Models in Arabic 2023 Huang Huang
F. Richard Yu
Jianqing Zhu
Xuening Sun
Hao Cheng
Dingjie Song
Zhi Hong Chen
Abdulmohsen Alharthi
Bang An
Ziche Liu
+ PAC-Bayesian Spectrally-Normalized Bounds for Adversarially Robust Generalization 2023 Jiancong Xiao
Ruoyu Sun
Zhi‐Quan Luo
+ PDF Chat Revisiting Landscape Analysis in Deep Neural Networks: Eliminating Decreasing Paths to Infinity 2022 Shiyu Liang
Ruoyu Sun
R. Srikant
+ On the Benefit of Width for Neural Networks: Disappearance of Basins 2022 Dawei Li
Tian Ding
Ruoyu Sun
+ PDF Chat Global Convergence of MAML and Theory-Inspired Neural Architecture Search for Few-Shot Learning 2022 Haoxiang Wang
Yite Wang
Ruoyu Sun
Bo Li
+ PDF Chat On the landscape of one-hidden-layer sparse networks and beyond 2022 Dachao Lin
Ruoyu Sun
Zhihua Zhang
+ Momentum Doesn't Change the Implicit Bias. 2021 Bohan Wang
Qi Meng
Huishuai Zhang
Ruoyu Sun
Wei Chen
Zhi-Ming Ma
+ On a Faster $R$-Linear Convergence Rate of the Barzilai-Borwein Method 2021 Dawei Li
Ruoyu Sun
+ Achieving Small Test Error in Mildly Overparameterized Neural Networks 2021 Shiyu Liang
Ruoyu Sun
R. Srikant
+ Federated Semi-Supervised Learning with Class Distribution Mismatch 2021 Zhiguo Wang
Xintong Wang
Ruoyu Sun
Tsung‐Hui Chang
+ Two Symmetrized Coordinate Descent Methods Can Be $O(n^2)$ Times Slower Than the Randomized Version 2021 Peijun Xiao
Zhisheng Xiao
Ruoyu Sun
+ Towards Understanding the Impact of Model Size on Differential Private Classification 2021 Yinchen Shen
Zhiguo Wang
Ruoyu Sun
Xiaojing Shen
+ Towards a Better Global Loss Landscape of GANs 2020 Ruoyu Sun
Tiantian Fang
Alex Schwing
+ PDF Chat A Single-Loop Smoothed Gradient Descent-Ascent Algorithm for Nonconvex-Concave Min-Max Problems 2020 Jiawei Zhang
Peijun Xiao
Ruoyu Sun
Zhi Luo
+ PDF Chat The Global Landscape of Neural Networks: An Overview 2020 Ruoyu Sun
Dawei Li
Shiyu Liang
Tian Ding
R. Srikant
+ Global Convergence and Induced Kernels of Gradient-Based Meta-Learning with Neural Nets. 2020 Haoxiang Wang
Ruoyu Sun
Bo Li
+ DEED: A General Quantization Scheme for Communication Efficiency in Bits 2020 Tian Ye
Peijun Xiao
Ruoyu Sun
+ The Global Landscape of Neural Networks: An Overview 2020 Ruoyu Sun
Dawei Li
Shiyu Liang
Tian Ding
R. Srikant
+ A Single-Loop Smoothed Gradient Descent-Ascent Algorithm for Nonconvex-Concave Min-Max Problems 2020 Jiawei Zhang
Peijun Xiao
Ruoyu Sun
Zhi‐Quan Luo
+ Global Convergence and Generalization Bound of Gradient-Based Meta-Learning with Deep Neural Nets 2020 Haoxiang Wang
Ruoyu Sun
Bo Li
+ PDF Chat On the Efficiency of Random Permutation for ADMM and Coordinate Descent 2019 Ruoyu Sun
Zhi-Quan Luo
Yinyu Ye
+ Sub-Optimal Local Minima Exist for Almost All Over-parameterized Neural Networks. 2019 Tian Ding
Dawei Li
Ruoyu Sun
+ PDF Chat Worst-case complexity of cyclic coordinate descent: $$O(n^2)$$ gap with randomized version 2019 Ruoyu Sun
Yinyu Ye
+ PDF Chat Max-Sliced Wasserstein Distance and Its Use for GANs 2019 Ishan Deshpande
Yuan-Ting Hu
Ruoyu Sun
Ayis Pyrros
Nasir Siddiqui
Sanmi Koyejo
Zhizhen Zhao
David Forsyth
Alexander G. Schwing
+ PDF Chat Globally Optimal Joint Uplink Base Station Association and Beamforming 2019 Wei Liu
Ruoyu Sun
Zhi‐Quan Luo
+ Understanding Limitation of Two Symmetrized Orders by Worst-case Complexity 2019 Peijun Xiao
Zhisheng Xiao
Ruoyu Sun
+ Optimization for deep learning: theory and algorithms 2019 Ruoyu Sun
+ Revisiting Landscape Analysis in Deep Neural Networks: Eliminating Decreasing Paths to Infinity 2019 Shiyu Liang
Ruoyu Sun
R. Srikant
+ Sub-Optimal Local Minima Exist for Neural Networks with Almost All Non-Linear Activations 2019 Tian Ding
Dawei Li
Ruoyu Sun
+ Over-Parameterized Deep Neural Networks Have No Strict Local Minima For Any Continuous Activations. 2018 Dawei Li
Tian Ding
Ruoyu Sun
+ Adding one neuron can eliminate all bad local minima 2018 Shiyu Liang
Ruoyu Sun
Jason D. Lee
R. Srikant
+ On the Convergence of A Class of Adam-Type Algorithms for Non-Convex Optimization 2018 Xiangyi Chen
Sijia Liu
Ruoyu Sun
Mingyi Hong
+ Understanding the Loss Surface of Neural Networks for Binary Classification 2018 Shiyu Liang
Ruoyu Sun
Yixuan Li
R. Srikant
+ Adding One Neuron Can Eliminate All Bad Local Minima 2018 Shiyu Liang
Ruoyu Sun
Jason D. Lee
R. Srikant
+ On the Convergence of A Class of Adam-Type Algorithms for Non-Convex Optimization 2018 Xiangyi Chen
Sijia Liu
Ruoyu Sun
Mingyi Hong
+ On the Benefit of Width for Neural Networks: Disappearance of Bad Basins 2018 Dawei Li
Tian Ding
Ruoyu Sun
+ Training Language Models Using Target-Propagation 2017 Sam Wiseman
Sumit Chopra
Marc’Aurelio Ranzato
Arthur Szlam
Ruoyu Sun
Soumith Chintala
Nicolas Vasilache
+ Guaranteed Matrix Completion via Non-Convex Factorization 2016 Ruoyu Sun
Zhi‐Quan Luo
+ Worst-case Complexity of Cyclic Coordinate Descent: $O(n^2)$ Gap with Randomized Version 2016 Ruoyu Sun
Yinyu Ye
+ PDF Chat Guaranteed Matrix Completion via Nonconvex Factorization 2015 Ruoyu Sun
Zhi‐Quan Luo
+ Matrix Completion via Nonconvex Factorization: Algorithms and Theory 2015 Ruoyu Sun
+ PDF Chat Joint Downlink Base Station Association and Power Control for Max-Min Fairness: Computation and Complexity 2015 Ruoyu Sun
Mingyi Hong
Zhi‐Quan Luo
+ Globally Optimal Joint Uplink Base Station Association and Beamforming 2015 Wei Liu
Ruoyu Sun
Zhi‐Quan Luo
Jiandong Li
+ On the Efficiency of Random Permutation for ADMM and Coordinate Descent 2015 Ruoyu Sun
Zhi‐Quan Luo
Yinyu Ye
+ Improved Iteration Complexity Bounds of Cyclic Block Coordinate Descent for Convex Problems 2015 Ruoyu Sun
Mingyi Hong
+ PDF Chat Interference Alignment Using Finite and Dependent Channel Extensions: The Single Beam Case 2014 Ruoyu Sun
Zhi‐Quan Luo
+ Cross Layer Provision of Future Cellular Networks 2014 Hadi Baligh
Moonki Hong
Wei Liao
Z.-Q. Luo
Meisam Razaviyayn
Maziar Sanjabi
Ruoyu Sun
+ PDF Chat Joint Base Station Clustering and Beamformer Design for Partial Coordinated Transmission in Heterogeneous Networks 2013 Mingyi Hong
Ruoyu Sun
Hadi Baligh
Zhi‐Quan Luo
+ Joint Base Station Clustering and Beamformer Design for Partial Coordinated Transmission in Heterogenous Networks 2012 Mingyi Hong
Ruoyu Sun
Hadi Baligh
Zhi‐Quan Luo
Common Coauthors
Commonly Cited References
Action Title Year Authors # of times referenced
+ Neural Tangent Kernel: Convergence and Generalization in Neural Networks 2018 Arthur Paul Jacot
Franck Gabriel
Clément Hongler
12
+ Deep Learning without Poor Local Minima 2016 Kenji Kawaguchi
12
+ PDF Chat Efficiency of Coordinate Descent Methods on Huge-Scale Optimization Problems 2012 Yu. Nesterov
12
+ On the loss landscape of a class of deep neural networks with no bad local valleys 2018 Quynh L. Nguyen
Mahesh Chandra Mukkamala
Matthias Hein
11
+ A Convergence Theory for Deep Learning via Over-Parameterization 2018 Zeyuan Allen-Zhu
Yuanzhi Li
Zhao Song
11
+ Convergence of a Block Coordinate Descent Method for Nondifferentiable Minimization 2001 P. Tseng
11
+ PDF Chat Deep Residual Learning for Image Recognition 2016 Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
10
+ On the Benefit of Width for Neural Networks: Disappearance of Bad Basins 2018 Dawei Li
Tian Ding
Ruoyu Sun
9
+ The Loss Surfaces of Multilayer Networks 2015 Anna Choromanska
Mikael Henaff
Michaël Mathieu
Gérard Ben Arous
Yann LeCun
9
+ PDF Chat A Unified Convergence Analysis of Block Successive Minimization Methods for Nonsmooth Optimization 2013 Meisam Razaviyayn
Mingyi Hong
Zhi-Quan Luo
8
+ PDF Chat Joint Base Station Clustering and Beamformer Design for Partial Coordinated Transmission in Heterogeneous Networks 2013 Mingyi Hong
Ruoyu Sun
Hadi Baligh
Zhi‐Quan Luo
8
+ Beating the Perils of Non-Convexity: Guaranteed Training of Neural Networks using Tensor Methods 2015 Majid Janzamin
Hanie Sedghi
Anima Anandkumar
8
+ Topology and Geometry of Half-Rectified Network Optimization 2016 C. Daniel Freeman
Joan Bruna
8
+ Understanding the Loss Surface of Neural Networks for Binary Classification 2018 Shiyu Liang
Ruoyu Sun
Yixuan Li
R. Srikant
8
+ Exponentially vanishing sub-optimal local minima in multilayer neural networks 2017 Daniel Soudry
Elad Hoffer
7
+ Theoretical Insights Into the Optimization Landscape of Over-Parameterized Shallow Neural Networks 2018 Mahdi Soltanolkotabi
Adel Javanmard
Jason D. Lee
7
+ Learning ReLU Networks on Linearly Separable Data: Algorithm, Optimality, and Generalization 2019 Gang Wang
Georgios B. Giannakis
Jie Chen
7
+ PDF Chat Parallel stochastic gradient algorithms for large-scale matrix completion 2013 Benjamin Recht
Christopher Ré
7
+ Globally Optimal Gradient Descent for a ConvNet with Gaussian Inputs 2017 Alon Brutzkus
Amir Globerson
7
+ Elimination of All Bad Local Minima in Deep Learning 2019 Kenji Kawaguchi
Leslie Pack Kaelbling
7
+ PDF Chat Randomized Methods for Linear Constraints: Convergence Rates and Conditioning 2010 D. Leventhal
Adrian S. Lewis
7
+ Revisiting Landscape Analysis in Deep Neural Networks: Eliminating Decreasing Paths to Infinity 2019 Shiyu Liang
Ruoyu Sun
R. Srikant
6
+ PDF Chat Densely Connected Convolutional Networks 2017 Gao Huang
Zhuang Liu
Laurens van der Maaten
Kilian Q. Weinberger
6
+ PDF Chat Coordinate descent algorithms 2015 Stephen J. Wright
6
+ PDF Chat On the convergence of the block nonlinear Gauss–Seidel method under convex constraints 2000 Luigi Grippo
Marco Sciandrone
6
+ Depth Creates No Bad Local Minima 2017 Haihao Lu
Kenji Kawaguchi
6
+ PDF Chat Jamming transition as a paradigm to understand the loss landscape of deep neural networks 2019 Mario Geiger
Stefano Spigler
Stéphane d’Ascoli
Levent Sagun
Marco Baity‐Jesi
Giulio Biroli
Matthieu Wyart
6
+ Stochastic Gradient Descent Optimizes Over-parameterized Deep ReLU Networks 2018 Difan Zou
Yuan Cao
Dongruo Zhou
Quanquan Gu
6
+ Adding one neuron can eliminate all bad local minima 2018 Shiyu Liang
Ruoyu Sun
Jason D. Lee
R. Srikant
5
+ Nonlinear Programming 1995 Dimitri P. Bertsekas
5
+ Provable Methods for Training Neural Networks with Sparse Connectivity 2014 Hanie Sedghi
Anima Anandkumar
5
+ PDF Chat Iteration complexity of randomized block-coordinate descent methods for minimizing a composite function 2012 Peter Richtárik
Martin Takáč
5
+ On Exact Computation with an Infinitely Wide Neural Net 2019 Sanjeev Arora
Simon S. Du
Wei Hu
Zhiyuan Li
Ruslan Salakhutdinov
Ruosong Wang
5
+ On the Connection Between Learning Two-Layers Neural Networks and Tensor Decomposition 2018 Marco Mondelli
Andrea Montanari
5
+ Critical Points of Linear Neural Networks: Analytical Forms and Landscape Properties 2018 Yi Zhou
Yingbin Liang
5
+ PDF Chat On the linear convergence of the alternating direction method of multipliers 2016 Mingyi Hong
Zhi‐Quan Luo
5
+ On the Convergence of Block Coordinate Descent Type Methods 2013 Amir Beck
Luba Tetruashvili
5
+ Spurious Valleys in Two-layer Neural Network Optimization Landscapes 2018 Luca Venturi
Afonso S. Bandeira
Joan Bruna
5
+ Essentially No Barriers in Neural Network Energy Landscape 2018 Felix Draxler
Kambis Veschgini
Manfred Salmhofer
Fred A. Hamprecht
5
+ PDF Chat Parallel Multi-Block ADMM with o(1 / k) Convergence 2016 Wei Deng
Ming‐Jun Lai
Zhimin Peng
Wotao Yin
4
+ Mean Field Analysis of Neural Networks 2018 Justin Sirignano
Konstantinos Spiliopoulos
4
+ Neural networks as Interacting Particle Systems: Asymptotic convexity of the Loss Landscape and Universal Scaling of the Approximation Error 2018 Grant M. Rotskoff
Eric Vanden‐Eijnden
4
+ PDF Chat Efficient random coordinate descent algorithms for large-scale structured nonconvex optimization 2014 Andrei Pătraşcu
Ion Necoara
4
+ On Full Jacobian Decomposition of the Augmented Lagrangian Method for Separable Convex Programming 2015 Bingsheng He
Liusheng Hou
Xiaoming Yuan
4
+ An Accelerated Randomized Proximal Coordinate Gradient Method and its Application to Regularized Empirical Risk Minimization 2015 Qihang Lin
Zhaosong Lu
Lin Xiao
4
+ The Zero Set of a Real Analytic Function 2015 Boris Mityagin
4
+ Worst-case Complexity of Cyclic Coordinate Descent: $O(n^2)$ Gap with Randomized Version 2016 Ruoyu Sun
Yinyu Ye
4
+ A Mean Field View of the Landscape of Two-Layers Neural Networks 2018 Mei Song
Andrea Montanari
Phan-Minh Nguyen
4
+ Porcupine Neural Networks: (Almost) All Local Optima are Global. 2017 Soheil Feizi
Hamid Javadi
Jesse M. Zhang
David Tse
4
+ On the Convergence of Adam and Beyond 2019 Sashank J. Reddi
Satyen Kale
Sanjiv Kumar
4