Yanshuai Cao

All published works
Title Year Authors
+ NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks 2024 Yongchang Hao
Yanshuai Cao
Lili Mou
+ Leveraging Environment Interaction for Automated PDDL Generation and Planning with Large Language Models 2024 Sadegh Mahdavi
Raquel Aoki
Keyi Tang
Yanshuai Cao
+ EBBS: An Ensemble with Bi-Level Beam Search for Zero-Shot Machine Translation 2024 Yuqiao Wen
Behzad Shayegh
Chenyang Huang
Yanshuai Cao
Lili Mou
+ Ginger: An Efficient Curvature Approximation with Linear Complexity for General Neural Networks 2024 Yongchang Hao
Yanshuai Cao
Lili Mou
+ Ensemble Distillation for Unsupervised Constituency Parsing 2023 Behzad Shayegh
Yanshuai Cao
Xiaodan Zhu
Jackie Chi Kit Cheung
Lili Mou
+ An Equal-Size Hard EM Algorithm for Diverse Dialogue Generation 2022 Yuqiao Wen
Yongchang Hao
Yanshuai Cao
Lili Mou
+ Hierarchical Neural Data Synthesis for Semantic Parsing 2021 Wei Yang
Peng Xu
Yanshuai Cao
+ Semantic Parsing with Less Prior and More Monolingual Data. 2021 Sajad Norouzi
Yanshuai Cao
+ A Globally Normalized Neural Model for Semantic Parsing 2021 Chenyang Huang
Wei Yang
Yanshuai Cao
Osmar R. Zaïane
Lili Mou
+ Code Generation from Natural Language with Less Prior and More Monolingual Data 2021 Sajad Norouzi
Keyi Tang
Yanshuai Cao
+ Turing: an Accurate and Interpretable Multi-Hypothesis Cross-Domain Natural Language Database Interface 2021 Peng Xu
Wenjie Zi
Hamidreza Shahidi
Ákos Kádár
Keyi Tang
Wei Yang
Jawad Ateeq
Harsh Barot
Meidan Alon
Yanshuai Cao
+ Optimizing Deeper Transformers on Small Datasets 2021 Peng Xu
Dhruv Kumar
Wei Yang
Wenjie Zi
Keyi Tang
Chenyang Huang
Jackie Chi Kit Cheung
Simon J. D. Prince
Yanshuai Cao
+ Optimizing Deeper Transformers on Small Datasets: An Application on Text-to-SQL Semantic Parsing. 2020 Peng Xu
Wei Yang
Wenjie Zi
Keyi Tang
C. S. Huang
Jackie Chi Kit Cheung
Yanshuai Cao
+ Variational Hyper RNN for Sequence Modeling 2020 Ruizhi Deng
Yanshuai Cao
Bo Chang
Leonid Sigal
Greg Mori
Marcus A. Brubaker
+ Evaluating Lossy Compression Rates of Deep Generative Models 2020 Sicong Huang
Alireza Makhzani
Yanshuai Cao
Roger Grosse
+ Optimizing Deeper Transformers on Small Datasets 2020 Peng Xu
Dhruv Kumar
Wei Yang
Wenjie Zi
Keyi Tang
Chenyang Huang
Jackie Chi Kit Cheung
Simon J. D. Prince
Yanshuai Cao
+ Preventing Posterior Collapse in Sequence VAEs with Pooling 2019 Teng Fei Long
Yanshuai Cao
Jackie Chi Kit Cheung
+ Unsupervised Controllable Text Generation with Global Variation Discovery and Disentanglement. 2019 Peng Xu
Yanshuai Cao
Jackie Chi Kit Cheung
+ Better Long-Range Dependency By Bootstrapping A Mutual Information Regularizer 2019 Yanshuai Cao
Peng Xu
+ A Cross-Domain Transferable Neural Coherence Model 2019 Peng Xu
Hamidreza Saghir
Jin Sung Kang
Teng Long
Avishek Joey Bose
Yanshuai Cao
Jackie Chi Kit Cheung
+ On Variational Learning of Controllable Representations for Text without Supervision 2019 Peng Xu
Jackie Chi Kit Cheung
Yanshuai Cao
+ On Posterior Collapse and Encoder Feature Dispersion in Sequence VAEs 2019 Teng Fei Long
Yanshuai Cao
Jackie Chi Kit Cheung
+ Improving GAN Training via Binarized Representation Entropy (BRE) Regularization 2018 Yanshuai Cao
Gavin Weiguang Ding
Kry Yik-Chau Lui
Ruitong Huang
+ Adversarial Contrastive Estimation 2018 Avishek Joey Bose
Huan Ling
Yanshuai Cao
+ Few-Shot Self Reminder to Overcome Catastrophic Forgetting 2018 Junfeng Wen
Yanshuai Cao
Ruitong Huang
+ Automatic Selection of t-SNE Perplexity 2017 Yanshuai Cao
Luyu Wang
+ Implicit Manifold Learning on Generative Adversarial Networks 2017 Kry Yik Chau Lui
Yanshuai Cao
Maxime Gazeau
Kelvin Shuangjian Zhang
+ Adversarial Manipulation of Deep Representations 2015 Sara Sabour
Yanshuai Cao
Fartash Faghri
David J. Fleet
+ Efficient Optimization for Sparse Gaussian Process Regression 2015 Yanshuai Cao
Marcus A. Brubaker
David J. Fleet
Aaron Hertzmann
+ Transductive Log Opinion Pool of Gaussian Process Experts 2015 Yanshuai Cao
David J. Fleet
+ Generalized Product of Experts for Automatic and Principled Fusion of Gaussian Process Predictions 2014 Yanshuai Cao
David J. Fleet
+ Efficient Optimization for Sparse Gaussian Process Regression 2013 Yanshuai Cao
Marcus A. Brubaker
David J. Fleet
Aaron Hertzmann
Commonly Cited References
Title Year Authors # of times referenced
+ Adam: A Method for Stochastic Optimization 2014 Diederik P. Kingma
Jimmy Ba
8
+ Towards Complex Text-to-SQL in Cross-Domain Database with Intermediate Representation 2019 Jiaqi Guo
Zecheng Zhan
Yan Gao
Yan Xiao
Jian-Guang Lou
Ting Liu
Dongmei Zhang
7
+ Attention is All you Need 2017 Ashish Vaswani
Noam Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan N. Gomez
Łukasz Kaiser
Illia Polosukhin
6
+ RoBERTa: A Robustly Optimized BERT Pretraining Approach 2019 Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
Mike Lewis
Luke Zettlemoyer
Veselin Stoyanov
5
+ Spider: A Large-Scale Human-Labeled Dataset for Complex and Cross-Domain Semantic Parsing and Text-to-SQL Task 2018 Tao Yu
Rui Zhang
Kai Yang
Michihiro Yasunaga
Dongxu Wang
Zifan Li
James Ma
Irene Li
Qingning Yao
Shanelle Roman
5
+ Rethinking the Inception Architecture for Computer Vision 2016 Christian Szegedy
Vincent Vanhoucke
Sergey Ioffe
Jon Shlens
Zbigniew Wojna
5
+ Improving Text-to-SQL Evaluation Methodology 2018 Catherine Finegan-Dollak
Jonathan K. Kummerfeld
Li Zhang
Karthik Ramanathan
Sesh Sadasivam
Rui Zhang
Dragomir Radev
4
+ RAT-SQL: Relation-Aware Schema Encoding and Linking for Text-to-SQL Parsers 2020 Bailin Wang
Richard Shin
Xiaodong Liu
Oleksandr Polozov
Matthew Richardson
4
+ Self-Attention with Relative Position Representations 2018 Peter Shaw
Jakob Uszkoreit
Ashish Vaswani
4
+ A Syntactic Neural Model for General-Purpose Code Generation 2017 Pengcheng Yin
Graham Neubig
4
+ Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning 2017 Victor W. Zhong
Caiming Xiong
Richard Socher
4
+ Supervised Learning of Universal Sentence Representations from Natural Language Inference Data 2017 Alexis Conneau
Douwe Kiela
Holger Schwenk
Loïc Barrault
Antoine Bordes
4
+ Efficient Optimization for Sparse Gaussian Process Regression 2015 Yanshuai Cao
Marcus A. Brubaker
David J. Fleet
Aaron Hertzmann
3
+ Improving Deep Transformer with Depth-Scaled Initialization and Merged Attention 2019 Biao Zhang
Ivan Titov
Rico Sennrich
3
+ Training Tips for the Transformer Model 2018 Martin Popel
Ondřej Bojar
3
+ Photon: A Robust Cross-Domain Text-to-SQL System 2020 Jichuan Zeng
Xi Lin
Steven C. H. Hoi
Richard Socher
Caiming Xiong
Michael R. Lyu
Irwin King
3
+ Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks 2015 Alec Radford
Luke Metz
Soumith Chintala
3
+ Spider: A Large-Scale Human-Labeled Dataset for Complex and Cross-Domain Semantic Parsing and Text-to-SQL Task 2018 Changyuan Yu
Rui Zhang
Kai Yang
Michihiro Yasunaga
Dongxu Wang
Zifan Li
James Ma
Irene Li
Qingning Yao
Shanelle Roman
3
+ Learning Longer-term Dependencies in RNNs with Auxiliary Losses 2018 Trieu H. Trinh
Andrew M. Dai
Minh-Thang Luong
Quoc V. Le
3
+ RAT-SQL: Relation-Aware Schema Encoding and Linking for Text-to-SQL Parsers. 2019 Bailin Wang
Richard Shin
Xiaodong Liu
Oleksandr Polozov
Matthew Richardson
3
+ RYANSQL: Recursively Applying Sketch-based Slot Fillings for Complex Text-to-SQL in Cross-Domain Databases 2020 Donghyun Choi
Myeong Cheol Shin
EungGyun Kim
Dong Ryeol Shin
3
+ RAT-SQL: Relation-Aware Schema Encoding and Linking for Text-to-SQL Parsers 2019 Bailin Wang
Richard Shin
Xiaodong Liu
Oleksandr Polozov
Matthew Richardson
3
+ ALBERT: A Lite BERT for Self-supervised Learning of Language Representations 2019 Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
3
+ Importance Weighted Autoencoders 2015 Yuri Burda
Roger Grosse
Ruslan Salakhutdinov
3
+ TRANX: A Transition-based Neural Abstract Syntax Parser for Semantic Parsing and Code Generation 2018 Pengcheng Yin
Graham Neubig
3
+ Deep Contextualized Word Representations 2018 Matthew E. Peters
Mark E Neumann
Mohit Iyyer
Matt Gardner
Christopher Clark
Kenton Lee
Luke Zettlemoyer
3
+ The Best of Both Worlds: Combining Recent Advances in Neural Machine Translation 2018 Mia Xu Chen
Orhan Fırat
Ankur Bapna
Melvin Johnson
Wolfgang Macherey
George Foster
Llion Jones
Mike Schuster
Noam Shazeer
Niki Parmar
3
+ Language to Logical Form with Neural Attention 2016 Li Dong
Mirella Lapata
3
+ Incorporating Copying Mechanism in Sequence-to-Sequence Learning 2016 Jiatao Gu
Zhengdong Lu
Hang Li
Victor O. K. Li
3
+ Energy-based Generative Adversarial Network 2016 Junbo Zhao
Michaël Mathieu
Yann LeCun
3
+ Learning Deep Transformer Models for Machine Translation 2019 Qiang Wang
Bei Li
Tong Xiao
Jingbo Zhu
Changliang Li
Derek F. Wong
Lidia S. Chao
3
+ Improving GAN Training via Binarized Representation Entropy (BRE) Regularization 2018 Yanshuai Cao
Gavin Weiguang Ding
Kry Yik-Chau Lui
Ruitong Huang
3
+ GraPPa: Grammar-Augmented Pre-Training for Table Semantic Parsing 2020 Tao Yu
Chien-Sheng Wu
Xi Lin
Bailin Wang
Yi Chern Tan
Xinyi Yang
Dragomir Radev
Richard Socher
Caiming Xiong
3
+ On Using Monolingual Corpora in Neural Machine Translation 2015 Çağlar Gülçehre
Orhan Fırat
Kelvin Xu
Kyunghyun Cho
Loïc Barrault
Huei-Chi Lin
Fethi Bougares
Holger Schwenk
Yoshua Bengio
2
+ Improving Neural Machine Translation Models with Monolingual Data 2016 Rico Sennrich
Barry Haddow
Alexandra Birch
2
+ Improving neural networks by preventing co-adaptation of feature detectors 2012 Geoffrey E. Hinton
Nitish Srivastava
Alex Krizhevsky
Ilya Sutskever
Ruslan Salakhutdinov
2
+ Noise-contrastive estimation of unnormalized statistical models, with applications to natural image statistics 2012 Michael U. Gutmann
Aapo Hyvärinen
2
+ Simple Fusion: Return of the Language Model 2018 Felix Stahlberg
James Cross
Veselin Stoyanov
2
+ Generating Sentences from a Continuous Space 2016 Samuel R. Bowman
Luke Vilnis
Oriol Vinyals
Andrew M. Dai
Rafał Józefowicz
Samy Bengio
2
+ Self-Critical Sequence Training for Image Captioning 2017 Steven J. Rennie
Etienne Marcheret
Youssef Mroueh
Jerret Ross
Vaibhava Goel
2
+ Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer 2017 Noam Shazeer
Azalia Mirhoseini
Krzysztof Maziarz
Andy Davis
Quoc V. Le
Geoffrey E. Hinton
Jeff Dean
2
+ Notes on Noise Contrastive Estimation and Negative Sampling 2014 Chris Dyer
2
+ A Fast and Simple Algorithm for Training Neural Probabilistic Language Models 2012 Andriy Mnih
Yee Whye Teh
2
+ Generalization and Equilibrium in Generative Adversarial Nets (GANs) 2017 Sanjeev Arora
Rong Ge
Yingyu Liang
Tengyu Ma
Yi Zhang
2
+ Latent Intention Dialogue Models 2017 Tsung-Hsien Wen
Yishu Miao
Phil Blunsom
Steve Young
2
+ BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding 2018 Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
2
+ Predictive low-rank decomposition for kernel methods 2005 Francis Bach
Michael I. Jordan
2
+ Incorporating GAN for Negative Sampling in Knowledge Representation Learning 2018 Peifeng Wang
Shuangyin Li
Rong Pan
2
+ The Concrete Distribution: A Continuous Relaxation of Discrete Random Variables 2016 Chris J. Maddison
Andriy Mnih
Yee Whye Teh
2
+ Using Monolingual Data in Neural Machine Translation: a Systematic Study 2018 Franck Burlot
François Yvon
2