Collin Burns

Follow

Generating author description...

All published works
Action Title Year Authors
+ Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision 2023 Collin Burns
Pavel Izmailov
Jan H. Kirchner
Bowen Baker
Leo Gao
Leopold Aschenbrenner
Yining Chen
Adrien Ecoffet
Manas Joglekar
Jan Leike
+ Discovering Latent Knowledge in Language Models Without Supervision 2022 Collin Burns
Haotian Ye
Dan Klein
Jacob Steinhardt
+ PDF Chat Limitations of Post-Hoc Feature Alignment for Robustness 2021 Collin Burns
Jacob Steinhardt
+ Limitations of Post-Hoc Feature Alignment for Robustness 2021 Collin Burns
Jacob Steinhardt
+ Measuring Mathematical Problem Solving With the MATH Dataset 2021 Dan Hendrycks
Collin Burns
Saurav Kadavath
Akul Arora
Steven Basart
Eric Tang
Dawn Song
Jacob Steinhardt
+ CUAD: An Expert-Annotated NLP Dataset for Legal Contract Review 2021 Dan Hendrycks
Collin Burns
Anya Chen
Spencer Ball
+ Measuring Coding Challenge Competence With APPS 2021 Dan Hendrycks
Steven Basart
Saurav Kadavath
Mantas Mazeika
Akul Arora
Ethan Guo
Collin Burns
Samir Puranik
Horace He
Dawn Song
+ Limitations of Post-Hoc Feature Alignment for Robustness 2021 Collin Burns
Jacob Steinhardt
+ PDF Chat Interpreting Black Box Models via Hypothesis Testing 2020 Collin Burns
Jesse Thomason
Wesley Tansey
+ PDF Chat Measuring Massive Multitask Language Understanding 2020 Dan Hendrycks
Collin Burns
Steven Basart
Andy Zou
Mantas Mazeika
Dawn Song
Jacob Steinhardt
+ Aligning AI With Shared Human Values 2020 Dan Hendrycks
Collin Burns
Steven Basart
Andrew Critch
Jerry Li
Dawn Song
Jacob Steinhardt
+ Streaming Complexity of SVMs 2020 Alexandr Andoni
Collin Burns
Yi Li
Sepideh Mahabadi
David P. Woodruff
+ Measuring Massive Multitask Language Understanding 2020 Dan Hendrycks
Collin Burns
Steven Basart
Andy Zou
Mantas Mazeika
Dawn Song
Jacob Steinhardt
+ Aligning AI With Shared Human Values 2020 Dan Hendrycks
Collin Burns
Steven Basart
Andrew Critch
Jerry Li
Dawn Song
Jacob Steinhardt
+ Interpreting Black Box Models with Statistical Guarantees. 2019 Collin Burns
Jesse Thomason
Wesley Tansey
Common Coauthors
Commonly Cited References
Action Title Year Authors # of times referenced
+ RoBERTa: A Robustly Optimized BERT Pretraining Approach 2019 Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
Mike Lewis
Luke Zettlemoyer
Veselin Stoyanov
4
+ PDF Chat PIQA: Reasoning about Physical Commonsense in Natural Language 2020 Yonatan Bisk
Rowan Zellers
Ronan Le Bras
Jianfeng Gao
Yejin Choi
4
+ ImageNet-trained CNNs are biased towards texture; increasing shape bias improves accuracy and robustness 2018 Robert Geirhos
Patricia Rubisch
Claudio Michaelis
Matthias Bethge
Felix A. Wichmann
Wieland Brendel
3
+ Attention is All you Need 2017 Ashish Vaswani
Noam Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan N. Gomez
Ɓukasz Kaiser
Illia Polosukhin
3
+ Interactive Fiction Games: A Colossal Adventure 2019 Matthew Hausknecht
Prithviraj Ammanabrolu
Marc-Alexandre CÎté
Xingdi Yuan
2
+ LogiQA: A Challenge Dataset for Machine Reading Comprehension with Logical Reasoning 2020 Liu Jian
Leyang Cui
Hanmeng Liu
Dandan Huang
Yile Wang
Yue Zhang
2
+ Evaluating Prediction-Time Batch Normalization for Robustness under Covariate Shift 2020 Zachary Nado
Shreyas Padhy
D. Sculley
Alexander D’Amour
Balaji Lakshminarayanan
Jasper Snoek
2
+ Learning to Explain: An Information-Theoretic Perspective on Model Interpretation 2018 Jianbo Chen
Le Song
Martin J. Wainwright
Michael I. Jordan
2
+ Online Learning with an Unknown Fairness Metric 2018 Stephen Gillen
Christopher Jung
Michael Kearns
Aaron Roth
2
+ PDF Chat AutoDIAL: Automatic Domain Alignment Layers 2017 Fabio Maria Carlucci
Lorenzo Porzi
Barbara Caputo
Elisa Ricci
Samuel Rota BulĂČ
2
+ PDF Chat Deep CORAL: Correlation Alignment for Deep Domain Adaptation 2016 Baochen Sun
Kate Saenko
2
+ PDF Chat Cluster Alignment With a Teacher for Unsupervised Domain Adaptation 2019 Zhijie Deng
Yucen Luo
Jun Zhu
2
+ Human Instruction-Following with Deep Reinforcement Learning via Transfer-Learning from Text 2020 Felix Hill
Soƈa Mokrå
Nathaniel Wong
Tim Harley
2
+ Don’t Stop Pretraining: Adapt Language Models to Domains and Tasks 2020 Suchin Gururangan
Ana Marasović
Swabha Swayamdipta
Kyle Lo
Iz Beltagy
Doug Downey
Noah A. Smith
2
+ Estimating individual treatment effect: generalization bounds and algorithms 2016 Uri Shalit
Fredrik Johansson
David Sontag
2
+ Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps 2013 Karen Simonyan
Andrea Vedaldi
Andrew Zisserman
2
+ Decoupled Weight Decay Regularization 2017 Ilya Loshchilov
Frank Hutter
2
+ PDF Chat The mythos of model interpretability 2018 Zachary C. Lipton
2
+ A Baseline for Detecting Misclassified and Out-of-Distribution Examples in Neural Networks 2016 Dan Hendrycks
Kevin Gimpel
2
+ Optimal Transport for Multi-source Domain Adaptation under Target Shift 2018 Ievgen Redko
Nicolas Courty
RĂ©mi Flamary
Devis Tuia
2
+ Probably Approximately Metric-Fair Learning 2018 Gal Yona
Guy N. Rothblum
2
+ Unsupervised Domain Adaptation by Backpropagation 2014 Yaroslav Ganin
Victor Lempitsky
2
+ PDF Chat Two at Once: Enhancing Learning and Generalization Capacities via IBN-Net 2018 Xingang Pan
Ping Luo
Jianping Shi
Xiaoou Tang
2
+ Can you trust your model's uncertainty? Evaluating predictive uncertainty under dataset shift 2019 Yaniv Ovadia
Emily Fertig
Jie Ren
Zachary Nado
D. Sculley
Sebastian Nowozin
Joshua V. Dillon
Balaji Lakshminarayanan
Jasper Snoek
2
+ DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter 2019 Victor Sanh
Lysandre Debut
Julien Chaumond
Thomas Wolf
2
+ HuggingFace's Transformers: State-of-the-art Natural Language Processing 2019 Thomas Wolf
Lysandre Debut
Victor Sanh
Julien Chaumond
Clément Delangue
Anthony Moi
Pierric Cistac
Tim Rault
RĂ©mi Louf
Morgan Funtowicz
2
+ Scaling Laws for Neural Language Models 2020 Jared Kaplan
Sam McCandlish
Tom Henighan
T. B. Brown
Benjamin Chess
Rewon Child
Scott Gray
Alec Radford
Jeffrey Wu
Dario Amodei
2
+ Recipes for building an open-domain chatbot 2020 Stephen Roller
Emily Dinan
Naman Goyal
Da Young Ju
Mary Williamson
Yinhan Liu
Jing Xu
Myle Ott
Kurt Shuster
Eric M. Smith
2
+ Language Models are Few-Shot Learners 2020 T. B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
Prafulla Dhariwal
Arvind Neelakantan
Pranav Shyam
Girish Sastry
Amanda Askell
2
+ DeBERTa: Decoding-enhanced BERT with Disentangled Attention 2020 Pengcheng He
Xiaodong Liu
Jianfeng Gao
Weizhu Chen
2
+ HellaSwag: Can a Machine Really Finish Your Sentence?. 2019 Rowan Zellers
Ari Holtzman
Yonatan Bisk
Ali Farhadi
Yejin Choi
2
+ Gaussian Error Linear Units (GELUs) 2016 Dan Hendrycks
Kevin Gimpel
2
+ Deep reinforcement learning from human preferences 2017 Paul F. Christiano
Jan Leike
T. B. Brown
Miljan Martic
Shane Legg
Dario Amodei
2
+ Co-regularized Alignment for Unsupervised Domain Adaptation 2018 Abhishek Kumar
Prasanna Sattigeri
Kahini Wadhawan
Leonid Karlinsky
Rogério Feris
William T. Freeman
Gregory W. Wornell
2
+ PDF Chat Deep visual domain adaptation: A survey 2018 Mei Wang
Weihong Deng
2
+ Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation 2016 Yonghui Wu
Mike Schuster
Zhifeng Chen
Quoc V. Le
Mohammad Norouzi
Wolfgang Macherey
Maxim Krikun
Yuan Cao
Qin Gao
Klaus Macherey
2
+ Conditional Adversarial Domain Adaptation 2017 Mingsheng Long
Zhangjie Cao
Jianmin Wang
Michael I. Jordan
2
+ Equality of Opportunity in Supervised Learning 2016 Moritz Hardt
Eric Price
Nathan Srebro
2
+ PDF Chat Towards Empathetic Open-domain Conversation Models: A New Benchmark and Dataset 2019 Hannah Rashkin
Eric M. Smith
Margaret Li
Y-Lan Boureau
2
+ Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing 1995 Yoav Benjamini
Yosef Hochberg
2
+ PDF Chat Fairness Beyond Disparate Treatment & Disparate Impact: Learning Classification without Disparate Mistreatment 2017 Muhammad Bilal Zafar
Isabel Valera
Manuel Gomez-Rodriguez
Krishna P. Gummadi
2
+ PDF Chat Panning for Gold: ‘Model-X’ Knockoffs for High Dimensional Controlled Variable Selection 2018 Emmanuel J. Candùs
Yingying Fan
Lucas Janson
Jinchi Lv
2
+ Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift 2015 Sergey Ioffe
Christian Szegedy
2
+ Troubling Trends in Machine Learning Scholarship 2018 Zachary C. Lipton
Jacob Steinhardt
2
+ Demystifying Neural Style Transfer 2017 Yanghao Li
Naiyan Wang
Jiaying Liu
Xiaodi Hou
2
+ Towards Universal Paraphrastic Sentence Embeddings 2016 John Wieting
Mohit Bansal
Kevin Gimpel
Karen Livescu
2
+ PDF Chat Return of Frustratingly Easy Domain Adaptation 2016 Baochen Sun
Jiashi Feng
Kate Saenko
2
+ Learning Representations for Counterfactual Inference 2016 Fredrik Johansson
Uri Shalit
David Sontag
2
+ Wide Residual Networks 2016 Sergey Zagoruyko
Nikos Komodakis
2
+ Generative Language Modeling for Automated Theorem Proving. 2020 Stanislas Polu
Ilya Sutskever
2