+
|
TapeAgents: a Holistic Framework for Agent Development and Optimization
|
2024
|
Dzmitry Bahdanau
Nicolas Gontier
George Q. Huang
Ehsan Kamalloo
Rafael Pardinas
Alex Piché
Torsten Scholak
Oleh Shliazhko
J.P. Tremblay
Karam Ghanem
|
+
|
NNetscape Navigator: Complex Demonstrations for Web Agents Without a Demonstrator
|
2024
|
Shikhar Murty
Dzmitry Bahdanau
Christopher D. Manning
|
+
|
LLMs can learn self-restraint through iterative self-reflection
|
2024
|
Alexandre Piché
Aristides Milios
Dzmitry Bahdanau
Chris Pal
|
+
|
LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders
|
2024
|
Parishad BehnamGhader
Vaibhav Adlakha
Marius Mosbach
Dzmitry Bahdanau
Nicolas Chapados
Siva Reddy
|
+
|
Evaluating In-Context Learning of Libraries for Code Generation
|
2024
|
Arkil Patel
Siva Reddy
Dzmitry Bahdanau
Pradeep Dasigi
|
+
|
SantaCoder: don't reach for the stars!
|
2023
|
Loubna Ben Allal
Raymond Li
Denis Kocetkov
Chenghao Mou
Christopher Akiki
Carlos Muñoz Ferrandis
Niklas Muennighoff
Mayank Mishra
Alex Gu
Manan Dey
|
+
|
StarCoder: may the source be with you!
|
2023
|
Raymond Li
Loubna Ben Allal
Yangtian Zi
Niklas Muennighoff
Denis Kocetkov
Chenghao Mou
Marc Marone
Christopher Akiki
Jia Li
Jenny Chim
|
+
|
RepoFusion: Training Code Models to Understand Your Repository
|
2023
|
Disha Shrivastava
Denis Kocetkov
Harm de Vries
Dzmitry Bahdanau
Torsten Scholak
|
+
|
In-Context Learning for Text Classification with Many Labels
|
2023
|
Aristides Milios
Siva Reddy
Dzmitry Bahdanau
|
+
|
MAGNIFICo: Evaluating the In-Context Learning Ability of Large Language Models to Generalize to Novel Interpretations
|
2023
|
Arkil Patel
Satwik Bhattamishra
Siva Reddy
Dzmitry Bahdanau
|
+
|
PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation
|
2023
|
Gaurav Sahu
Olga Vechtomova
Dzmitry Bahdanau
Issam Laradji
|
+
|
Evaluating In-Context Learning of Libraries for Code Generation
|
2023
|
Arkil Patel
Siva Reddy
Dzmitry Bahdanau
Pradeep Dasigi
|
+
|
PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation
|
2023
|
Gaurav Sahu
Olga Vechtomova
Dzmitry Bahdanau
Issam Laradji
|
+
|
MAGNIFICo: Evaluating the In-Context Learning Ability of Large Language Models to Generalize to Novel Interpretations
|
2023
|
Arkil Patel
Satwik Bhattamishra
Siva Reddy
Dzmitry Bahdanau
|
+
|
In-Context Learning for Text Classification with Many Labels
|
2023
|
Aristides Milios
Siva Reddy
Dzmitry Bahdanau
|
+
|
Compositional Generalization in Dependency Parsing
|
2022
|
Emily Goodwin
Siva Reddy
Timothy O'Donnell
Dzmitry Bahdanau
|
+
|
Data Augmentation for Intent Classification with Off-the-shelf Large Language Models
|
2022
|
Gaurav Sahu
Pau Rodríguez
Issam Laradji
Parmida Atighehchian
David Vázquez
Dzmitry Bahdanau
|
+
|
Evaluating the Text-to-SQL Capabilities of Large Language Models
|
2022
|
Nitarshan Rajkumar
Raymond Li
Dzmitry Bahdanau
|
+
|
LAGr: Label Aligned Graphs for Better Systematic Generalization in Semantic Parsing
|
2022
|
Dora Jambor
Dzmitry Bahdanau
|
+
|
Data Augmentation for Intent Classification with Off-the-shelf Large Language Models
|
2022
|
Gaurav Sahu
Pau Rodríguez
Issam Laradji
Parmida Atighehchian
David Vázquez
Dzmitry Bahdanau
|
+
|
On the Compositional Generalization Gap of In-Context Learning
|
2022
|
Arian Hosseini
Ankit Vani
Dzmitry Bahdanau
Alessandro Sordoni
Aaron Courville
|
+
|
The Stack: 3 TB of permissively licensed source code
|
2022
|
Denis Kocetkov
Raymond Li
Loubna Ben Allal
Jia Li
Chenghao Mou
Carlos Muñoz Ferrandis
Yacine Jernite
Margaret Mitchell
Sean Hughes
Thomas Wolf
|
+
|
On the Compositional Generalization Gap of In-Context Learning
|
2022
|
Arian Hosseini
Ankit Vani
Dzmitry Bahdanau
Alessandro Sordoni
Aaron Courville
|
+
|
Systematic Generalization with Edge Transformers
|
2021
|
Leon Bergen
Timothy O'Donnell
Dzmitry Bahdanau
|
+
|
LAGr: Labeling Aligned Graphs for Improving Systematic Generalization in Semantic Parsing
|
2021
|
Dora Jambor
Dzmitry Bahdanau
|
+
|
PICARD: Parsing Incrementally for Constrained Auto-Regressive Decoding from Language Models
|
2021
|
Torsten Scholak
Nathan Schucher
Dzmitry Bahdanau
|
+
|
Combating False Negatives in Adversarial Imitation Learning
|
2021
|
Konrad Żołna
Chitwan Saharia
Léonard Boussioux
David Yu-Tung Hui
Maxime Chevalier-Boisvert
Dzmitry Bahdanau
Yoshua Bengio
|
+
|
Jointly Learning Truth-Conditional Denotations and Groundings using Parallel Attention
|
2021
|
Leon Bergen
Dzmitry Bahdanau
Timothy J. O'Donnell
|
+
|
Understanding by Understanding Not: Modeling Negation in Language Models
|
2021
|
Arian Hosseini
Siva Reddy
Dzmitry Bahdanau
R Devon Hjelm
Alessandro Sordoni
Aaron Courville
|
+
|
DuoRAT: Towards Simpler Text-to-SQL Models
|
2021
|
Torsten Scholak
Raymond Li
Dzmitry Bahdanau
Harm de Vries
Chris Pal
|
+
|
PICARD: Parsing Incrementally for Constrained Auto-Regressive Decoding from Language Models
|
2021
|
Torsten Scholak
Nathan Schucher
Dzmitry Bahdanau
|
+
|
LAGr: Labeling Aligned Graphs for Improving Systematic Generalization in Semantic Parsing
|
2021
|
Dora Jambor
Dzmitry Bahdanau
|
+
|
Compositional Generalization in Dependency Parsing
|
2021
|
Emily Goodwin
Siva Reddy
Timothy J. O'Donnell
Dzmitry Bahdanau
|
+
|
PICARD: Parsing Incrementally for Constrained Auto-Regressive Decoding from Language Models
|
2021
|
Torsten Scholak
Nathan Schucher
Dzmitry Bahdanau
|
+
|
Understanding by Understanding Not: Modeling Negation in Language Models
|
2021
|
Arian Hosseini
Siva Reddy
Dzmitry Bahdanau
R Devon Hjelm
Alessandro Sordoni
Aaron Courville
|
+
|
Systematic Generalization with Edge Transformers
|
2021
|
Leon Bergen
Timothy O'Donnell
Dzmitry Bahdanau
|
+
|
BabyAI 1.1
|
2020
|
David Yu-Tung Hui
Maxime Chevalier-Boisvert
Dzmitry Bahdanau
Yoshua Bengio
|
+
|
Towards Ecologically Valid Research on Language User Interfaces
|
2020
|
Harm de Vries
Dzmitry Bahdanau
Christopher D. Manning
|
+
|
Combating False Negatives in Adversarial Imitation Learning
|
2020
|
Konrad Żołna
Chitwan Saharia
Léonard Boussioux
David Yu-Tung Hui
Maxime Chevalier-Boisvert
Dzmitry Bahdanau
Yoshua Bengio
|
+
|
BabyAI 1.1
|
2020
|
David Yu-Tung Hui
Maxime Chevalier-Boisvert
Dzmitry Bahdanau
Yoshua Bengio
|
+
|
CLOSURE: Assessing Systematic Generalization of CLEVR Models
|
2019
|
Dzmitry Bahdanau
Harm de Vries
Timothy O'Donnell
Shikhar Murty
Philippe Beaudoin
Yoshua Bengio
Aaron Courville
|
+
|
CLOSURE: Assessing Systematic Generalization of CLEVR Models
|
2019
|
Dzmitry Bahdanau
Harm de Vries
Timothy O'Donnell
Shikhar Murty
Philippe Beaudoin
Yoshua Bengio
Aaron Courville
|
+
|
Automated curriculum generation for Policy Gradients from Demonstrations
|
2019
|
Anirudh Srinivasan
Dzmitry Bahdanau
Maxime Chevalier-Boisvert
Yoshua Bengio
|
+
|
CLOSURE: Assessing Systematic Generalization of CLEVR Models
|
2019
|
Dzmitry Bahdanau
Harm de Vries
Timothy O'Donnell
Shikhar Murty
Philippe Beaudoin
Yoshua Bengio
Aaron Courville
|
+
|
Systematic Generalization: What Is Required and Can It Be Learned?
|
2018
|
Dzmitry Bahdanau
Shikhar Murty
Michael Noukhovitch
Thien Huu Nguyen
Harm de Vries
Aaron Courville
|
+
|
BabyAI: First Steps Towards Grounded Language Learning With a Human In the Loop
|
2018
|
Maxime Chevalier-Boisvert
Dzmitry Bahdanau
Salem Lahlou
Lucas Willems
Chitwan Saharia
Thien Huu Nguyen
Yoshua Bengio
|
+
|
Systematic Generalization: What Is Required and Can It Be Learned?
|
2018
|
Dzmitry Bahdanau
Shikhar Murty
Michael Noukhovitch
Thien Huu Nguyen
Harm de Vries
Aaron Courville
|
+
|
Learning to Follow Language Instructions with Adversarial Reward Induction
|
2018
|
Dzmitry Bahdanau
Felix Hill
Jan Leike
Edward Hughes
Pushmeet Kohli
Edward Grefenstette
|
+
|
Commonsense mining as knowledge base completion? A study on the impact of novelty
|
2018
|
Stanisław Jastrzębski
Dzmitry Bahdanau
Seyedarian Hosseini
Michael Noukhovitch
Yoshua Bengio
Jackie Chi Kit Cheung
|
+
|
Learning to Understand Goal Specifications by Modelling Reward
|
2018
|
Dzmitry Bahdanau
Felix Hill
Jan Leike
Edward Hughes
Arian Hosseini
Pushmeet Kohli
Edward Grefenstette
|
+
|
Commonsense mining as knowledge base completion? A study on the impact of novelty
|
2018
|
Stanisław Jastrzębski
Dzmitry Bahdanau
Seyedarian Hosseini
Michael Noukhovitch
Yoshua Bengio
Jackie Chi Kit Cheung
|
+
|
BabyAI: A Platform to Study the Sample Efficiency of Grounded Language Learning
|
2018
|
Maxime Chevalier-Boisvert
Dzmitry Bahdanau
Salem Lahlou
Lucas Willems
Chitwan Saharia
Thien Huu Nguyen
Yoshua Bengio
|
+
|
Systematic Generalization: What Is Required and Can It Be Learned?
|
2018
|
Dzmitry Bahdanau
Shikhar Murty
Michael Noukhovitch
Thien Huu Nguyen
Harm de Vries
Aaron Courville
|
+
|
Commonsense mining as knowledge base completion? A study on the impact of novelty
|
2018
|
Stanisław Jastrzębski
Dzmitry Bahdanau
Seyedarian Hosseini
Michael Noukhovitch
Yoshua Bengio
Jackie Chi Kit Cheung
|
+
|
Learning to Compute Word Embeddings On the Fly
|
2017
|
Dzmitry Bahdanau
Tom Bosc
Stanisław Jastrzębski
Edward Grefenstette
Pascal Vincent
Yoshua Bengio
|
+
|
An Actor-Critic Algorithm for Sequence Prediction
|
2016
|
Dzmitry Bahdanau
Philémon Brakel
Kelvin Xu
Anirudh Goyal
Ryan Lowe
Joëlle Pineau
Aaron Courville
Yoshua Bengio
|
+
|
An Actor-Critic Algorithm for Structured Prediction
|
2016
|
Dzmitry Bahdanau
Philémon Brakel
Kelvin Xu
Anirudh Goyal
Ryan Lowe
Joëlle Pineau
Aaron Courville
Yoshua Bengio
|
+
|
End-to-end attention-based large vocabulary speech recognition
|
2016
|
Dzmitry Bahdanau
Jan Chorowski
Dmitriy Serdyuk
Philémon Brakel
Yoshua Bengio
|
+
|
Theano: A Python framework for fast computation of mathematical expressions
|
2016
|
The Theano Development Team
Rami Al-Rfou
Guillaume Alain
Amjad Almahairi
Christof Angermueller
Dzmitry Bahdanau
Nicolas Ballas
Frédéric Bastien
Justin Bayer
Anatoly Belikov
|
+
|
Sequence Tutor: Conservative Fine-Tuning of Sequence Generation Models with KL-control
|
2016
|
Natasha Jaques
Shixiang Gu
Dzmitry Bahdanau
José Miguel Hernández-Lobato
Richard E. Turner
Douglas Eck
|
+
|
An Actor-Critic Algorithm for Sequence Prediction
|
2016
|
Dzmitry Bahdanau
Philémon Brakel
Kelvin Xu
Anirudh Goyal
Ryan Lowe
Joëlle Pineau
Aaron Courville
Yoshua Bengio
|
+
|
End-to-End Attention-based Large Vocabulary Speech Recognition
|
2015
|
Dzmitry Bahdanau
Jan Chorowski
Dmitriy Serdyuk
Philémon Brakel
Yoshua Bengio
|
+
|
Blocks and Fuel: Frameworks for deep learning
|
2015
|
Bart van Merriënboer
Dzmitry Bahdanau
Vincent Dumoulin
Dmitriy Serdyuk
David Warde-Farley
Jan Chorowski
Yoshua Bengio
|
+
|
Task Loss Estimation for Sequence Prediction
|
2015
|
Dzmitry Bahdanau
Dmitriy Serdyuk
Philémon Brakel
Nan Rosemary Ke
Jan Chorowski
Aaron Courville
Yoshua Bengio
|
+
|
Neural Machine Translation by Jointly Learning to Align and Translate
|
2015
|
Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
|
+
|
End-to-End Attention-based Large Vocabulary Speech Recognition
|
2015
|
Dzmitry Bahdanau
Jan Chorowski
Dmitriy Serdyuk
Philémon Brakel
Yoshua Bengio
|
+
|
Blocks and Fuel: Frameworks for deep learning
|
2015
|
Bart van Merriënboer
Dzmitry Bahdanau
Vincent Dumoulin
Dmitriy Serdyuk
David Warde-Farley
Jan Chorowski
Yoshua Bengio
|
+
|
Attention-Based Models for Speech Recognition
|
2015
|
Jan Chorowski
Dzmitry Bahdanau
Dmitriy Serdyuk
Kyunghyun Cho
Yoshua Bengio
|
+
|
Overcoming the Curse of Sentence Length for Neural Machine Translation using Automatic Segmentation
|
2014
|
Jean Pouget-Abadie
Dzmitry Bahdanau
Bart van Merriënboer
Kyunghyun Cho
Yoshua Bengio
|
+
|
Neural Machine Translation by Jointly Learning to Align and Translate
|
2014
|
Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
|
+
|
End-to-end Continuous Speech Recognition using Attention-based Recurrent NN: First Results
|
2014
|
Jan Chorowski
Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
|
+
|
Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation
|
2014
|
Kyunghyun Cho
Bart van Merriënboer
Çağlar Gülçehre
Dzmitry Bahdanau
Fethi Bougares
Holger Schwenk
Yoshua Bengio
|
+
|
On the Properties of Neural Machine Translation: Encoder-Decoder Approaches
|
2014
|
Kyunghyun Cho
Bart van Merriënboer
Dzmitry Bahdanau
Yoshua Bengio
|
+
|
Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation
|
2014
|
Kyunghyun Cho
Bart van Merriënboer
Çağlar Gülçehre
Dzmitry Bahdanau
Fethi Bougares
Holger Schwenk
Yoshua Bengio
|
+
|
On the Properties of Neural Machine Translation: Encoder–Decoder Approaches
|
2014
|
Kyunghyun Cho
Bart van Merriënboer
Dzmitry Bahdanau
Yoshua Bengio
|
+
|
Overcoming the Curse of Sentence Length for Neural Machine Translation using Automatic Segmentation
|
2014
|
Jean Pouget-Abadie
Dzmitry Bahdanau
Bart van Merriënboer
Kyunghyun Cho
Yoshua Bengio
|
+
|
Neural Machine Translation by Jointly Learning to Align and Translate
|
2014
|
Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
|
+
|
Overcoming the Curse of Sentence Length for Neural Machine Translation using Automatic Segmentation
|
2014
|
Jean Pouget-Abadie
Dzmitry Bahdanau
Bart van Merriënboer
Kyunghyun Cho
Yoshua Bengio
|