+
|
Learning Transferable Visual Models From Natural Language Supervision
|
2021
|
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya Ramesh
Gabriel Goh
Sandhini Agarwal
Girish Sastry
Amanda Askell
Pamela Mishkin
Jack Clark
|
4
|
+
PDF
Chat
|
Foley Music: Learning to Generate Music from Videos
|
2020
|
Chuang Gan
Deng Huang
Peihao Chen
Joshua B. Tenenbaum
Antonio Torralba
|
2
|
+
PDF
Chat
|
Deep Residual Learning for Image Recognition
|
2016
|
Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
|
2
|
+
PDF
Chat
|
Less Is More: Learning Highlight Detection From Video Duration
|
2019
|
Bo Xiong
Yannis Kalantidis
Deepti Ghadiyaram
Kristen Grauman
|
2
|
+
PDF
Chat
|
GestureMap: Supporting Visual Analytics and Quantitative Analysis of Motion Elicitation Data by Learning 2D Embeddings
|
2021
|
Hai Dang
Daniel Buschek
|
2
|
+
PDF
Chat
|
Visually Indicated Sounds
|
2016
|
Andrew Owens
Phillip Isola
Josh H. McDermott
Antonio Torralba
Edward H. Adelson
William T. Freeman
|
2
|
+
PDF
Chat
|
EmbeddingVis: A Visual Analytics Approach to Comparative Network Embedding Inspection
|
2018
|
Quan Li
Kristanto Sean Njotoprawiro
Hammad Haleem
Qiaoan Chen
Chris Yi
Xiaojuan Ma
|
2
|
+
PDF
Chat
|
Text-based editing of talking-head video
|
2019
|
Ohad Fried
Ayush Tewari
Michael Zollhöfer
Adam Finkelstein
Eli Shechtman
Dan B Goldman
Kyle Genova
Zeyu Jin
Christian Theobalt
Maneesh Agrawala
|
2
|
+
PDF
Chat
|
Co-Separating Sounds of Visual Objects
|
2019
|
Ruohan Gao
Kristen Grauman
|
2
|
+
|
On the Opportunities and Risks of Foundation Models
|
2021
|
Rishi Bommasani
Drew A. Hudson
Ehsan Adeli
Russ B. Altman
Simran Arora
Sydney von Arx
Michael S. Bernstein
Jeannette Bohg
Antoine Bosselut
Emma Brunskill
|
2
|
+
|
Embedding Comparator: Visualizing Differences in Global Structure and Local Neighborhoods via Small Multiples
|
2022
|
Angie Boggust
Brandon Carter
Arvind Satyanarayan
|
2
|
+
PDF
Chat
|
Learning to Cut by Watching Movies
|
2021
|
Alejandro Pardo
Fabian Caba Heilbron
Juan LeĂłn AlcĂĄzar
Ali Thabet
Bernard Ghanem
|
2
|
+
|
Emblaze: Illuminating Machine Learning Representations through Interactive Comparison of Embedding Spaces
|
2022
|
Venkatesh Sivaraman
Yiwei Wu
Adam Perer
|
2
|
+
PDF
Chat
|
Learning to Separate Object Sounds by Watching Unlabeled Video
|
2018
|
Ruohan Gao
Rogério Feris
Kristen Grauman
|
2
|
+
|
Audeo: Audio Generation for a Silent Performance Video
|
2020
|
Kun Su
Xiulong Liu
Eli Shlizerman
|
2
|
+
PDF
Chat
|
Generating Visually Aligned Sound From Videos
|
2020
|
Peihao Chen
Yang Zhang
Mingkui Tan
Hongdong Xiao
Deng Huang
Chuang Gan
|
2
|
+
PDF
Chat
|
Learning to Localize Sound Source in Visual Scenes
|
2018
|
Arda Senocak
Tae-Hyun Oh
Junsik Kim
MingâHsuan Yang
In So Kweon
|
2
|
+
PDF
Chat
|
Video2GIF: Automatic Generation of Animated GIFs from Video
|
2016
|
Michael Gygli
Yale Song
Liangliang Cao
|
2
|
+
PDF
Chat
|
Dynamic Word Embeddings for Evolving Semantic Discovery
|
2018
|
Zijun Yao
Yifan Sun
Weicong Ding
Nikhil Rao
Hui Xiong
|
2
|
+
PDF
Chat
|
Visual to Sound: Generating Natural Sound for Videos in the Wild
|
2018
|
Yipin Zhou
Zhaowen Wang
Fang Chen
Trung Bui
Tamara L. Berg
|
2
|
+
PDF
Chat
|
Audio-Visual Event Localization in Unconstrained Videos
|
2018
|
Yapeng Tian
Jing Shi
Bochen Li
Zhiyao Duan
Chenliang Xu
|
2
|
+
PDF
Chat
|
AutoFoley: Artificial Synthesis of Synchronized Sound Tracks for Silent Videos With Deep Learning
|
2020
|
Sanchita Ghose
John J. Prevost
|
2
|
+
|
A Neural Knowledge Language Model
|
2016
|
Sungjin Ahn
Heeyoul Choi
Tanel PĂ€rnamaa
Yoshua Bengio
|
2
|
+
|
Adam: A Method for Stochastic Optimization
|
2014
|
Diederik P. Kingma
Jimmy Ba
|
1
|
+
|
Neural Architecture Search: A Survey
|
2019
|
Thomas Elsken
Jan Hendrik Metzen
Frank Hutter
|
1
|
+
|
Learning style similarity for searching infographics
|
2015
|
Babak Saleh
Mira Dontcheva
Aaron Hertzmann
Zhicheng Liu
|
1
|
+
|
MelGAN: Generative Adversarial Networks for Conditional Waveform Synthesis
|
2019
|
Kundan Kumar
Rithesh Kumar
T. de BoissiĂšre
Lucas Gestin
Wei Zhen Teoh
Jose Sotelo
Alexandre de Brébisson
Yoshua Bengio
Aaron Courville
|
1
|
+
PDF
Chat
|
Billion-Scale Similarity Search with GPUs
|
2019
|
Jeff Johnson
Matthijs Douze
Hervé Jeǔou
|
1
|
+
PDF
Chat
|
From Lost to Found
|
2020
|
Chunyang Chen
Sidong Feng
Zhengyang Liu
Zhenchang Xing
Shengdong Zhao
|
1
|
+
PDF
Chat
|
Generative adversarial networks
|
2020
|
Ian Goodfellow
Jean Pouget-Abadie
Mehdi Mirza
Bing Xu
David Warde-Farley
Sherjil Ozair
Aaron Courville
Yoshua Bengio
|
1
|
+
PDF
Chat
|
Learning icons appearance similarity
|
2018
|
Manuel Lagunas
Elena Garcés
Diego Gutiérrez
|
1
|
+
PDF
Chat
|
Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization
|
2019
|
Ramprasaath R. Selvaraju
Michael Cogswell
Abhishek Das
Ramakrishna Vedantam
Devi Parikh
Dhruv Batra
|
1
|
+
PDF
Chat
|
NIMA: Neural Image Assessment
|
2018
|
Hossein Talebi
Peyman Milanfar
|
1
|
+
|
Zero-Shot Text-to-Image Generation
|
2021
|
Aditya Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
|
1
|
+
|
DiffWave: A Versatile Diffusion Model for Audio Synthesis
|
2020
|
Zhifeng Kong
Wei Ping
Jiaji Huang
Kexin Zhao
Bryan Catanzaro
|
1
|
+
PDF
Chat
|
Learning Personal Style from Few Examples
|
2021
|
David Chuan-En Lin
Nikolas Martelaro
|
1
|
+
PDF
Chat
|
Towards A Process Model for Co-Creating AI Experiences
|
2021
|
Hariharan Subramonyam
Colleen M. Seifert
Eytan Adar
|
1
|
+
|
Dance2Music: Automatic Dance-driven Music Generation
|
2021
|
Gunjan Aggarwal
Devi Parikh
|
1
|
+
PDF
Chat
|
FoleyGAN: Visually Guided Generative Adversarial Network-Based Synchronous Sound Generation in Silent Videos
|
2022
|
Sanchita Ghose
John J. Prevost
|
1
|
+
PDF
Chat
|
A Survey on Automated Fact-Checking
|
2022
|
Zhijiang Guo
Michael Schlichtkrull
Andreas Vlachos
|
1
|
+
PDF
Chat
|
From Who You Know to What You Read: Augmenting Scientific Recommendations with Implicit Social Networks
|
2022
|
Hyeonsu B Kang
RafaĆ Kocielnik
Andrew Head
Jiangjiang Yang
Matt Latzke
Aniket Kittur
Daniel S. Weld
Doug Downey
Jonathan Bragg
|
1
|
+
PDF
Chat
|
PromptChainer: Chaining Large Language Model Prompts through Visual Programming
|
2022
|
Tongshuang Wu
Ellen Jiang
Aaron Donsbach
Jeff Gray
Alejandra Molina
Michael Terry
Carrie J. Cai
|
1
|
+
|
Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language
|
2022
|
Andy Zeng
Adrian Wong
Stefan Welker
Krzysztof ChoromaĆski
Federico Tombari
Aveek Purohit
Michael S. Ryoo
Vikas Sindhwani
Johnny Lee
Vincent Vanhoucke
|
1
|
+
PDF
Chat
|
Augmenting Scientific Creativity with an Analogical Search Engine
|
2022
|
Hyeonsu B Kang
Xin Qian
Tom Hope
Dafna Shahaf
Joel Chan
Aniket Kittur
|
1
|
+
|
Augmenting Scientific Creativity with Retrieval across Knowledge Domains
|
2022
|
Hyeonsu B Kang
Sheshera Mysore
Kevin Huang
Haw-Shiuan Chang
Thorben Prein
Andrew McCallum
Aniket Kittur
Elsa Olivetti
|
1
|
+
|
Emergent Abilities of Large Language Models
|
2022
|
Jason Lee
Yi Tay
Rishi Bommasani
Colin Raffel
Barret Zoph
Sebastian Borgeaud
Dani Yogatama
Maarten Bosma
Denny Zhou
Donald Metzler
|
1
|
+
|
Video-Specific Autoencoders for Exploring, Editing and Transmitting Videos
|
2021
|
Kevin Wang
Deva Ramanan
Aayush Bansal
|
1
|
+
|
Jukebox: A Generative Model for Music
|
2020
|
Prafulla Dhariwal
Heewoo Jun
Christine Payne
Jong Wook Kim
Alec Radford
Ilya Sutskever
|
1
|
+
|
Language Models are Few-Shot Learners
|
2020
|
T. B. Brown
Benjamin F. Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
Prafulla Dhariwal
Arvind Neelakantan
Pranav Shyam
Girish Sastry
Amanda Askell
|
1
|
+
|
AudioGen: Textually Guided Audio Generation
|
2022
|
Felix Kreuk
Gabriel Synnaeve
Adam Polyak
Uriel Singer
Alexandre DĂ©fossez
Jade Copet
Devi Parikh
Yaniv Taigman
Yossi Adi
|
1
|