Jianlong Fu

Follow

Generating author description...

All published works
Action Title Year Authors
+ PDF Chat CogACT: A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation 2024 Qixiu Li
Yaobo Liang
Zeyu Wang
Lin Luo
Xi Chen
M. Liao
Fangyun Wei
Yu Deng
Sicheng Xu
Yizhong Zhang
+ PDF Chat Zero-Reference Low-Light Enhancement via Physical Quadruple Priors 2024 Wenjing Wang
Huan Yang
Jianlong Fu
Jiaying Liu
+ PDF Chat Multi-task Manipulation Policy Modeling with Visuomotor Latent Diffusion 2024 Wenhui Tan
Bei Liu
Junbo Zhang
Ruihua Song
Jianlong Fu
+ PDF Chat Spatiotemporal Predictive Pre-training for Robotic Motor Control 2024 Jiange Yang
Bei Liu
Jianlong Fu
Bocheng Pan
Gangshan Wu
Limin Wang
+ Learning Position-Aware Implicit Neural Network for Real-World Face Inpainting 2024 Bo Zhao
Huan Yang
Jianlong Fu
+ MobileVidFactory: Automatic Diffusion-Based Social Media Video Generation for Mobile Devices from Text 2023 Junchen Zhu
Huan Yang
Wenjing Wang
Huiguo He
Zixi Tuo
Yongsheng Yu
Wen-Huang Cheng
Lianli Gao
Jingkuan Song
Jianlong Fu
+ MovieFactory: Automatic Movie Creation from Text using Large Generative Models for Language and Images 2023 Junchen Zhu
Huan Yang
Huiguo He
Wenjing Wang
Zixi Tuo
Wen-Huang Cheng
Lianli Gao
Jingkuan Song
Jianlong Fu
+ PDF Chat Learning Profitable NFT Image Diffusions via Multiple Visual-Policy Guided Reinforcement Learning 2023 Huiguo He
Tianfu Wang
Huan Yang
Jianlong Fu
Nicholas Jing Yuan
Jian Yin
Hongyang Chao
Qi Zhang
+ PDF Chat Learning Data-Driven Vector-Quantized Degradation Model for Animation Video Super-Resolution 2023 Zixi Tuo
Huan Yang
Jianlong Fu
Yujie Dun
Xueming Qian
+ PDF Chat Improving Diversity in Zero-Shot GAN Adaptation with Semantic Variations 2023 Seogkyu Jeon
Bei Liu
Pilhyeon Lee
Kibeom Hong
Jianlong Fu
Hyeran Byun
+ PDF Chat SINC: Self-Supervised In-Context Learning for Vision-Language Tasks 2023 Yi-Syuan Chen
Yun-Zhu Song
Cheng Yu Yeo
Bei Liu
Jianlong Fu
Hong-Han Shuai
+ PDF Chat MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation 2023 Ludan Ruan
Yiyang Ma
Huan Yang
Huiguo He
Bei Liu
Jianlong Fu
Nicholas Jing Yuan
Qin Jin
Baining Guo
+ PDF Chat Weakly-supervised pre-training for 3D human pose estimation via perspective knowledge 2023 Zhongwei Qiu
Kai Qiu
Jianlong Fu
Dongmei Fu
+ PDF Chat Language-Guided Face Animation by Recurrent StyleGAN-Based Generator 2023 Tiankai Hang
Huan Yang
Bei Liu
Jianlong Fu
Xin Geng
Baining Guo
+ PDF Chat Fine-Grained Image Style Transfer with Visual Transformers 2023 Jianbo Wang
Huan Yang
Jianlong Fu
Toshihiko Yamasaki
Baining Guo
+ Unified Multi-Modal Latent Diffusion for Joint Subject and Text Conditional Image Generation 2023 Yiyang Ma
Huan Yang
Wenjing Wang
Jianlong Fu
Jiaying Liu
+ Learning Data-Driven Vector-Quantized Degradation Model for Animation Video Super-Resolution 2023 Zixi Tuo
Huan Yang
Jianlong Fu
Yujie Dun
Xueming Qian
+ NUWA-XL: Diffusion over Diffusion for eXtremely Long Video Generation 2023 Shengming Yin
Chenfei Wu
Huan Yang
Jianfeng Wang
Xiaodong Wang
Minheng Ni
Zhengyuan Yang
Linjie Li
Shuguang Liu
Fan Yang
+ VideoFactory: Swap Attention in Spatiotemporal Diffusions for Text-to-Video Generation 2023 Wenjing Wang
Huan Yang
Zixi Tuo
Huiguo He
Junchen Zhu
Jianlong Fu
Jiaying Liu
+ Solving Diffusion ODEs with Optimal Boundary Conditions for Better Image Super-Resolution 2023 Yiyang Ma
Huan Yang
Wenhan Yang
Jianlong Fu
Jiaying Liu
+ AlphaBlock: Embodied Finetuning for Vision-Language Reasoning in Robot Manipulation 2023 Chuhao Jin
Wenhui Tan
Jiange Yang
Bei Liu
Ruihua Song
Limin Wang
Jianlong Fu
+ Transferring Foundation Models for Generalizable Robotic Manipulation 2023 Jiange Yang
Wenhui Tan
Chuhao Jin
Bei Liu
Jianlong Fu
Ruihua Song
Limin Wang
+ MovieFactory: Automatic Movie Creation from Text using Large Generative Models for Language and Images 2023 Junchen Zhu
Huan Yang
Huiguo He
Wenjing Wang
Zixi Tuo
Wen-Huang Cheng
Lianli Gao
Jingkuan Song
Jianlong Fu
+ Learning Profitable NFT Image Diffusions via Multiple Visual-Policy Guided Reinforcement Learning 2023 Huiguo He
Tianfu Wang
Huan Yang
Jianlong Fu
Nicholas Jing Yuan
Jian Yin
Hongyang Chao
Qi Zhang
+ SINC: Self-Supervised In-Context Learning for Vision-Language Tasks 2023 Yi-Syuan Chen
Yun-Zhu Song
Cheng Yu Yeo
Bei Liu
Jianlong Fu
Hong-Han Shuai
+ MobileVidFactory: Automatic Diffusion-Based Social Media Video Generation for Mobile Devices from Text 2023 Junchen Zhu
Huan Yang
Wenjing Wang
Huiguo He
Zixi Tuo
Yongsheng Yu
Wen-Huang Cheng
Lianli Gao
Jingkuan Song
Jianlong Fu
+ NUWA-XL: Diffusion over Diffusion for eXtremely Long Video Generation 2023 Shengming Yin
Chenfei Wu
Huan Yang
Jianfeng Wang
Xiaodong Wang
Minheng Ni
Zhengyuan Yang
Linjie Li
Shuguang Liu
Fan Yang
+ PDF Chat TTVFI: Learning Trajectory-Aware Transformer for Video Frame Interpolation 2023 Chengxu Liu
Huan Yang
Jianlong Fu
Xueming Qian
+ PDF Chat 4D LUT: Learnable Context-Aware 4D Lookup Table for Image Enhancement 2023 Chengxu Liu
Huan Yang
Jianlong Fu
Xueming Qian
+ Improving Diversity in Zero-Shot GAN Adaptation with Semantic Variations 2023 Seogkyu Jeon
Bei Liu
Pilhyeon Lee
Kibeom Hong
Jianlong Fu
Hyeran Byun
+ ViCo: Engaging Video Comment Generation with Human Preference Rewards 2023 Yuchong Sun
Bei Liu
Xu Chen
Ruihua Song
Jianlong Fu
+ PDF Chat AI Illustrator: Translating Raw Descriptions into Images by Prompt-based Cross-Modal Generation 2022 Yiyang Ma
Huan Yang
Bei Liu
Jianlong Fu
Jiaying Liu
+ PDF Chat Advancing High-Resolution Video-Language Representation with Large-Scale Video Transcriptions 2022 Hongwei Xue
Tiankai Hang
Yanhong Zeng
Yuchong Sun
Bei Liu
Huan Yang
Jianlong Fu
Baining Guo
+ PDF Chat MiniViT: Compressing Vision Transformers with Weight Multiplexing 2022 Jinnian Zhang
Houwen Peng
Kan Wu
Mengchen Liu
Bin Xiao
Jianlong Fu
Lu Yuan
+ PDF Chat Learning Trajectory-Aware Transformer for Video Super-Resolution 2022 Chengxu Liu
Huan Yang
Jianlong Fu
Xueming Qian
+ PDF Chat Aggregated Contextual Transformations for High-Resolution Image Inpainting 2022 Yanhong Zeng
Jianlong Fu
Hongyang Chao
Baining Guo
+ PDF Chat Cyclic Differentiable Architecture Search 2022 Hongyuan Yu
Houwen Peng
Yan Huang
Jianlong Fu
Hao Du
Liang Wang
Haibin Ling
+ Learning Trajectory-Aware Transformer for Video Super-Resolution 2022 Chengxu Liu
Huan Yang
Jianlong Fu
Xueming Qian
+ MiniViT: Compressing Vision Transformers with Weight Multiplexing 2022 Jinnian Zhang
Houwen Peng
Kan Wu
Mengchen Liu
Bin Xiao
Jianlong Fu
Lü Yuan
+ Degradation-Guided Meta-Restoration Network for Blind Super-Resolution 2022 Fuzhi Yang
Huan Yang
Yanhong Zeng
Jianlong Fu
Hongtao Lu
+ TTVFI: Learning Trajectory-Aware Transformer for Video Frame Interpolation 2022 Chengxu Liu
Huan Yang
Jianlong Fu
Xueming Qian
+ TinyViT: Fast Pretraining Distillation for Small Vision Transformers 2022 Kan Wu
Jinnian Zhang
Houwen Peng
Mengchen Liu
Bin Xiao
Jianlong Fu
Lu Yuan
+ Expanding Language-Image Pretrained Models for General Video Recognition 2022 Bolin Ni
Houwen Peng
Minghao Chen
Songyang Zhang
Gaofeng Meng
Jianlong Fu
Shiming Xiang
Haibin Ling
+ Exploring Anchor-based Detection for Ego4D Natural Language Query 2022 Sipeng Zheng
Qi Zhang
Bei Liu
Qin Jin
Jianlong Fu
+ Language-Guided Face Animation by Recurrent StyleGAN-based Generator 2022 Tiankai Hang
Huan Yang
Bei Liu
Jianlong Fu
Xin Geng
Baining Guo
+ CLIP-ViP: Adapting Pre-trained Image-Text Model to Video-Language Representation Alignment 2022 Xue Hong-wei
Yuchong Sun
Bei Liu
Jianlong Fu
Ruihua Song
Houqiang Li
Jiebo Luo
+ 4D LUT: Learnable Context-Aware 4D Lookup Table for Image Enhancement 2022 Chengxu Liu
Huan Yang
Jianlong Fu
Xueming Qian
+ Learning Spatiotemporal Frequency-Transformer for Compressed Video Super-Resolution 2022 Zhongwei Qiu
Huan Yang
Jianlong Fu
Dongmei Fu
+ GRIT-VLP: Grouped Mini-batch Sampling for Efficient Vision and Language Pre-training 2022 Jaeseok Byun
Taebaek Hwang
Jianlong Fu
Taesup Moon
+ Fine-Grained Image Style Transfer with Visual Transformers 2022 Jianbo Wang
Huan Yang
Jianlong Fu
Toshihiko Yamasaki
Baining Guo
+ Long-Form Video-Language Pre-Training with Multimodal Temporal Contrastive Learning 2022 Yuchong Sun
Xue Hong-wei
Ruihua Song
Bei Liu
Huan Yang
Jianlong Fu
+ Weakly-supervised Pre-training for 3D Human Pose Estimation via Perspective Knowledge 2022 Zhongwei Qiu
Kai Qiu
Jianlong Fu
Dongmei Fu
+ MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation 2022 Ludan Ruan
Yiyang Ma
Huan Yang
Huiguo He
Bei Liu
Jianlong Fu
Nicholas Jing Yuan
Qin Jin
Baining Guo
+ PDF Chat Expanding Language-Image Pretrained Models for General Video Recognition 2022 Bolin Ni
Houwen Peng
Minghao Chen
Songyang Zhang
Gaofeng Meng
Jianlong Fu
Shiming Xiang
Haibin Ling
+ PDF Chat TinyViT: Fast Pretraining Distillation for Small Vision Transformers 2022 Kan Wu
Jinnian Zhang
Houwen Peng
Mengchen Liu
Bin Xiao
Jianlong Fu
Lu Yuan
+ PDF Chat GRIT-VLP: Grouped Mini-batch Sampling for Efficient Vision and Language Pre-training 2022 Jaeseok Byun
Taebaek Hwang
Jianlong Fu
Taesup Moon
+ PDF Chat Learning Spatiotemporal Frequency-Transformer for Compressed Video Super-Resolution 2022 Zhongwei Qiu
Huan Yang
Jianlong Fu
Dongmei Fu
+ Learning Spatiotemporal Frequency-Transformer for Low-Quality Video Super-Resolution 2022 Zhongwei Qiu
Huan Yang
Jianlong Fu
Daochang Liu
Chang Xu
Dongmei Fu
+ Searching the Search Space of Vision Transformer 2021 Minghao Chen
Kan Wu
Bolin Ni
Houwen Peng
Bei Liu
Jianlong Fu
Hongyang Chao
Haibin Ling
+ PDF Chat Searching the Search Space of Vision Transformer 2021 Minghao Chen
Kan Wu
Bolin Ni
Houwen Peng
Bei Liu
Jianlong Fu
Hongyang Chao
Haibin Ling
+ PDF Chat Improving Visual Quality of Image Synthesis by A Token-based Generator with Transformers 2021 Yanhong Zeng
Huan Yang
Hongyang Chao
Jianbo Wang
Jianlong Fu
+ A Picture is Worth a Thousand Words: A Unified System for Diverse Captions and Rich Images Generation 2021 Yupan Huang
Bei Liu
Jianlong Fu
Yutong Lu
+ PDF Chat A Picture is Worth a Thousand Words 2021 Yupan Huang
Bei Liu
Jianlong Fu
Yutong Lu
+ PDF Chat Learning Fine-Grained Motion Embedding for Landscape Animation 2021 Hongwei Xue
Bei Liu
Huan Yang
Jianlong Fu
Houqiang Li
Jiebo Luo
+ PDF Chat A Picture is Worth a Thousand Words 2021 Yupan Huang
Bei Liu
Jianlong Fu
Yutong Lu
+ PDF Chat Learning Conditional Knowledge Distillation for Degraded-Reference Image Quality Assessment 2021 Heliang Zheng
Huan Yang
Jianlong Fu
Zheng-Jun Zha
Jiebo Luo
+ PDF Chat Rethinking and Improving Relative Position Encoding for Vision Transformer 2021 Kan Wu
Houwen Peng
Minghao Chen
Jianlong Fu
Hongyang Chao
+ PDF Chat AutoFormer: Searching Transformers for Visual Recognition 2021 Minghao Chen
Houwen Peng
Jianlong Fu
Haibin Ling
+ PDF Chat Domain-Aware Universal Style Transfer 2021 Kibeom Hong
Seogkyu Jeon
Huan Yang
Jianlong Fu
Hyeran Byun
+ PDF Chat Learning Spatio-Temporal Transformer for Visual Tracking 2021 Bin Yan
Houwen Peng
Jianlong Fu
Dong Wang
Huchuan Lu
+ PDF Chat LightTrack: Finding Lightweight Neural Networks for Object Tracking via One-Shot Architecture Search 2021 Bin Yan
Houwen Peng
Kan Wu
Dong Wang
Jianlong Fu
Huchuan Lu
+ PDF Chat One-Shot Neural Ensemble Architecture Search by Diversity-Guided Search Space Shrinking 2021 Minghao Chen
Jianlong Fu
Haibin Ling
+ PDF Chat Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation Learning 2021 Zhicheng Huang
Zhaoyang Zeng
Yupan Huang
Bei Liu
Dongmei Fu
Jianlong Fu
+ Learning Spatio-Temporal Transformer for Visual Tracking. 2021 Bin Yan
Houwen Peng
Jianlong Fu
Dong Wang
Huchuan Lu
+ One-Shot Neural Ensemble Architecture Search by Diversity-Guided Search Space Shrinking 2021 Minghao Chen
Houwen Peng
Jianlong Fu
Haibin Ling
+ Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation Learning 2021 Zhicheng Huang
Zhaoyang Zeng
Yupan Huang
Bei Liu
Dongmei Fu
Jianlong Fu
+ LightTrack: Finding Lightweight Neural Networks for Object Tracking via One-Shot Architecture Search 2021 Bin Yan
Houwen Peng
Kan Wu
Dong Wang
Jianlong Fu
Huchuan Lu
+ AutoFormer: Searching Transformers for Visual Recognition 2021 Minghao Chen
Houwen Peng
Jianlong Fu
Haibin Ling
+ Probing Inter-modality: Visual Parsing with Self-Attention for Vision-Language Pre-training 2021 Hongwei Xue
Yupan Huang
Bei Liu
Houwen Peng
Jianlong Fu
Houqiang Li
Jiebo Luo
+ PDF Chat Reference-Based Defect Detection Network 2021 Zhaoyang Zeng
Bei Liu
Jianlong Fu
Hongyang Chao
+ Rethinking and Improving Relative Position Encoding for Vision Transformer 2021 Kan Wu
Houwen Peng
Minghao Chen
Jianlong Fu
Hongyang Chao
+ Learning Conditional Knowledge Distillation for Degraded-Reference Image Quality Assessment 2021 Heliang Zheng
Huan Yang
Jianlong Fu
Zheng-Jun Zha
Jiebo Luo
+ Domain-Aware Universal Style Transfer 2021 Kibeom Hong
Seogkyu Jeon
Huan Yang
Jianlong Fu
Hyeran Byun
+ Learning Fine-Grained Motion Embedding for Landscape Animation 2021 Hongwei Xue
Bei Liu
Huan Yang
Jianlong Fu
Houqiang Li
Jiebo Luo
+ Learning Spatio-Temporal Transformer for Visual Tracking 2021 Bin Yan
Houwen Peng
Jianlong Fu
Dong Wang
Huchuan Lu
+ Improving Visual Quality of Image Synthesis by A Token-based Generator with Transformers 2021 Yanhong Zeng
Huan Yang
Hongyang Chao
Jianbo Wang
Jianlong Fu
+ Searching the Search Space of Vision Transformer 2021 Minghao Chen
Kan Wu
Bolin Ni
Houwen Peng
Bei Liu
Jianlong Fu
Hongyang Chao
Haibin Ling
+ Advancing High-Resolution Video-Language Representation with Large-Scale Video Transcriptions 2021 Hongwei Xue
Tiankai Hang
Yanhong Zeng
Yuchong Sun
Bei Liu
Huan Yang
Jianlong Fu
Baining Guo
+ Exact weak bosonic zero modes in a spin/fermion chain 2021 Jianlong Fu
+ Aggregated Contextual Transformations for High-Resolution Image Inpainting 2021 Yanhong Zeng
Jianlong Fu
Hongyang Chao
Baining Guo
+ PDF Chat NTIRE 2020 Challenge on Perceptual Extreme Super-Resolution: Methods and Results 2020 Kai Zhang
Shuhang Gu
Radu Timofte
Taizhang Shang
Qiuju Dai
Shengchen Zhu
Tong Yang
Yandong Guo
Younghyun Jo
Sejong Yang
+ PDF Chat Learning Texture Transformer Network for Image Super-Resolution 2020 Fuzhi Yang
Huan Yang
Jianlong Fu
Hongtao Lu
Baining Guo
+ Learning 2D Temporal Adjacent Networks for Moment Localization with Natural Language 2020 Songyang Zhang
Houwen Peng
Jianlong Fu
Jiebo Luo
+ PDF Chat 360-Indoor: Towards Learning Real-World Objects in 360° Indoor Equirectangular Images 2020 Shih-Han Chou
Cheng Sun
Wen‐Yen Chang
Wan‐Ting Hsu
Min Sun
Jianlong Fu
+ Pixel-BERT: Aligning Image Pixels with Text by Deep Multi-Modal Transformers 2020 Zhicheng Huang
Zhaoyang Zeng
Bei Liu
Dongmei Fu
Jianlong Fu
+ NTIRE 2020 Challenge on Perceptual Extreme Super-Resolution: Methods and Results 2020 Kai Zhang
Shuhang Gu
Radu Timofte
Taizhang Shang
Qiuju Dai
Shengchen Zhu
Tong Yang
Yandong Guo
Younghyun Jo
Sejong Yang
+ Learning Texture Transformer Network for Image Super-Resolution 2020 Fuzhi Yang
Huan Yang
Jianlong Fu
Hongtao Lu
Baining Guo
+ Ocean: Object-aware Anchor-free Tracking 2020 Zhipeng Zhang
Houwen Peng
Jianlong Fu
Bing Li
Weiming Hu
+ Learning Joint Spatial-Temporal Transformations for Video Inpainting 2020 Yanhong Zeng
Jianlong Fu
Hongyang Chao
+ PDF Chat Revisiting Anchor Mechanisms for Temporal Action Localization 2020 Le Yang
Houwen Peng
Dingwen Zhang
Jianlong Fu
Junwei Han
+ PDF Chat Learning Joint Spatial-Temporal Transformations for Video Inpainting 2020 Yanhong Zeng
Jianlong Fu
Hongyang Chao
+ Cream of the Crop: Distilling Prioritized Paths For One-Shot Neural Architecture Search 2020 Houwen Peng
Hao Du
Hongyuan Yu
Qi Li
Jing Liao
Jianlong Fu
+ PDF Chat Ocean: Object-Aware Anchor-Free Tracking 2020 Zhipeng Zhang
Houwen Peng
Jianlong Fu
Bing Li
Weiming Hu
+ Multi-Scale 2D Temporal Adjacent Networks for Moment Localization with Natural Language 2020 Songyang Zhang
Houwen Peng
Jianlong Fu
Yijuan Lu
Jiebo Luo
+ PDF Chat Learn to Scale: Generating Multipolar Normalized Density Maps for Crowd Counting 2019 Chenfeng Xu
Kai Qiu
Jianlong Fu
Song Bai
Yongchao Xu
Xiang Bai
+ PDF Chat WSOD2: Learning Bottom-Up and Top-Down Objectness Distillation for Weakly-Supervised Object Detection 2019 Zhaoyang Zeng
Bei Liu
Jianlong Fu
Hongyang Chao
Lei Zhang
+ From Words to Sentences: A Progressive Learning Approach for Zero-resource Machine Translation with Visual Pivots 2019 Shizhe Chen
Qin Jin
Jianlong Fu
+ PDF Chat Learning Pyramid-Context Encoder Network for High-Quality Image Inpainting 2019 Yanhong Zeng
Jianlong Fu
Hongyang Chao
Baining Guo
+ PDF Chat Looking for the Devil in the Details: Learning Trilinear Attention Sampling Network for Fine-Grained Image Recognition 2019 Heliang Zheng
Jianlong Fu
Zheng-Jun Zha
Jiebo Luo
+ Learning Pyramid-Context Encoder Network for High-Quality Image Inpainting 2019 Yanhong Zeng
Jianlong Fu
Hongyang Chao
Baining Guo
+ Looking for the Devil in the Details: Learning Trilinear Attention Sampling Network for Fine-grained Image Recognition 2019 Heliang Zheng
Jianlong Fu
Zheng-Jun Zha
Jiebo Luo
+ From Words to Sentences: A Progressive Learning Approach for Zero-resource Machine Translation with Visual Pivots 2019 Shizhe Chen
Qin Jin
Jianlong Fu
+ Activitynet 2019 Task 3: Exploring Contexts for Dense Captioning Events in Videos 2019 Shizhe Chen
Yuqing Song
Yida Zhao
Qin Jin
Zhaoyang Zeng
Bei Liu
Jianlong Fu
Alexander G. Hauptmann
+ Learn to Scale: Generating Multipolar Normalized Density Maps for Crowd Counting 2019 Chenfeng Xu
Kai Qiu
Jianlong Fu
Song Bai
Yongchao Xu
Xiang Bai
+ WSOD^2: Learning Bottom-up and Top-down Objectness Distillation for Weakly-supervised Object Detection 2019 Zhaoyang Zeng
Bei Liu
Jianlong Fu
Hongyang Chao
Lei Zhang
+ 360-Indoor: Towards Learning Real-World Objects in 360° Indoor Equirectangular Images 2019 Shih-Han Chou
Cheng Sun
Wen‐Yen Chang
Wan‐Ting Hsu
Min Sun
Jianlong Fu
+ Learning Rich Image Region Representation for Visual Question Answering 2019 Bei Liu
Zhicheng Huang
Zhaoyang Zeng
Zheyu Chen
Jianlong Fu
+ Learning Deep Bilinear Transformation for Fine-grained Image Representation 2019 Heliang Zheng
Jianlong Fu
Zheng-Jun Zha
Jiebo Luo
+ Neural Storyboard Artist: Visualizing Stories with Coherent Image Sequences 2019 Shizhe Chen
Bei Liu
Jianlong Fu
Ruihua Song
Qin Jin
Pingping Lin
Xiaoyu Qi
Chunting Wang
Jin Zhou
+ Learning 2D Temporal Adjacent Networks for Moment Localization with Natural Language 2019 Songyang Zhang
Houwen Peng
Jianlong Fu
Jiebo Luo
+ Learning Sparse 2D Temporal Adjacent Networks for Temporal Action Localization 2019 Songyang Zhang
Houwen Peng
Le Yang
Jianlong Fu
Jiebo Luo
+ PDF Chat Beyond Narrative Description: Generating Poetry from Images by Multi-Adversarial Training 2018 Bei Liu
Jianlong Fu
Makoto P. Kato
Masatoshi Yoshikawa
+ PDF Chat Beyond Narrative Description 2018 Bei Liu
Jianlong Fu
Makoto P. Kato
Masatoshi Yoshikawa
+ PDF Chat Self-View Grounding Given a Narrated 360° Video 2018 Shih-Han Chou
Yi‐Chun Chen
Kuo-Hao Zeng
Hou-Ning Hu
Jianlong Fu
Min Sun
+ PDF Chat Beyond Narrative Description: Generating Poetry from Images by Multi-Adversarial Training 2018 Bei Liu
Jianlong Fu
Makoto P. Kato
Masatoshi Yoshikawa
+ Tell-and-Answer: Towards Explainable Visual Question Answering using Attributes and Captions 2018 Qing Li
Jianlong Fu
Dongfei Yu
Tao Mei
Jiebo Luo
+ DA-GAN: Instance-level Image Translation by Deep Attention Generative Adversarial Networks (with Supplementary Materials) 2018 Shuang Ma
Jianlong Fu
Chang Wen Chen
Tao Mei
+ Image Inspired Poetry Generation in XiaoIce 2018 Wen‐Feng Cheng
Chao-Chung Wu
Ruihua Song
Jianlong Fu
Xing Xie
Jian‐Yun Nie
+ PDF Chat Tell-and-Answer: Towards Explainable Visual Question Answering using Attributes and Captions 2018 Qing Li
Jianlong Fu
Dongfei Yu
Tao Mei
Jiebo Luo
+ PDF Chat 3D Human Body Reshaping with Anthropometric Modeling 2018 Yanhong Zeng
Jianlong Fu
Hongyang Chao
+ PDF Chat Show, Adapt and Tell: Adversarial Training of Cross-Domain Image Captioner 2017 Tseng-Hung Chen
Yuan-Hong Liao
Ching-Yao Chuang
Wan‐Ting Hsu
Jianlong Fu
Min Sun
+ Show, Adapt and Tell: Adversarial Training of Cross-domain Image Captioner 2017 Tseng-Hung Chen
Yuan-Hong Liao
Ching-Yao Chuang
Wan‐Ting Hsu
Jianlong Fu
Min Sun
+ Self-view Grounding Given a Narrated 360° Video 2017 Shih-Han Chou
Yi-Chun Chen
Kuo-Hao Zeng
Hou-Ning Hu
Jianlong Fu
Min Sun
+ Storytelling of Photo Stream with Bidirectional Multi-thread Recurrent Neural Network 2016 Yu Liu
Jianlong Fu
Tao Mei
Chang Wen Chen
Common Coauthors
Commonly Cited References
Action Title Year Authors # of times referenced
+ PDF Chat Deep Residual Learning for Image Recognition 2016 Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
48
+ Very Deep Convolutional Networks for Large-Scale Image Recognition 2014 Karen Simonyan
Andrew Zisserman
20
+ PDF Chat Learning Texture Transformer Network for Image Super-Resolution 2020 Fuzhi Yang
Huan Yang
Jianlong Fu
Hongtao Lu
Baining Guo
15
+ Attention Is All You Need 2017 Ashish Vaswani
Noam Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan N. Gomez
Łukasz Kaiser
Illia Polosukhin
14
+ Attention is All you Need 2017 Ashish Vaswani
Noam Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan N. Gomez
Łukasz Kaiser
Illia Polosukhin
14
+ PDF Chat Deep visual-semantic alignments for generating image descriptions 2015 Andrej Karpathy
Li Fei-Fei
13
+ PDF Chat Perceptual Losses for Real-Time Style Transfer and Super-Resolution 2016 Justin Johnson
Alexandre Alahi
Li Fei-Fei
12
+ PDF Chat The Unreasonable Effectiveness of Deep Features as a Perceptual Metric 2018 Richard Zhang
Phillip Isola
Alexei A. Efros
Eli Shechtman
Oliver Wang
12
+ EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks 2019 Mingxing Tan
Quoc V. Le
12
+ PDF Chat Aggregated Residual Transformations for Deep Neural Networks 2017 Saining Xie
Ross Girshick
Piotr Dollár
Zhuowen Tu
Kaiming He
11
+ BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding 2018 Jacob Devlin
Ming‐Wei Chang
Kenton Lee
Kristina Toutanova
11
+ PDF Chat ImageNet Large Scale Visual Recognition Challenge 2015 Olga Russakovsky
Jia Deng
Hao Su
Jonathan Krause
Sanjeev Satheesh
Sean Ma
Zhiheng Huang
Andrej Karpathy
Aditya Khosla
Michael S. Bernstein
10
+ Adam: A Method for Stochastic Optimization 2014 Diederik P. Kingma
Jimmy Ba
10
+ Very Deep Convolutional Networks for Large-Scale Image Recognition 2014 Karen Simonyan
Andrew Zisserman
10
+ An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale 2020 Alexey Dosovitskiy
Lucas Beyer
Alexander Kolesnikov
Dirk Weissenborn
Xiaohua Zhai
Thomas Unterthiner
Mostafa Dehghani
Matthias Minderer
Georg Heigold
Sylvain Gelly
10
+ PDF Chat Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering 2018 Peter Anderson
Xiaodong He
Chris Buehler
Damien Teney
Mark Johnson
Stephen Jay Gould
Lei Zhang
10
+ PDF Chat A Style-Based Generator Architecture for Generative Adversarial Networks 2019 Tero Karras
Samuli Laine
Timo Aila
9
+ PDF Chat Show and tell: A neural image caption generator 2015 Oriol Vinyals
Alexander Toshev
Samy Bengio
Dumitru Erhan
9
+ PDF Chat End-to-End Object Detection with Transformers 2020 Nicolas Carion
Francisco Massa
Gabriel Synnaeve
Nicolas Usunier
Alexander Kirillov
Sergey Zagoruyko
9
+ Distilling the Knowledge in a Neural Network 2015 Geoffrey E. Hinton
Oriol Vinyals
Jay B. Dean
8
+ PDF Chat Swin Transformer: Hierarchical Vision Transformer using Shifted Windows 2021 Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng Zhang
Stephen Lin
Baining Guo
8
+ Bilinear Attention Networks 2018 Jin-Hwa Kim
Jae-Hyun Jun
Byoung‐Tak Zhang
8
+ Regularized Evolution for Image Classifier Architecture Search 2019 Esteban Real
Alok Aggarwal
Yanping Huang
Quoc V. Le
8
+ PDF Chat Learning Spatiotemporal Features with 3D Convolutional Networks 2015 Du Tran
Lubomir Bourdev
Rob Fergus
Lorenzo Torresani
Manohar Paluri
8
+ PDF Chat Learning Pyramid-Context Encoder Network for High-Quality Image Inpainting 2019 Yanhong Zeng
Jianlong Fu
Hongyang Chao
Baining Guo
8
+ PDF Chat Single Path One-Shot Neural Architecture Search with Uniform Sampling 2020 Zichao Guo
Xiangyu Zhang
Haoyuan Mu
Wen Heng
Zechun Liu
Yichen Wei
Jian Sun
8
+ PDF Chat Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering 2017 Yash Goyal
Tejas Khot
Douglas Summers-Stay
Dhruv Batra
Devi Parikh
7
+ PDF Chat Non-local Neural Networks 2018 Xiaolong Wang
Ross Girshick
Abhinav Gupta
Kaiming He
7
+ PDF Chat MobileNetV2: Inverted Residuals and Linear Bottlenecks 2018 Mark Sandler
Andrew Howard
Menglong Zhu
Andrey Zhmoginov
Liang-Chieh Chen
7
+ PDF Chat Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset 2017 João Carreira
Andrew Zisserman
7
+ PDF Chat Going deeper with convolutions 2015 Christian Szegedy
Wei Liu
Yangqing Jia
Pierre Sermanet
Scott Reed
Dragomir Anguelov
Dumitru Erhan
Vincent Vanhoucke
Andrew Rabinovich
7
+ Neural Architecture Search with Reinforcement Learning 2016 Barret Zoph
Quoc V. Le
7
+ PDF Chat Image-to-Image Translation with Conditional Adversarial Networks 2017 Phillip Isola
Jun-Yan Zhu
Tinghui Zhou
Alexei A. Efros
7
+ PDF Chat Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks 2016 Shaoqing Ren
Kaiming He
Ross Girshick
Jian Sun
7
+ mixup: Beyond Empirical Risk Minimization 2017 Hongyi Zhang
Moustapha Cissé
Yann Dauphin
David López-Paz
7
+ Once-for-All: Train One Network and Specialize it for Efficient Deployment 2019 Han Cai
Chuang Gan
Tianzhe Wang
Zhekai Zhang
Song Han
7
+ PDF Chat Fast Online Object Tracking and Segmentation: A Unifying Approach 2019 Qiang Wang
Li Zhang
Luca Bertinetto
Weiming Hu
Philip H. S. Torr
6
+ PDF Chat Feature Pyramid Networks for Object Detection 2017 Tsung-Yi Lin
Piotr Dollár
Ross Girshick
Kaiming He
Bharath Hariharan
Serge Belongie
6
+ Searching for MobileNetV3. 2019 Andrew Howard
Mark Sandler
Grace Chu
Liang-Chieh Chen
Bo Chen
Mingxing Tan
Weijun Wang
Yukun Zhu
Ruoming Pang
Vijay Vasudevan
6
+ PDF Chat ESRGAN: Enhanced Super-Resolution Generative Adversarial Networks 2019 Xintao Wang
Ke Yu
Shixiang Wu
Jinjin Gu
Yihao Liu
Chao Dong
Yu Qiao
Chen Change Loy
6
+ PDF Chat GOT-10k: A Large High-Diversity Benchmark for Generic Object Tracking in the Wild 2019 Lianghua Huang
Xin Zhao
Kaiqi Huang
6
+ PDF Chat Auto-DeepLab: Hierarchical Neural Architecture Search for Semantic Image Segmentation 2019 Chenxi Liu
Liang-Chieh Chen
Florian Schroff
Hartwig Adam
Wei Hua
Alan Yuille
Li Fei-Fei
6
+ PDF Chat Deeper and Wider Siamese Networks for Real-Time Visual Tracking 2019 Zhipeng Zhang
Houwen Peng
6
+ Deep Fragment Embeddings for Bidirectional Image Sentence Mapping 2014 Andrej Karpathy
Armand Joulin
Fei Fei F Li
6
+ PDF Chat Image Super-Resolution Using Very Deep Residual Channel Attention Networks 2018 Yulun Zhang
Kunpeng Li
Kai Li
Lichen Wang
Bineng Zhong
Yun Fu
6
+ PDF Chat Learning Multi-domain Convolutional Neural Networks for Visual Tracking 2016 Hyeonseob Nam
Bohyung Han
6
+ A Corpus for Reasoning about Natural Language Grounded in Photographs 2019 Alane Suhr
Stephanie Zhou
Ally Zhang
Iris Zhang
Huajun Bai
Yoav Artzi
6
+ PDF Chat Rethinking the Inception Architecture for Computer Vision 2016 Christian Szegedy
Vincent Vanhoucke
Sergey Ioffe
Jon Shlens
Zbigniew Wojna
6
+ PDF Chat Densely Connected Convolutional Networks 2017 Gao Huang
Zhuang Liu
Laurens van der Maaten
Kilian Q. Weinberger
6
+ PDF Chat Image Inpainting for Irregular Holes Using Partial Convolutions 2018 Guilin Liu
Fitsum A. Reda
Kevin J. Shih
Ting-Chun Wang
Andrew Tao
Bryan Catanzaro
6