+
PDF
Chat
|
CogACT: A Foundational Vision-Language-Action Model for Synergizing
Cognition and Action in Robotic Manipulation
|
2024
|
Qixiu Li
Yaobo Liang
Zeyu Wang
Lin Luo
Xi Chen
M. Liao
Fangyun Wei
Yu Deng
Sicheng Xu
Yizhong Zhang
|
+
PDF
Chat
|
Zero-Reference Low-Light Enhancement via Physical Quadruple Priors
|
2024
|
Wenjing Wang
Huan Yang
Jianlong Fu
Jiaying Liu
|
+
PDF
Chat
|
Multi-task Manipulation Policy Modeling with Visuomotor Latent Diffusion
|
2024
|
Wenhui Tan
Bei Liu
Junbo Zhang
Ruihua Song
Jianlong Fu
|
+
PDF
Chat
|
Spatiotemporal Predictive Pre-training for Robotic Motor Control
|
2024
|
Jiange Yang
Bei Liu
Jianlong Fu
Bocheng Pan
Gangshan Wu
Limin Wang
|
+
|
Learning Position-Aware Implicit Neural Network for Real-World Face Inpainting
|
2024
|
Bo Zhao
Huan Yang
Jianlong Fu
|
+
|
MobileVidFactory: Automatic Diffusion-Based Social Media Video Generation for Mobile Devices from Text
|
2023
|
Junchen Zhu
Huan Yang
Wenjing Wang
Huiguo He
Zixi Tuo
Yongsheng Yu
Wen-Huang Cheng
Lianli Gao
Jingkuan Song
Jianlong Fu
|
+
|
MovieFactory: Automatic Movie Creation from Text using Large Generative Models for Language and Images
|
2023
|
Junchen Zhu
Huan Yang
Huiguo He
Wenjing Wang
Zixi Tuo
Wen-Huang Cheng
Lianli Gao
Jingkuan Song
Jianlong Fu
|
+
PDF
Chat
|
Learning Profitable NFT Image Diffusions via Multiple Visual-Policy Guided Reinforcement Learning
|
2023
|
Huiguo He
Tianfu Wang
Huan Yang
Jianlong Fu
Nicholas Jing Yuan
Jian Yin
Hongyang Chao
Qi Zhang
|
+
PDF
Chat
|
Learning Data-Driven Vector-Quantized Degradation Model for Animation Video Super-Resolution
|
2023
|
Zixi Tuo
Huan Yang
Jianlong Fu
Yujie Dun
Xueming Qian
|
+
PDF
Chat
|
Improving Diversity in Zero-Shot GAN Adaptation with Semantic Variations
|
2023
|
Seogkyu Jeon
Bei Liu
Pilhyeon Lee
Kibeom Hong
Jianlong Fu
Hyeran Byun
|
+
PDF
Chat
|
SINC: Self-Supervised In-Context Learning for Vision-Language Tasks
|
2023
|
Yi-Syuan Chen
Yun-Zhu Song
Cheng Yu Yeo
Bei Liu
Jianlong Fu
Hong-Han Shuai
|
+
PDF
Chat
|
MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation
|
2023
|
Ludan Ruan
Yiyang Ma
Huan Yang
Huiguo He
Bei Liu
Jianlong Fu
Nicholas Jing Yuan
Qin Jin
Baining Guo
|
+
PDF
Chat
|
Weakly-supervised pre-training for 3D human pose estimation via perspective knowledge
|
2023
|
Zhongwei Qiu
Kai Qiu
Jianlong Fu
Dongmei Fu
|
+
PDF
Chat
|
Language-Guided Face Animation by Recurrent StyleGAN-Based Generator
|
2023
|
Tiankai Hang
Huan Yang
Bei Liu
Jianlong Fu
Xin Geng
Baining Guo
|
+
PDF
Chat
|
Fine-Grained Image Style Transfer with Visual Transformers
|
2023
|
Jianbo Wang
Huan Yang
Jianlong Fu
Toshihiko Yamasaki
Baining Guo
|
+
|
Unified Multi-Modal Latent Diffusion for Joint Subject and Text Conditional Image Generation
|
2023
|
Yiyang Ma
Huan Yang
Wenjing Wang
Jianlong Fu
Jiaying Liu
|
+
|
Learning Data-Driven Vector-Quantized Degradation Model for Animation Video Super-Resolution
|
2023
|
Zixi Tuo
Huan Yang
Jianlong Fu
Yujie Dun
Xueming Qian
|
+
|
NUWA-XL: Diffusion over Diffusion for eXtremely Long Video Generation
|
2023
|
Shengming Yin
Chenfei Wu
Huan Yang
Jianfeng Wang
Xiaodong Wang
Minheng Ni
Zhengyuan Yang
Linjie Li
Shuguang Liu
Fan Yang
|
+
|
VideoFactory: Swap Attention in Spatiotemporal Diffusions for Text-to-Video Generation
|
2023
|
Wenjing Wang
Huan Yang
Zixi Tuo
Huiguo He
Junchen Zhu
Jianlong Fu
Jiaying Liu
|
+
|
Solving Diffusion ODEs with Optimal Boundary Conditions for Better Image Super-Resolution
|
2023
|
Yiyang Ma
Huan Yang
Wenhan Yang
Jianlong Fu
Jiaying Liu
|
+
|
AlphaBlock: Embodied Finetuning for Vision-Language Reasoning in Robot Manipulation
|
2023
|
Chuhao Jin
Wenhui Tan
Jiange Yang
Bei Liu
Ruihua Song
Limin Wang
Jianlong Fu
|
+
|
Transferring Foundation Models for Generalizable Robotic Manipulation
|
2023
|
Jiange Yang
Wenhui Tan
Chuhao Jin
Bei Liu
Jianlong Fu
Ruihua Song
Limin Wang
|
+
|
MovieFactory: Automatic Movie Creation from Text using Large Generative Models for Language and Images
|
2023
|
Junchen Zhu
Huan Yang
Huiguo He
Wenjing Wang
Zixi Tuo
Wen-Huang Cheng
Lianli Gao
Jingkuan Song
Jianlong Fu
|
+
|
Learning Profitable NFT Image Diffusions via Multiple Visual-Policy Guided Reinforcement Learning
|
2023
|
Huiguo He
Tianfu Wang
Huan Yang
Jianlong Fu
Nicholas Jing Yuan
Jian Yin
Hongyang Chao
Qi Zhang
|
+
|
SINC: Self-Supervised In-Context Learning for Vision-Language Tasks
|
2023
|
Yi-Syuan Chen
Yun-Zhu Song
Cheng Yu Yeo
Bei Liu
Jianlong Fu
Hong-Han Shuai
|
+
|
MobileVidFactory: Automatic Diffusion-Based Social Media Video Generation for Mobile Devices from Text
|
2023
|
Junchen Zhu
Huan Yang
Wenjing Wang
Huiguo He
Zixi Tuo
Yongsheng Yu
Wen-Huang Cheng
Lianli Gao
Jingkuan Song
Jianlong Fu
|
+
|
NUWA-XL: Diffusion over Diffusion for eXtremely Long Video Generation
|
2023
|
Shengming Yin
Chenfei Wu
Huan Yang
Jianfeng Wang
Xiaodong Wang
Minheng Ni
Zhengyuan Yang
Linjie Li
Shuguang Liu
Fan Yang
|
+
PDF
Chat
|
TTVFI: Learning Trajectory-Aware Transformer for Video Frame Interpolation
|
2023
|
Chengxu Liu
Huan Yang
Jianlong Fu
Xueming Qian
|
+
PDF
Chat
|
4D LUT: Learnable Context-Aware 4D Lookup Table for Image Enhancement
|
2023
|
Chengxu Liu
Huan Yang
Jianlong Fu
Xueming Qian
|
+
|
Improving Diversity in Zero-Shot GAN Adaptation with Semantic Variations
|
2023
|
Seogkyu Jeon
Bei Liu
Pilhyeon Lee
Kibeom Hong
Jianlong Fu
Hyeran Byun
|
+
|
ViCo: Engaging Video Comment Generation with Human Preference Rewards
|
2023
|
Yuchong Sun
Bei Liu
Xu Chen
Ruihua Song
Jianlong Fu
|
+
PDF
Chat
|
AI Illustrator: Translating Raw Descriptions into Images by Prompt-based Cross-Modal Generation
|
2022
|
Yiyang Ma
Huan Yang
Bei Liu
Jianlong Fu
Jiaying Liu
|
+
PDF
Chat
|
Advancing High-Resolution Video-Language Representation with Large-Scale Video Transcriptions
|
2022
|
Hongwei Xue
Tiankai Hang
Yanhong Zeng
Yuchong Sun
Bei Liu
Huan Yang
Jianlong Fu
Baining Guo
|
+
PDF
Chat
|
MiniViT: Compressing Vision Transformers with Weight Multiplexing
|
2022
|
Jinnian Zhang
Houwen Peng
Kan Wu
Mengchen Liu
Bin Xiao
Jianlong Fu
Lu Yuan
|
+
PDF
Chat
|
Learning Trajectory-Aware Transformer for Video Super-Resolution
|
2022
|
Chengxu Liu
Huan Yang
Jianlong Fu
Xueming Qian
|
+
PDF
Chat
|
Aggregated Contextual Transformations for High-Resolution Image Inpainting
|
2022
|
Yanhong Zeng
Jianlong Fu
Hongyang Chao
Baining Guo
|
+
PDF
Chat
|
Cyclic Differentiable Architecture Search
|
2022
|
Hongyuan Yu
Houwen Peng
Yan Huang
Jianlong Fu
Hao Du
Liang Wang
Haibin Ling
|
+
|
Learning Trajectory-Aware Transformer for Video Super-Resolution
|
2022
|
Chengxu Liu
Huan Yang
Jianlong Fu
Xueming Qian
|
+
|
MiniViT: Compressing Vision Transformers with Weight Multiplexing
|
2022
|
Jinnian Zhang
Houwen Peng
Kan Wu
Mengchen Liu
Bin Xiao
Jianlong Fu
Lü Yuan
|
+
|
Degradation-Guided Meta-Restoration Network for Blind Super-Resolution
|
2022
|
Fuzhi Yang
Huan Yang
Yanhong Zeng
Jianlong Fu
Hongtao Lu
|
+
|
TTVFI: Learning Trajectory-Aware Transformer for Video Frame Interpolation
|
2022
|
Chengxu Liu
Huan Yang
Jianlong Fu
Xueming Qian
|
+
|
TinyViT: Fast Pretraining Distillation for Small Vision Transformers
|
2022
|
Kan Wu
Jinnian Zhang
Houwen Peng
Mengchen Liu
Bin Xiao
Jianlong Fu
Lu Yuan
|
+
|
Expanding Language-Image Pretrained Models for General Video Recognition
|
2022
|
Bolin Ni
Houwen Peng
Minghao Chen
Songyang Zhang
Gaofeng Meng
Jianlong Fu
Shiming Xiang
Haibin Ling
|
+
|
Exploring Anchor-based Detection for Ego4D Natural Language Query
|
2022
|
Sipeng Zheng
Qi Zhang
Bei Liu
Qin Jin
Jianlong Fu
|
+
|
Language-Guided Face Animation by Recurrent StyleGAN-based Generator
|
2022
|
Tiankai Hang
Huan Yang
Bei Liu
Jianlong Fu
Xin Geng
Baining Guo
|
+
|
CLIP-ViP: Adapting Pre-trained Image-Text Model to Video-Language Representation Alignment
|
2022
|
Xue Hong-wei
Yuchong Sun
Bei Liu
Jianlong Fu
Ruihua Song
Houqiang Li
Jiebo Luo
|
+
|
4D LUT: Learnable Context-Aware 4D Lookup Table for Image Enhancement
|
2022
|
Chengxu Liu
Huan Yang
Jianlong Fu
Xueming Qian
|
+
|
Learning Spatiotemporal Frequency-Transformer for Compressed Video Super-Resolution
|
2022
|
Zhongwei Qiu
Huan Yang
Jianlong Fu
Dongmei Fu
|
+
|
GRIT-VLP: Grouped Mini-batch Sampling for Efficient Vision and Language Pre-training
|
2022
|
Jaeseok Byun
Taebaek Hwang
Jianlong Fu
Taesup Moon
|
+
|
Fine-Grained Image Style Transfer with Visual Transformers
|
2022
|
Jianbo Wang
Huan Yang
Jianlong Fu
Toshihiko Yamasaki
Baining Guo
|
+
|
Long-Form Video-Language Pre-Training with Multimodal Temporal Contrastive Learning
|
2022
|
Yuchong Sun
Xue Hong-wei
Ruihua Song
Bei Liu
Huan Yang
Jianlong Fu
|
+
|
Weakly-supervised Pre-training for 3D Human Pose Estimation via Perspective Knowledge
|
2022
|
Zhongwei Qiu
Kai Qiu
Jianlong Fu
Dongmei Fu
|
+
|
MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation
|
2022
|
Ludan Ruan
Yiyang Ma
Huan Yang
Huiguo He
Bei Liu
Jianlong Fu
Nicholas Jing Yuan
Qin Jin
Baining Guo
|
+
PDF
Chat
|
Expanding Language-Image Pretrained Models for General Video Recognition
|
2022
|
Bolin Ni
Houwen Peng
Minghao Chen
Songyang Zhang
Gaofeng Meng
Jianlong Fu
Shiming Xiang
Haibin Ling
|
+
PDF
Chat
|
TinyViT: Fast Pretraining Distillation for Small Vision Transformers
|
2022
|
Kan Wu
Jinnian Zhang
Houwen Peng
Mengchen Liu
Bin Xiao
Jianlong Fu
Lu Yuan
|
+
PDF
Chat
|
GRIT-VLP: Grouped Mini-batch Sampling for Efficient Vision and Language Pre-training
|
2022
|
Jaeseok Byun
Taebaek Hwang
Jianlong Fu
Taesup Moon
|
+
PDF
Chat
|
Learning Spatiotemporal Frequency-Transformer for Compressed Video Super-Resolution
|
2022
|
Zhongwei Qiu
Huan Yang
Jianlong Fu
Dongmei Fu
|
+
|
Learning Spatiotemporal Frequency-Transformer for Low-Quality Video Super-Resolution
|
2022
|
Zhongwei Qiu
Huan Yang
Jianlong Fu
Daochang Liu
Chang Xu
Dongmei Fu
|
+
|
Searching the Search Space of Vision Transformer
|
2021
|
Minghao Chen
Kan Wu
Bolin Ni
Houwen Peng
Bei Liu
Jianlong Fu
Hongyang Chao
Haibin Ling
|
+
PDF
Chat
|
Searching the Search Space of Vision Transformer
|
2021
|
Minghao Chen
Kan Wu
Bolin Ni
Houwen Peng
Bei Liu
Jianlong Fu
Hongyang Chao
Haibin Ling
|
+
PDF
Chat
|
Improving Visual Quality of Image Synthesis by A Token-based Generator
with Transformers
|
2021
|
Yanhong Zeng
Huan Yang
Hongyang Chao
Jianbo Wang
Jianlong Fu
|
+
|
A Picture is Worth a Thousand Words: A Unified System for Diverse Captions and Rich Images Generation
|
2021
|
Yupan Huang
Bei Liu
Jianlong Fu
Yutong Lu
|
+
PDF
Chat
|
A Picture is Worth a Thousand Words
|
2021
|
Yupan Huang
Bei Liu
Jianlong Fu
Yutong Lu
|
+
PDF
Chat
|
Learning Fine-Grained Motion Embedding for Landscape Animation
|
2021
|
Hongwei Xue
Bei Liu
Huan Yang
Jianlong Fu
Houqiang Li
Jiebo Luo
|
+
PDF
Chat
|
A Picture is Worth a Thousand Words
|
2021
|
Yupan Huang
Bei Liu
Jianlong Fu
Yutong Lu
|
+
PDF
Chat
|
Learning Conditional Knowledge Distillation for Degraded-Reference Image Quality Assessment
|
2021
|
Heliang Zheng
Huan Yang
Jianlong Fu
Zheng-Jun Zha
Jiebo Luo
|
+
PDF
Chat
|
Rethinking and Improving Relative Position Encoding for Vision Transformer
|
2021
|
Kan Wu
Houwen Peng
Minghao Chen
Jianlong Fu
Hongyang Chao
|
+
PDF
Chat
|
AutoFormer: Searching Transformers for Visual Recognition
|
2021
|
Minghao Chen
Houwen Peng
Jianlong Fu
Haibin Ling
|
+
PDF
Chat
|
Domain-Aware Universal Style Transfer
|
2021
|
Kibeom Hong
Seogkyu Jeon
Huan Yang
Jianlong Fu
Hyeran Byun
|
+
PDF
Chat
|
Learning Spatio-Temporal Transformer for Visual Tracking
|
2021
|
Bin Yan
Houwen Peng
Jianlong Fu
Dong Wang
Huchuan Lu
|
+
PDF
Chat
|
LightTrack: Finding Lightweight Neural Networks for Object Tracking via One-Shot Architecture Search
|
2021
|
Bin Yan
Houwen Peng
Kan Wu
Dong Wang
Jianlong Fu
Huchuan Lu
|
+
PDF
Chat
|
One-Shot Neural Ensemble Architecture Search by Diversity-Guided Search Space Shrinking
|
2021
|
Minghao Chen
Jianlong Fu
Haibin Ling
|
+
PDF
Chat
|
Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation Learning
|
2021
|
Zhicheng Huang
Zhaoyang Zeng
Yupan Huang
Bei Liu
Dongmei Fu
Jianlong Fu
|
+
|
Learning Spatio-Temporal Transformer for Visual Tracking.
|
2021
|
Bin Yan
Houwen Peng
Jianlong Fu
Dong Wang
Huchuan Lu
|
+
|
One-Shot Neural Ensemble Architecture Search by Diversity-Guided Search Space Shrinking
|
2021
|
Minghao Chen
Houwen Peng
Jianlong Fu
Haibin Ling
|
+
|
Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation Learning
|
2021
|
Zhicheng Huang
Zhaoyang Zeng
Yupan Huang
Bei Liu
Dongmei Fu
Jianlong Fu
|
+
|
LightTrack: Finding Lightweight Neural Networks for Object Tracking via One-Shot Architecture Search
|
2021
|
Bin Yan
Houwen Peng
Kan Wu
Dong Wang
Jianlong Fu
Huchuan Lu
|
+
|
AutoFormer: Searching Transformers for Visual Recognition
|
2021
|
Minghao Chen
Houwen Peng
Jianlong Fu
Haibin Ling
|
+
|
Probing Inter-modality: Visual Parsing with Self-Attention for Vision-Language Pre-training
|
2021
|
Hongwei Xue
Yupan Huang
Bei Liu
Houwen Peng
Jianlong Fu
Houqiang Li
Jiebo Luo
|
+
PDF
Chat
|
Reference-Based Defect Detection Network
|
2021
|
Zhaoyang Zeng
Bei Liu
Jianlong Fu
Hongyang Chao
|
+
|
Rethinking and Improving Relative Position Encoding for Vision Transformer
|
2021
|
Kan Wu
Houwen Peng
Minghao Chen
Jianlong Fu
Hongyang Chao
|
+
|
Learning Conditional Knowledge Distillation for Degraded-Reference Image Quality Assessment
|
2021
|
Heliang Zheng
Huan Yang
Jianlong Fu
Zheng-Jun Zha
Jiebo Luo
|
+
|
Domain-Aware Universal Style Transfer
|
2021
|
Kibeom Hong
Seogkyu Jeon
Huan Yang
Jianlong Fu
Hyeran Byun
|
+
|
Learning Fine-Grained Motion Embedding for Landscape Animation
|
2021
|
Hongwei Xue
Bei Liu
Huan Yang
Jianlong Fu
Houqiang Li
Jiebo Luo
|
+
|
Learning Spatio-Temporal Transformer for Visual Tracking
|
2021
|
Bin Yan
Houwen Peng
Jianlong Fu
Dong Wang
Huchuan Lu
|
+
|
Improving Visual Quality of Image Synthesis by A Token-based Generator with Transformers
|
2021
|
Yanhong Zeng
Huan Yang
Hongyang Chao
Jianbo Wang
Jianlong Fu
|
+
|
Searching the Search Space of Vision Transformer
|
2021
|
Minghao Chen
Kan Wu
Bolin Ni
Houwen Peng
Bei Liu
Jianlong Fu
Hongyang Chao
Haibin Ling
|
+
|
Advancing High-Resolution Video-Language Representation with Large-Scale Video Transcriptions
|
2021
|
Hongwei Xue
Tiankai Hang
Yanhong Zeng
Yuchong Sun
Bei Liu
Huan Yang
Jianlong Fu
Baining Guo
|
+
|
Exact weak bosonic zero modes in a spin/fermion chain
|
2021
|
Jianlong Fu
|
+
|
Aggregated Contextual Transformations for High-Resolution Image Inpainting
|
2021
|
Yanhong Zeng
Jianlong Fu
Hongyang Chao
Baining Guo
|
+
PDF
Chat
|
NTIRE 2020 Challenge on Perceptual Extreme Super-Resolution: Methods and Results
|
2020
|
Kai Zhang
Shuhang Gu
Radu Timofte
Taizhang Shang
Qiuju Dai
Shengchen Zhu
Tong Yang
Yandong Guo
Younghyun Jo
Sejong Yang
|
+
PDF
Chat
|
Learning Texture Transformer Network for Image Super-Resolution
|
2020
|
Fuzhi Yang
Huan Yang
Jianlong Fu
Hongtao Lu
Baining Guo
|
+
|
Learning 2D Temporal Adjacent Networks for Moment Localization with Natural Language
|
2020
|
Songyang Zhang
Houwen Peng
Jianlong Fu
Jiebo Luo
|
+
PDF
Chat
|
360-Indoor: Towards Learning Real-World Objects in 360° Indoor Equirectangular Images
|
2020
|
Shih-Han Chou
Cheng Sun
Wen‐Yen Chang
Wan‐Ting Hsu
Min Sun
Jianlong Fu
|
+
|
Pixel-BERT: Aligning Image Pixels with Text by Deep Multi-Modal Transformers
|
2020
|
Zhicheng Huang
Zhaoyang Zeng
Bei Liu
Dongmei Fu
Jianlong Fu
|
+
|
NTIRE 2020 Challenge on Perceptual Extreme Super-Resolution: Methods and Results
|
2020
|
Kai Zhang
Shuhang Gu
Radu Timofte
Taizhang Shang
Qiuju Dai
Shengchen Zhu
Tong Yang
Yandong Guo
Younghyun Jo
Sejong Yang
|
+
|
Learning Texture Transformer Network for Image Super-Resolution
|
2020
|
Fuzhi Yang
Huan Yang
Jianlong Fu
Hongtao Lu
Baining Guo
|
+
|
Ocean: Object-aware Anchor-free Tracking
|
2020
|
Zhipeng Zhang
Houwen Peng
Jianlong Fu
Bing Li
Weiming Hu
|
+
|
Learning Joint Spatial-Temporal Transformations for Video Inpainting
|
2020
|
Yanhong Zeng
Jianlong Fu
Hongyang Chao
|
+
PDF
Chat
|
Revisiting Anchor Mechanisms for Temporal Action Localization
|
2020
|
Le Yang
Houwen Peng
Dingwen Zhang
Jianlong Fu
Junwei Han
|
+
PDF
Chat
|
Learning Joint Spatial-Temporal Transformations for Video Inpainting
|
2020
|
Yanhong Zeng
Jianlong Fu
Hongyang Chao
|
+
|
Cream of the Crop: Distilling Prioritized Paths For One-Shot Neural Architecture Search
|
2020
|
Houwen Peng
Hao Du
Hongyuan Yu
Qi Li
Jing Liao
Jianlong Fu
|
+
PDF
Chat
|
Ocean: Object-Aware Anchor-Free Tracking
|
2020
|
Zhipeng Zhang
Houwen Peng
Jianlong Fu
Bing Li
Weiming Hu
|
+
|
Multi-Scale 2D Temporal Adjacent Networks for Moment Localization with Natural Language
|
2020
|
Songyang Zhang
Houwen Peng
Jianlong Fu
Yijuan Lu
Jiebo Luo
|
+
PDF
Chat
|
Learn to Scale: Generating Multipolar Normalized Density Maps for Crowd Counting
|
2019
|
Chenfeng Xu
Kai Qiu
Jianlong Fu
Song Bai
Yongchao Xu
Xiang Bai
|
+
PDF
Chat
|
WSOD2: Learning Bottom-Up and Top-Down Objectness Distillation for Weakly-Supervised Object Detection
|
2019
|
Zhaoyang Zeng
Bei Liu
Jianlong Fu
Hongyang Chao
Lei Zhang
|
+
|
From Words to Sentences: A Progressive Learning Approach for Zero-resource Machine Translation with Visual Pivots
|
2019
|
Shizhe Chen
Qin Jin
Jianlong Fu
|
+
PDF
Chat
|
Learning Pyramid-Context Encoder Network for High-Quality Image Inpainting
|
2019
|
Yanhong Zeng
Jianlong Fu
Hongyang Chao
Baining Guo
|
+
PDF
Chat
|
Looking for the Devil in the Details: Learning Trilinear Attention Sampling Network for Fine-Grained Image Recognition
|
2019
|
Heliang Zheng
Jianlong Fu
Zheng-Jun Zha
Jiebo Luo
|
+
|
Learning Pyramid-Context Encoder Network for High-Quality Image Inpainting
|
2019
|
Yanhong Zeng
Jianlong Fu
Hongyang Chao
Baining Guo
|
+
|
Looking for the Devil in the Details: Learning Trilinear Attention Sampling Network for Fine-grained Image Recognition
|
2019
|
Heliang Zheng
Jianlong Fu
Zheng-Jun Zha
Jiebo Luo
|
+
|
From Words to Sentences: A Progressive Learning Approach for Zero-resource Machine Translation with Visual Pivots
|
2019
|
Shizhe Chen
Qin Jin
Jianlong Fu
|
+
|
Activitynet 2019 Task 3: Exploring Contexts for Dense Captioning Events in Videos
|
2019
|
Shizhe Chen
Yuqing Song
Yida Zhao
Qin Jin
Zhaoyang Zeng
Bei Liu
Jianlong Fu
Alexander G. Hauptmann
|
+
|
Learn to Scale: Generating Multipolar Normalized Density Maps for Crowd Counting
|
2019
|
Chenfeng Xu
Kai Qiu
Jianlong Fu
Song Bai
Yongchao Xu
Xiang Bai
|
+
|
WSOD^2: Learning Bottom-up and Top-down Objectness Distillation for Weakly-supervised Object Detection
|
2019
|
Zhaoyang Zeng
Bei Liu
Jianlong Fu
Hongyang Chao
Lei Zhang
|
+
|
360-Indoor: Towards Learning Real-World Objects in 360° Indoor Equirectangular Images
|
2019
|
Shih-Han Chou
Cheng Sun
Wen‐Yen Chang
Wan‐Ting Hsu
Min Sun
Jianlong Fu
|
+
|
Learning Rich Image Region Representation for Visual Question Answering
|
2019
|
Bei Liu
Zhicheng Huang
Zhaoyang Zeng
Zheyu Chen
Jianlong Fu
|
+
|
Learning Deep Bilinear Transformation for Fine-grained Image Representation
|
2019
|
Heliang Zheng
Jianlong Fu
Zheng-Jun Zha
Jiebo Luo
|
+
|
Neural Storyboard Artist: Visualizing Stories with Coherent Image Sequences
|
2019
|
Shizhe Chen
Bei Liu
Jianlong Fu
Ruihua Song
Qin Jin
Pingping Lin
Xiaoyu Qi
Chunting Wang
Jin Zhou
|
+
|
Learning 2D Temporal Adjacent Networks for Moment Localization with Natural Language
|
2019
|
Songyang Zhang
Houwen Peng
Jianlong Fu
Jiebo Luo
|
+
|
Learning Sparse 2D Temporal Adjacent Networks for Temporal Action Localization
|
2019
|
Songyang Zhang
Houwen Peng
Le Yang
Jianlong Fu
Jiebo Luo
|
+
PDF
Chat
|
Beyond Narrative Description: Generating Poetry from Images by Multi-Adversarial Training
|
2018
|
Bei Liu
Jianlong Fu
Makoto P. Kato
Masatoshi Yoshikawa
|
+
PDF
Chat
|
Beyond Narrative Description
|
2018
|
Bei Liu
Jianlong Fu
Makoto P. Kato
Masatoshi Yoshikawa
|
+
PDF
Chat
|
Self-View Grounding Given a Narrated 360° Video
|
2018
|
Shih-Han Chou
Yi‐Chun Chen
Kuo-Hao Zeng
Hou-Ning Hu
Jianlong Fu
Min Sun
|
+
PDF
Chat
|
Beyond Narrative Description: Generating Poetry from Images by Multi-Adversarial Training
|
2018
|
Bei Liu
Jianlong Fu
Makoto P. Kato
Masatoshi Yoshikawa
|
+
|
Tell-and-Answer: Towards Explainable Visual Question Answering using Attributes and Captions
|
2018
|
Qing Li
Jianlong Fu
Dongfei Yu
Tao Mei
Jiebo Luo
|
+
|
DA-GAN: Instance-level Image Translation by Deep Attention Generative Adversarial Networks (with Supplementary Materials)
|
2018
|
Shuang Ma
Jianlong Fu
Chang Wen Chen
Tao Mei
|
+
|
Image Inspired Poetry Generation in XiaoIce
|
2018
|
Wen‐Feng Cheng
Chao-Chung Wu
Ruihua Song
Jianlong Fu
Xing Xie
Jian‐Yun Nie
|
+
PDF
Chat
|
Tell-and-Answer: Towards Explainable Visual Question Answering using Attributes and Captions
|
2018
|
Qing Li
Jianlong Fu
Dongfei Yu
Tao Mei
Jiebo Luo
|
+
PDF
Chat
|
3D Human Body Reshaping with Anthropometric Modeling
|
2018
|
Yanhong Zeng
Jianlong Fu
Hongyang Chao
|
+
PDF
Chat
|
Show, Adapt and Tell: Adversarial Training of Cross-Domain Image Captioner
|
2017
|
Tseng-Hung Chen
Yuan-Hong Liao
Ching-Yao Chuang
Wan‐Ting Hsu
Jianlong Fu
Min Sun
|
+
|
Show, Adapt and Tell: Adversarial Training of Cross-domain Image Captioner
|
2017
|
Tseng-Hung Chen
Yuan-Hong Liao
Ching-Yao Chuang
Wan‐Ting Hsu
Jianlong Fu
Min Sun
|
+
|
Self-view Grounding Given a Narrated 360° Video
|
2017
|
Shih-Han Chou
Yi-Chun Chen
Kuo-Hao Zeng
Hou-Ning Hu
Jianlong Fu
Min Sun
|
+
|
Storytelling of Photo Stream with Bidirectional Multi-thread Recurrent Neural Network
|
2016
|
Yu Liu
Jianlong Fu
Tao Mei
Chang Wen Chen
|