Yixiao Ge

Follow

Generating author description...

All published works
Action Title Year Authors
+ PDF Chat Moto: Latent Motion Token as the Bridging Language for Robot Manipulation 2024 Yi‐Ping Phoebe Chen
Yuying Ge
Yizhuo Li
Yixiao Ge
Mingyu Ding
Ying Shan
Xihui Liu
+ PDF Chat EgoPlan-Bench2: A Benchmark for Multimodal Large Language Model Planning in Real-World Scenarios 2024 Lu Qiu
Yuying Ge
Yi‐Ping Phoebe Chen
Yixiao Ge
Ying Shan
Xihui Liu
+ PDF Chat DiCoDe: Diffusion-Compressed Deep Tokens for Autoregressive Video Generation with Language Models 2024 Yizhuo Li
Yuying Ge
Yixiao Ge
Ping Luo
Ying Shan
+ PDF Chat Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation 2024 Yuying Ge
Yizhuo Li
Yixiao Ge
Ying Shan
+ PDF Chat Taming Scalable Visual Tokenizer for Autoregressive Image Generation 2024 Fengyuan Shi
Zhuoyan Luo
Yixiao Ge
Yujiu Yang
Ying Shan
Limin Wang
+ PDF Chat ATP-LLaVA: Adaptive Token Pruning for Large Vision Language Models 2024 Xubing Ye
Yukang Gan
Yixiao Ge
Xiaoping Zhang
Yansong Tang
+ PDF Chat Geometric Data Fusion for Collaborative Attitude Estimation 2024 Yixiao Ge
Behzad Zamani
Pieter van Goor
Jochen Trumpf
Robert Mahony
+ PDF Chat SEED-Story: Multimodal Long Story Generation with Large Language Model 2024 Shuai Yang
Yuying Ge
Yang Li
Yukang Chen
Yixiao Ge
Ying Shan
Yingcong Chen
+ PDF Chat VoCo-LLaMA: Towards Vision Compression with Large Language Models 2024 Xubing Ye
Yukang Gan
Xiaoke Huang
Yixiao Ge
Ying Shan
Yansong Tang
+ PDF Chat GrootVL: Tree Topology is All You Need in State Space Model 2024 Yicheng Xiao
Lin Song
Shaoli Huang
Jiangshan Wang
Siyu Song
Yixiao Ge
Xiu Li
Ying Shan
+ PDF Chat Plot2Code: A Comprehensive Benchmark for Evaluating Multi-modal Large Language Models in Code Generation from Scientific Plots 2024 Chengyue Wu
Yixiao Ge
Qiushan Guo
Jiahao Wang
Zhixuan Liang
Zeyu Lu
Ying Shan
Ping Luo
+ PDF Chat SEED-Data-Edit Technical Report: A Hybrid Dataset for Instructional Image Editing 2024 Yuying Ge
Sijie Zhao
Chen Li
Yixiao Ge
Ying Shan
+ PDF Chat SEED-Bench-2-Plus: Benchmarking Multimodal Large Language Models with Text-Rich Visual Comprehension 2024 Bohao Li
Yuying Ge
Yi Chen
Yixiao Ge
Ruimao Zhang
Ying Shan
+ PDF Chat SEED-X: Multimodal Models with Unified Multi-granularity Comprehension and Generation 2024 Yuying Ge
Sijie Zhao
Jinguo Zhu
Yixiao Ge
Kun Yi
Song Lin
Chen Li
Xiaohan Ding
Ying Shan
+ PDF Chat ST-LLM: Large Language Models Are Effective Temporal Learners 2024 Ruyang Liu
Chen Li
Haoran Tang
Yixiao Ge
Ying Shan
Ge Li
+ PDF Chat A Geometric Perspective on Fusing Gaussian Distributions on Lie Groups 2024 Yixiao Ge
Pieter van Goor
Robert Mahony
+ PDF Chat YOLO-World: Real-Time Open-Vocabulary Object Detection 2024 Tianheng Cheng
Lin Song
Yixiao Ge
Wenyu Liu
Xinggang Wang
Ying Shan
+ LLaMA Pro: Progressive LLaMA with Block Expansion 2024 Chengyue Wu
Yukang Gan
Yixiao Ge
Zeyu Lu
Jiahao Wang
Feng Ye
Ping Luo
Ying Shan
+ Towards A Better Metric for Text-to-Video Generation 2024 Jay Zhangjie Wu
Guian Fang
Haoning Wu
Xintao Wang
Yixiao Ge
Xiaodong Cun
David Junhao Zhang
Jia-Wei Liu
Yuchao Gu
Rui Zhao
+ Supervised Fine-tuning in turn Improves Visual Foundation Models 2024 Xiaohu Jiang
Yixiao Ge
Yuying Ge
Chun Yuan
Ying Shan
+ Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities 2024 Yiyuan Zhang
Xiaohan Ding
Kaixiong Gong
Yixiao Ge
Ying Shan
Xiangyu Yue
+ PDF Chat A Note on the Extended Kalman Filter on a Manifold 2023 Yixiao Ge
Pieter van Goor
Robert Mahony
+ PDF Chat Exploring Model Transferability through the Lens of Potential Energy 2023 Xiaotong Li
Zixuan Hu
Yixiao Ge
Ying Shan
Ling‐Yu Duan
+ PDF Chat Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation 2023 Jay Zhangjie Wu
Yixiao Ge
Xintao Wang
Stan Weixian Lei
Yuchao Gu
Yufei Shi
Wynne Hsu
Ying Shan
Xiaohu Qie
Mike Zheng Shou
+ PDF Chat BoxSnake: Polygonal Instance Segmentation with Box Supervision 2023 Rui Yang
Song Lin
Yixiao Ge
Xiu Li
+ PDF Chat Unleashing Vanilla Vision Transformer with Masked Image Modeling for Object Detection 2023 Y.K. Fang
Shusheng Yang
Shijie Wang
Yixiao Ge
Ying Shan
Xinggang Wang
+ Binary Embedding-based Retrieval at Tencent 2023 Yukang Gan
Yixiao Ge
Chang Zhou
Shupeng Su
Zhouchuan Xu
Xuyuan Xu
Quanchao Hui
Xiang Chen
Yexin Wang
Ying Shan
+ PDF Chat Darwinian Model Upgrades: Model Evolving with Selective Compatibility 2023 Binjie Zhang
Shupeng Su
Yixiao Ge
Xuyuan Xu
Yexin Wang
Chun Yuan
Mike Zheng Shou
Ying Shan
+ PDF Chat Accelerating Vision-Language Pretraining with Free Language Modeling 2023 Teng Wang
Yixiao Ge
Feng Zheng
Ran Cheng
Ying Shan
Xiaohu Qie
Ping Luo
+ PDF Chat RILS: Masked Visual Reconstruction in Language Semantic Space 2023 Shusheng Yang
Yixiao Ge
Kun Yi
Dian Li
Ying Shan
Xiaohu Qie
Xinggang Wang
+ PDF Chat Learning Transferable Spatiotemporal Representations from Natural Script Knowledge 2023 Ziyun Zeng
Yuying Ge
Xihui Liu
Bin Chen
Ping Luo
Shu‐Tao Xia
Yixiao Ge
+ PDF Chat All in One: Exploring Unified Video-Language Pre-Training 2023 Jinpeng Wang
Yixiao Ge
Rui Yan
Yuying Ge
Kevin Qinghong Lin
Satoshi Tsutsui
Xudong Lin
Guanyu Cai
Jianping Wu
Ying Shan
+ Modeling Uncertain Feature Representation for Domain Generalization 2023 Xiaotong Li
Zixuan Hu
Jun Liu
Yixiao Ge
Yongxing Dai
Ling‐Yu Duan
+ RILS: Masked Visual Reconstruction in Language Semantic Space 2023 Shusheng Yang
Yixiao Ge
Kun Yi
Dian Li
Ying Shan
Xiaohu Qie
Xinggang Wang
+ Binary Embedding-based Retrieval at Tencent 2023 Yukang Gan
Yixiao Ge
Chang Zhou
Shupeng Su
Zhouchuan Xu
Xuyuan Xu
Quanchao Hui
Xiang Chen
Yexin Wang
Ying Shan
+ BoxSnake: Polygonal Instance Segmentation with Box Supervision 2023 Rui Yang
Song Lin
Yixiao Ge
Xiu Li
+ Accelerating Vision-Language Pretraining with Free Language Modeling 2023 Teng Wang
Yixiao Ge
Feng Zheng
Ran Cheng
Ying Shan
Xiaohu Qie
Ping Luo
+ TagGPT: Large Language Models are Zero-shot Multimodal Taggers 2023 Chen Li
Yixiao Ge
Jiayong Mao
Dian Li
Ying Shan
+ Attack is Good Augmentation: Towards Skeleton-Contrastive Representation Learning 2023 Binqian Xu
Xiangbo Shu
Rui Yan
Guo-Sen Xie
Yixiao Ge
Mike Zheng Shou
+ $π$-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation 2023 Chengyue Wu
Teng Wang
Yixiao Ge
Zeyu Lu
Ruisong Zhou
Ying Shan
Ping Luo
+ Caption Anything: Interactive Image Description with Diverse Multimodal Controls 2023 Teng Wang
Jinrui Zhang
Junjie Fei
Yixiao Ge
Hao Zheng
Yunlong Tang
Zhe Li
Mingqi Gao
Shanshan Zhao
Ying Shan
+ What Makes for Good Visual Tokenizers for Large Language Models? 2023 Guangzhi Wang
Yixiao Ge
Xiaohan Ding
Mohan Kankanhalli
Ying Shan
+ TVTSv2: Learning Out-of-the-box Spatiotemporal Visual Representations at Scale 2023 Ziyun Zeng
Yixiao Ge
Tong Zhan
Xihui Liu
Shu‐Tao Xia
Ying Shan
+ Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models 2023 Yuchao Gu
Xintao Wang
Jay Zhangjie Wu
Yujun Shi
Yunpeng Chen
Zihan Fan
Wuyou Xiao
Rui Zhao
Shuning Chang
Weijia Wu
+ GPT4Tools: Teaching Large Language Model to Use Tools via Self-instruction 2023 Rui Yang
Song Lin
Yanwei Li
Sijie Zhao
Yixiao Ge
Xiu Li
Ying Shan
+ Sticker820K: Empowering Interactive Retrieval with Stickers 2023 Sijie Zhao
Yixiao Ge
Zhongang Qi
Song Lin
Xiaohan Ding
Zehua Xie
Ying Shan
+ TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter 2023 Binjie Zhang
Yixiao Ge
Xuyuan Xu
Ying Shan
Mike Zheng Shou
+ PTVD: A Large-Scale Plot-Oriented Multimodal Dataset Based on Television Dramas 2023 C. C. Li
Xutan Peng
Teng Wang
Yixiao Ge
Mengyang Liu
Xuyuan Xu
Yexin Wang
Ying Shan
+ DreamDiffusion: Generating High-Quality Images from Brain EEG Signals 2023 Yunpeng Bai
Xintao Wang
Yan–Pei Cao
Yixiao Ge
Chun Yuan
Ying Shan
+ Planting a SEED of Vision in Large Language Model 2023 Yuying Ge
Yixiao Ge
Ziyun Zeng
Xintao Wang
Ying Shan
+ SEED-Bench: Benchmarking Multimodal LLMs with Generative Comprehension 2023 Bohao Li
Rui Wang
Guangzhi Wang
Yuying Ge
Yixiao Ge
Ying Shan
+ ViT-Lens: Towards Omni-modal Representations 2023 Weixian Lei
Yixiao Ge
Jianfeng Zhang
Dylan Sun
Kun Yi
Ying Shan
Mike Zheng Shou
+ Exploring Model Transferability through the Lens of Potential Energy 2023 Xiaotong Li
Zixuan Hu
Yixiao Ge
Ying Shan
Ling‐Yu Duan
+ Equivariant Symmetries for Inertial Navigation Systems 2023 Alessandro Fornasier
Yixiao Ge
Pieter van Goor
Robert Mahony
Stephan Weiss
+ A Note on the Extended Kalman Filter on a Manifold 2023 Yixiao Ge
Pieter van Goor
Robert Mahony
+ One For All: Video Conversation is Feasible Without Video Instruction Tuning 2023 Ruyang Liu
Chen Li
Yixiao Ge
Ying Shan
Thomas H. Li
Ge Li
+ Making LLaMA SEE and Draw with SEED Tokenizer 2023 Yuying Ge
Sijie Zhao
Ziyun Zeng
Yixiao Ge
Chen Li
Xintao Wang
Ying Shan
+ Meta-Adapter: An Online Few-shot Learner for Vision-Language Model 2023 Cheng Cheng
Lin Song
Ruoyi Xue
Hang Wang
Hongbin Sun
Yixiao Ge
Ying Shan
+ Vision-Language Instruction Tuning: A Review and Analysis 2023 Chen Li
Yixiao Ge
Dian Li
Ying Shan
+ UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition 2023 Xiaohan Ding
Yiyuan Zhang
Yixiao Ge
Sijie Zhao
Lin Song
Xiangyu Yue
Ying Shan
+ ViT-Lens-2: Gateway to Omni-modal Intelligence 2023 Weixian Lei
Yixiao Ge
Kun Yi
Jianfeng Zhang
Difei Gao
Dylan Sun
Yuying Ge
Ying Shan
Mike Zheng Shou
+ SEED-Bench-2: Benchmarking Multimodal Large Language Models 2023 Bohao Li
Yuying Ge
Yixiao Ge
Guangzhi Wang
Rui Wang
Ruimao Zhang
Ying Shan
+ EgoPlan-Bench: Benchmarking Egocentric Embodied Planning with Multimodal Large Language Models 2023 Yi Chen
Yuying Ge
Yixiao Ge
Mingyu Ding
Bohao Li
Rui Wang
Ruifeng Xu
Ying Shan
Xihui Liu
+ SmartEdit: Exploring Complex Instruction-based Image Editing with Multimodal Large Language Models 2023 Yuzhou Huang
Liangbin Xie
Xintao Wang
Ziyang Yuan
Xiaodong Cun
Yixiao Ge
Jiantao Zhou
Chao Dong
Rui Huang
Ruimao Zhang
+ VL-GPT: A Generative Pre-trained Transformer for Vision and Language Understanding and Generation 2023 Jinguo Zhu
Xiaohan Ding
Yixiao Ge
Yuying Ge
Sijie Zhao
Hengshuang Zhao
Xiaohua Wang
Ying Shan
+ Cached Transformers: Improving Transformers with Differentiable Memory Cache 2023 Zhaoyang Zhang
Wenqi Shao
Yixiao Ge
Xiaogang Wang
Jinwei Gu
Ping Luo
+ PDF Chat Equivariant Filter Design for Discrete-time Systems 2022 Yixiao Ge
Pieter van Goor
Robert Mahony
+ PDF Chat Towards Universal Backward-Compatible Representation Learning 2022 Binjie Zhang
Yixiao Ge
Yantao Shen
Shupeng Su
Fanzi Wu
Chun Yuan
Xuyuan Xu
Yexin Wang
Ying Shan
+ PDF Chat Object-aware Video-language Pre-training for Retrieval 2022 Alex Jinpeng Wang
Yixiao Ge
Guanyu Cai
Rui Yan
Xudong Lin
Ying Shan
Xiaohu Qie
Mike Zheng Shou
+ PDF Chat Bridging Video-text Retrieval with Multiple Choice Questions 2022 Yuying Ge
Yixiao Ge
Xihui Liu
Dian Li
Ying Shan
Xiaohu Qie
Ping Luo
+ PDF Chat Structured Domain Adaptation With Online Relation Regularization for Unsupervised Person Re-ID 2022 Yixiao Ge
Feng Zhu
Dapeng Chen
Rui Zhao
Xiaogang Wang
Hongsheng Li
+ All in One: Exploring Unified Video-Language Pre-training 2022 Alex Jinpeng Wang
Yixiao Ge
Rui Yan
Yuying Ge
Xudong Lin
Guanyu Cai
Jianping Wu
Ying Shan
Xiaohu Qie
Mike Zheng Shou
+ Hot-Refresh Model Upgrades with Regression-Alleviating Compatible Training in Image Retrieval 2022 Binjie Zhang
Yixiao Ge
Yantao Shen
Yu Li
Chun Yuan
Xuyuan Xu
Yexin Wang
Ying Shan
+ mc-BEiT: Multi-choice Discretization for Image BERT Pre-training 2022 Xiaotong Li
Yixiao Ge
Kun Yi
Zixuan Hu
Ying Shan
Ling‐Yu Duan
+ Uncertainty Modeling for Out-of-Distribution Generalization 2022 Xiaotong Li
Yongxing Dai
Yixiao Ge
Jun Liu
Ying Shan
Ling‐Yu Duan
+ MILES: Visual BERT Pre-training with Injected Language Semantics for Video-text Retrieval 2022 Yuying Ge
Yixiao Ge
Xihui Liu
Alex Jinpeng Wang
Jianping Wu
Ying Shan
Xiaohu Qie
Ping Luo
+ Privacy-Preserving Model Upgrades with Bidirectional Compatible Training in Image Retrieval 2022 Shupeng Su
Binjie Zhang
Yixiao Ge
Xuyuan Xu
Yexin Wang
Chun Yuan
Ying Shan
+ Bridging Video-text Retrieval with Multiple Choice Questions 2022 Yuying Ge
Yixiao Ge
Xihui Liu
Dian Li
Ying Shan
Xiaohu Qie
Ping Luo
+ Revitalize Region Feature for Democratizing Video-Language Pre-training of Retrieval 2022 Guanyu Cai
Yixiao Ge
Alex Jinpeng Wang
Rui Yan
Xudong Lin
Ying Shan
Lianghua He
Xiaohu Qie
Jianping Wu
Mike Zheng Shou
+ Towards Universal Backward-Compatible Representation Learning 2022 Binjie Zhang
Yixiao Ge
Yantao Shen
Shupeng Su
Fanzi Wu
Chun Yuan
Xuyuan Xu
Yexin Wang
Ying Shan
+ Unleashing Vanilla Vision Transformer with Masked Image Modeling for Object Detection 2022 Y.K. Fang
Shusheng Yang
Shijie Wang
Yixiao Ge
Ying Shan
Xinggang Wang
+ Masked Image Modeling with Denoising Contrast 2022 Kun Yi
Yixiao Ge
Xiaotong Li
Shusheng Yang
Dian Li
Jianping Wu
Ying Shan
Xiaohu Qie
+ PDF Chat Not All Models Are Equal: Predicting Model Transferability in a Self-challenging Fisher Space 2022 Wenqi Shao
Xun Zhao
Yixiao Ge
Zhaoyang Zhang
Yang Lei
Xiaogang Wang
Ying Shan
Ping Luo
+ Equivariant Filter Design for Discrete-time systems 2022 Yixiao Ge
Pieter van Goor
Robert Mahony
+ Learning Transferable Spatiotemporal Representations from Natural Script Knowledge 2022 Ziyun Zeng
Yuying Ge
Xihui Liu
Bin Chen
Ping Luo
Shu‐Tao Xia
Yixiao Ge
+ Darwinian Model Upgrades: Model Evolving with Selective Compatibility 2022 Binjie Zhang
Shupeng Su
Yixiao Ge
Xuyuan Xu
Yexin Wang
Chun Yuan
Mike Zheng Shou
Ying Shan
+ Rethinking the Objectives of Vector-Quantized Tokenizers for Image Synthesis 2022 Yuchao Gu
Xintao Wang
Yixiao Ge
Ying Shan
Xiaohu Qie
Mike Zheng Shou
+ Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation 2022 Jay Zhangjie Wu
Yixiao Ge
Xintao Wang
Weixian Lei
Yuchao Gu
Wynne Hsu
Ying Shan
Xiaohu Qie
Mike Zheng Shou
+ PDF Chat mc-BEiT: Multi-choice Discretization for Image BERT Pre-training 2022 Xiaotong Li
Yixiao Ge
Kun Yi
Zixuan Hu
Ying Shan
Ling‐Yu Duan
+ PDF Chat MILES: Visual BERT Pre-training with Injected Language Semantics for Video-Text Retrieval 2022 Yuying Ge
Yixiao Ge
Xihui Liu
Jinpeng Wang
Jianping Wu
Ying Shan
Xiaohu Qie
Ping Luo
+ Not All Models Are Equal: Predicting Model Transferability in a Self-challenging Fisher Space 2022 Wenqi Shao
Xun Zhao
Yixiao Ge
Zhaoyang Zhang
Yang Lei
Xiaogang Wang
Ying Shan
Ping Luo
+ PDF Chat Dynamic Token Normalization Improves Vision Transformer 2021 Wenqi Shao
Yixiao Ge
Zhaoyang Zhang
Xuyuan Xu
Xiaogang Wang
Ying Shan
Ping Luo
+ PDF Chat Video-Text Pre-training with Learned Regions 2021 Rui Yan
Mike Zheng Shou
Yixiao Ge
Alex Jinpeng Wang
Xudong Lin
Guanyu Cai
Jinhui Tang
+ PDF Chat Object-aware Video-language Pre-training for Retrieval 2021 Alex Jinpeng Wang
Yixiao Ge
Guanyu Cai
Rui Yan
Xudong Lin
Ying Shan
Xiaohu Qie
Mike Zheng Shou
+ PDF Chat Progressive Correspondence Pruning by Consensus Learning 2021 Chen Zhao
Yixiao Ge
Feng Zhu
Rui Zhao
Hongsheng Li
Mathieu Salzmann
+ PDF Chat DivCo: Diverse Conditional Image Synthesis via Contrastive Generative Adversarial Network 2021 Rui Liu
Yixiao Ge
Ching Lam Choi
Xiaogang Wang
Hongsheng Li
+ PDF Chat Refining Pseudo Labels with Clustering Consensus over Generations for Unsupervised Object Re-identification 2021 Xiao Zhang
Yixiao Ge
Yu Qiao
Hongsheng Li
+ PDF Chat DivCo: Diverse Conditional Image Synthesis via Contrastive Generative Adversarial Network 2021 Rui Liu
Yixiao Ge
Ching Lam Choi
Xiaogang Wang
Hongsheng Li
+ Consensus-Guided Correspondence Denoising 2021 Chen Zhao
Yixiao Ge
Jiaqi Yang
Feng Zhu
Rui Zhao
Hongsheng Li
+ PDF Chat Progressive Correspondence Pruning by Consensus Learning 2021 Chen Zhao
Yixiao Ge
Feng Zhu
Rui Zhao
Hongsheng Li
Mathieu Salzmann
+ DivCo: Diverse Conditional Image Synthesis via Contrastive Generative Adversarial Network 2021 Rui Liu
Yixiao Ge
Ching Lam Choi
Xiaogang Wang
Hongsheng Li
+ Self-distillation with Batch Knowledge Ensembling Improves ImageNet Classification 2021 Yixiao Ge
Ching Lam Choi
Xiao Zhang
Peipei Zhao
Feng Zhu
Rui Zhao
Hongsheng Li
+ Refining Pseudo Labels with Clustering Consensus over Generations for Unsupervised Object Re-identification 2021 Xiao Zhang
Yixiao Ge
Yu Qiao
Hongsheng Li
+ Progressive Correspondence Pruning by Consensus Learning 2021 Chen Zhao
Yixiao Ge
Feng Zhu
Rui Zhao
Hongsheng Li
Mathieu Salzmann
+ Video-Text Pre-training with Learned Regions 2021 Rui Yan
Mike Zheng Shou
Yixiao Ge
Alex Jinpeng Wang
Xudong Lin
Guanyu Cai
Jinhui Tang
+ Dynamic Token Normalization Improves Vision Transformers 2021 Wenqi Shao
Yixiao Ge
Zhaoyang Zhang
Xuyuan Xu
Xiaogang Wang
Ying Shan
Ping Luo
+ Object-aware Video-language Pre-training for Retrieval 2021 Alex Jinpeng Wang
Yixiao Ge
Guanyu Cai
Rui Yan
Xudong Lin
Ying Shan
Xiaohu Qie
Mike Zheng Shou
+ PDF Chat Improved Mutual Mean-Teaching for Unsupervised Domain Adaptive Re-ID 2020 Yixiao Ge
Shijie Yu
Dapeng Chen
+ Mutual Mean-Teaching: Pseudo Label Refinery for Unsupervised Domain Adaptation on Person Re-identification 2020 Yixiao Ge
Dapeng Chen
Hongsheng Li
+ Mutual Mean-Teaching: Pseudo Label Refinery for Unsupervised Domain Adaptation on Person Re-identification 2020 Yixiao Ge
Dapeng Chen
Hongsheng Li
+ Structured Domain Adaptation for Unsupervised Person Re-identification. 2020 Yixiao Ge
Feng Zhu
Rui Zhao
Hongsheng Li
+ Self-supervising Fine-grained Region Similarities for Large-scale Image Localization 2020 Yixiao Ge
Haibo Wang
Feng Zhu
Rui Zhao
Hongsheng Li
+ Self-paced Contrastive Learning with Hybrid Memory for Domain Adaptive Object Re-ID 2020 Yixiao Ge
Dapeng Chen
Feng Zhu
Rui Zhao
Hongsheng Li
+ Improved Mutual Mean-Teaching for Unsupervised Domain Adaptive Re-ID 2020 Yixiao Ge
Shijie Yu
Dapeng Chen
+ PDF Chat Self-supervising Fine-Grained Region Similarities for Large-Scale Image Localization 2020 Yixiao Ge
Haibo Wang
Feng Zhu
Rui Zhao
Hongsheng Li
+ Structured Domain Adaptation with Online Relation Regularization for Unsupervised Person Re-ID 2020 Yixiao Ge
Feng Zhu
Dapeng Chen
Rui Zhao
Xiaogang Wang
Hongsheng Li
+ FD-GAN: Pose-guided Feature Distilling GAN for Robust Person Re-identification 2018 Yixiao Ge
Zhuowan Li
Haiyu Zhao
Guojun Yin
Shuai Yi
Xiaogang Wang
Hongsheng Li
+ FD-GAN: Pose-guided Feature Distilling GAN for Robust Person Re-identification 2018 Yixiao Ge
Zhuowan Li
Haiyu Zhao
Guojun Yin
Shuai Yi
Xiaogang Wang
Hongsheng Li
+ Circuit implementations and bifurcations of a novel fractional-order chaotic system 2015 Huang Wen-Di
Yixiao Ge
Fuhong Min
Enrong Wang
Common Coauthors
Commonly Cited References
Action Title Year Authors # of times referenced
+ PDF Chat Deep Residual Learning for Image Recognition 2016 Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
16
+ PDF Chat Momentum Contrast for Unsupervised Visual Representation Learning 2020 Kaiming He
Haoqi Fan
Yuxin Wu
Saining Xie
Ross Girshick
16
+ Mutual Mean-Teaching: Pseudo Label Refinery for Unsupervised Domain Adaptation on Person Re-identification 2020 Yixiao Ge
Dapeng Chen
Hongsheng Li
7
+ PDF Chat Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval 2021 Max Bain
Arsha Nagrani
GĂźl Varol
Andrew Zisserman
7
+ PDF Chat Structured Domain Adaptation With Online Relation Regularization for Unsupervised Person Re-ID 2022 Yixiao Ge
Feng Zhu
Dapeng Chen
Rui Zhao
Xiaogang Wang
Hongsheng Li
7
+ PDF Chat Performance Measures and a Data Set for Multi-target, Multi-camera Tracking 2016 Ergys Ristani
Francesco Solera
Roger S. Zou
Rita Cucchiara
Carlo Tomasi
7
+ PDF Chat Self-Similarity Grouping: A Simple Unsupervised Cross Domain Adaptation Approach for Person Re-Identification 2019 Yang Fu
Yunchao Wei
Guanshuo Wang
Yuqian Zhou
Humphrey Shi
Uiuc Uiuc
Thomas S. Huang
7
+ PDF Chat Self-Training With Progressive Augmentation for Unsupervised Cross-Domain Person Re-Identification 2019 Xinyu Zhang
Jiewei Cao
Chunhua Shen
Mingyu You
7
+ PDF Chat Image-Image Domain Adaptation with Preserved Self-Similarity and Domain-Dissimilarity for Person Re-identification 2018 Weijian Deng
Liang Zheng
Qixiang Ye
Guoliang Kang
Yi Yang
Jianbin Jiao
6
+ PDF Chat Invariance Matters: Exemplar Memory for Domain Adaptive Person Re-Identification 2019 Zhun Zhong
Liang Zheng
Zhiming Luo
Shaozi Li
Yi Yang
6
+ PDF Chat Deep Clustering for Unsupervised Learning of Visual Features 2018 Mathilde Caron
Piotr Bojanowski
Armand Joulin
Matthijs Douze
6
+ PDF Chat Re-ranking Person Re-identification with k-Reciprocal Encoding 2017 Zhun Zhong
Liang Zheng
Donglin Cao
Shaozi Li
6
+ PDF Chat Less is More: CLIPBERT for Video-and-Language Learning via Sparse Sampling 2021 Jie Lei
Linjie Li
Luowei Zhou
Zhe Gan
Tamara L. Berg
Mohit Bansal
Jingjing Liu
6
+ PDF Chat Noise Estimation Using Density Estimation for Self-Supervised Multimodal Learning 2021 Elad Amrani
Rami Ben‐Ari
Daniel Rotman
Alex Bronstein
6
+ PDF Chat Person Transfer GAN to Bridge Domain Gap for Person Re-identification 2018 Longhui Wei
Shiliang Zhang
Wen Gao
Qi Tian
6
+ PDF Chat Random Erasing Data Augmentation 2020 Zhun Zhong
Liang Zheng
Guoliang Kang
Shaozi Li
Yi Yang
6
+ PDF Chat Unsupervised domain adaptive re-identification: Theory and practice 2020 Liangchen Song
Cheng Wang
Lefei Zhang
Bo Du
Qian Zhang
Chang Huang
Xinggang Wang
5
+ PDF Chat A dataset for Movie Description 2015 Anna Rohrbach
Marcus Rohrbach
Niket Tandon
Bernt Schiele
5
+ FD-GAN: Pose-guided Feature Distilling GAN for Robust Person Re-identification 2018 Yixiao Ge
Zhuowan Li
Haiyu Zhao
Guojun Yin
Shuai Yi
Xiaogang Wang
Hongsheng Li
5
+ PDF Chat Two at Once: Enhancing Learning and Generalization Capacities via IBN-Net 2018 Xingang Pan
Ping Luo
Jianping Shi
Xiaoou Tang
5
+ PDF Chat Contrastive Multiview Coding 2020 Yonglong Tian
Dilip Krishnan
Phillip Isola
5
+ Representation Learning with Contrastive Predictive Coding 2018 Aäron van den Oord
Yazhe Li
Oriol Vinyals
5
+ PDF Chat Image-to-Image Translation with Conditional Adversarial Networks 2017 Phillip Isola
Jun-Yan Zhu
Tinghui Zhou
Alexei A. Efros
5
+ PDF Chat Multi-modal Transformer for Video Retrieval 2020 Valentin Gabeur
Chen Sun
Karteek Alahari
Cordelia Schmid
5
+ PDF Chat AD-Cluster: Augmented Discriminative Clustering for Domain Adaptive Person Re-Identification 2020 Yunpeng Zhai
Shijian Lu
Qixiang Ye
Xuebo Shan
Jie Chen
Rongrong Ji
Yonghong Tian
5
+ PDF Chat Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks 2017 Jun-Yan Zhu
Taesung Park
Phillip Isola
Alexei A. Efros
5
+ PDF Chat Localizing Moments in Video with Natural Language 2017 Lisa Anne Hendricks
Oliver Wang
Eli Shechtman
Josef Ĺ ivic
Trevor Darrell
Bryan Russell
5
+ PDF Chat Masked Autoencoders Are Scalable Vision Learners 2022 Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr DollĂĄr
Ross Girshick
5
+ A Simple Framework for Contrastive Learning of Visual Representations 2020 Ting Chen
Simon Kornblith
Mohammad Norouzi
Geoffrey E. Hinton
5
+ PDF Chat Unsupervised Person Re-Identification by Soft Multilabel Learning 2019 Hong-Xing Yu
Wei‐Shi Zheng
Ancong Wu
Xiaowei Guo
Shaogang Gong
Jianhuang Lai
5
+ PDF Chat Self-supervising Fine-Grained Region Similarities for Large-Scale Image Localization 2020 Yixiao Ge
Haibo Wang
Feng Zhu
Rui Zhao
Hongsheng Li
5
+ PDF Chat VideoBERT: A Joint Model for Video and Language Representation Learning 2019 Chen Sun
Austin Myers
Carl Vondrick
Kevin Murphy
Cordelia Schmid
5
+ BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding 2018 Jacob Devlin
Ming‐Wei Chang
Kenton Lee
Kristina Toutanova
5
+ PDF Chat HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips 2019 Antoine Miech
Dimitri Zhukov
Jean-Baptiste Alayrac
Makarand Tapaswi
Ivan Laptev
Josef Ĺ ivic
5
+ Attention Is All You Need 2017 Ashish Vaswani
Noam Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan N. Gomez
Łukasz Kaiser
Illia Polosukhin
5
+ PDF Chat Swin Transformer: Hierarchical Vision Transformer using Shifted Windows 2021 Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng Zhang
Stephen Lin
Baining Guo
4
+ PDF Chat ArcFace: Additive Angular Margin Loss for Deep Face Recognition 2019 Jiankang Deng
Jia Guo
Niannan Xue
Stefanos Zafeiriou
4
+ Learning Transferable Visual Models From Natural Language Supervision 2021 Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya Ramesh
Gabriel Goh
Sandhini Agarwal
Girish Sastry
Amanda Askell
Pamela Mishkin
Jack Clark
4
+ Is Space-Time Attention All You Need for Video Understanding? 2021 Gedas Bertasius
Heng Wang
Lorenzo Torresani
4
+ PDF Chat Going deeper with convolutions 2015 Christian Szegedy
Wei Liu
Yangqing Jia
Pierre Sermanet
Scott Reed
Dragomir Anguelov
Dumitru Erhan
Vincent Vanhoucke
Andrew Rabinovich
4
+ ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision 2021 Wonjae Kim
Bokyung Son
Ildoo Kim
4
+ Use What You Have: Video Retrieval Using Representations From Collaborative Experts 2019 Yang Liu
Samuel Albanie
Arsha Nagrani
Andrew Zisserman
4
+ PDF Chat Discriminability Distillation in Group Representation Learning 2020 Manyuan Zhang
Guanglu Song
Hang Zhou
Yu Liu
4
+ PDF Chat Adaptation and Re-identification Network: An Unsupervised Deep Transfer Learning Approach to Person Re-identification 2018 Yu-Jhe Li
Fu-En Yang
Yen‐Cheng Liu
Yu-Ying Yeh
Xiaofei Du
Yu-Chiang Frank Wang
4
+ DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter 2019 Victor Sanh
Lysandre Debut
Julien Chaumond
Thomas Wolf
4
+ PDF Chat Unsupervised Person Re-Identification via Multi-Label Classification 2020 Dongkai Wang
Shiliang Zhang
4
+ PDF Chat Transferable Joint Attribute-Identity Deep Learning for Unsupervised Person Re-identification 2018 Jingya Wang
Xiatian Zhu
Shaogang Gong
Wei Li
4
+ PDF Chat Joint Detection and Identification Feature Learning for Person Search 2017 Tong Xiao
Shuang Li
Bochao Wang
Liang Lin
Xiaogang Wang
4
+ PDF Chat ActBERT: Learning Global-Local Video-Text Representations 2020 Linchao Zhu
Yi Yang
4
+ PDF Chat Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks 2020 Xiujun Li
Xi Yin
Chunyuan Li
Pengchuan Zhang
Xiaowei Hu
Lei Zhang
Lijuan Wang
Houdong Hu
Dong Li
Furu Wei
4