+
PDF
Chat
|
FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free
Scale Fusion
|
2024
|
Haonan Qiu
Shiwei Zhang
Yujie Wei
Ruihang Chu
Hangjie Yuan
Xiang Wang
Yingya Zhang
Ziwei Liu
|
+
PDF
Chat
|
PersonalVideo: High ID-Fidelity Video Customization without Dynamic and
Semantic Degradation
|
2024
|
Hengjia Li
Haonan Qiu
Shiwei Zhang
Xiang Wang
Yujie Wei
Zekun Li
Yingya Zhang
Boxi Wu
Cai Deng
|
+
PDF
Chat
|
UniAnimate: Taming Unified Video Diffusion Models for Consistent Human
Image Animation
|
2024
|
Xiang Wang
Shiwei Zhang
Changxin Gao
Jiayu Wang
Xiaoqiang Zhou
Yingya Zhang
Luxin Yan
Nong Sang
|
+
PDF
Chat
|
CMDFusion: Bidirectional Fusion Network With Cross-Modality Knowledge Distillation for LiDAR Semantic Segmentation
|
2023
|
Jun Cen
Shiwei Zhang
Yixuan Pei
Kun Li
Hang Zheng
Maochun Luo
Yingya Zhang
Qifeng Chen
|
+
PDF
Chat
|
HyRSM++: Hybrid relation guided temporal set matching for few-shot action recognition
|
2023
|
Xiang Wang
Shiwei Zhang
Zhiwu Qing
Zhengrong Zuo
Changxin Gao
Rong Jin
Nong Sang
|
+
PDF
Chat
|
Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning
|
2023
|
Zhiwu Qing
Shiwei Zhang
Ziyuan Huang
Yingya Zhang
Changxin Gao
Deli Zhao
Nong Sang
|
+
PDF
Chat
|
RLIPv2: Fast Scaling of Relational Language-Image Pre-training
|
2023
|
Hangjie Yuan
Shiwei Zhang
Xiang Wang
Samuel Albanie
Yining Pan
Tao Feng
Jianwen Jiang
Dong Ni
Yingya Zhang
Deli Zhao
|
+
PDF
Chat
|
Towards Real-World Visual Tracking With Temporal Contexts
|
2023
|
Ziang Cao
Ziyuan Huang
Liang Pan
Shiwei Zhang
Ziwei Liu
Changhong Fu
|
+
PDF
Chat
|
Enlarging Instance-specific and Class-specific Information for Open-set Action Recognition
|
2023
|
Jun Cen
Shiwei Zhang
Xiang Wang
Yixuan Pei
Zhiwu Qing
Yingya Zhang
Qifeng Chen
|
+
PDF
Chat
|
MAR: <u>M</u>asked Autoencoders for Efficient <u>A</u>ction <u>R</u>ecognition
|
2023
|
Zhiwu Qing
Shiwei Zhang
Ziyuan Huang
Xiang Wang
Yuehuan Wang
Yiliang Lv
Changxin Gao
Nong Sang
|
+
|
HyRSM++: Hybrid Relation Guided Temporal Set Matching for Few-shot Action Recognition
|
2023
|
Xiang Wang
Shiwei Zhang
Zhiwu Qing
Zhengrong Zuo
Changxin Gao
Rong Jin
Nong Sang
|
+
|
The Devil is in the Wrongly-classified Samples: Towards Unified Open-set Recognition
|
2023
|
Jun Cen
Di Luan
Shiwei Zhang
Yixuan Pei
Yingya Zhang
Deli Zhao
Shaojie Shen
Qifeng Chen
|
+
|
Auto-Parallelizing Large Models with Rhino: A Systematic Approach on Production AI Platform
|
2023
|
Shiwei Zhang
Lansong Diao
Siyu Wang
Zongyan Cao
Yiliang Gu
Chang Si
Ziji Shi
Zheng Zhen
Chuan Wu
Wei Lin
|
+
PDF
Chat
|
ParamCrop: Parametric Cubic Cropping for Video Contrastive Learning
|
2023
|
Zhiwu Qing
Ziyuan Huang
Shiwei Zhang
Mingqian Tang
Changxin Gao
Rong Jin
Marcelo H. Ang
Nong Sang
|
+
|
CLIP-guided Prototype Modulating for Few-shot Action Recognition
|
2023
|
Xiang Wang
Shiwei Zhang
Jun Cen
Changxin Gao
Yingya Zhang
Deli Zhao
Nong Sang
|
+
|
Enlarging Instance-specific and Class-specific Information for Open-set Action Recognition
|
2023
|
Jun Cen
Shiwei Zhang
Xiang Wang
Yixuan Pei
Zhiwu Qing
Yingya Zhang
Qifeng Chen
|
+
|
MoLo: Motion-augmented Long-short Contrastive Learning for Few-shot Action Recognition
|
2023
|
Xiang Wang
Shiwei Zhang
Zhiwu Qing
Changxin Gao
Yingya Zhang
Deli Zhao
Nong Sang
|
+
|
Crowd Counting with Sparse Annotation
|
2023
|
Shiwei Zhang
Zhengzheng Wang
Qing Liu
Fei Wang
Wei Ke
Tong Zhang
|
+
|
VideoComposer: Compositional Video Synthesis with Motion Controllability
|
2023
|
Xiang Wang
Hangjie Yuan
Shiwei Zhang
Dayou Chen
Jiuniu Wang
Yingya Zhang
Yujun Shen
Deli Zhao
Jingren Zhou
|
+
|
CMDFusion: Bidirectional Fusion Network with Cross-modality Knowledge Distillation for LIDAR Semantic Segmentation
|
2023
|
Jun Cen
Shiwei Zhang
Yixuan Pei
Kun Li
Hang Zheng
Maochun Luo
Yingya Zhang
Qifeng Chen
|
+
|
Temporally-Adaptive Models for Efficient Video Understanding
|
2023
|
Ziyuan Huang
Shiwei Zhang
Liang Pan
Zhiwu Qing
Yingya Zhang
Ziwei Liu
Marcelo H. Ang
|
+
|
ModelScope Text-to-Video Technical Report
|
2023
|
Jiuniu Wang
Hangjie Yuan
Dayou Chen
Yingya Zhang
Xiang Wang
Shiwei Zhang
|
+
|
RLIPv2: Fast Scaling of Relational Language-Image Pre-training
|
2023
|
Hangjie Yuan
Shiwei Zhang
Xiang Wang
Samuel Albanie
Yining Pan
Tao Feng
Jianwen Jiang
Dong Ni
Yingya Zhang
Deli Zhao
|
+
|
Towards Real-World Visual Tracking with Temporal Contexts
|
2023
|
Ziang Cao
Ziyuan Huang
Liang Pan
Shiwei Zhang
Ziwei Liu
Changhong Fu
|
+
|
Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning
|
2023
|
Zhiwu Qing
Shiwei Zhang
Ziyuan Huang
Yingya Zhang
Changxin Gao
Deli Zhao
Nong Sang
|
+
|
Few-shot Action Recognition with Captioning Foundation Models
|
2023
|
Xiang Wang
Shiwei Zhang
Hangjie Yuan
Yingya Zhang
Changxin Gao
Deli Zhao
Nong Sang
|
+
|
I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models
|
2023
|
Shiwei Zhang
Jiayu Wang
Yingya Zhang
Kang Zhao
Hangjie Yuan
Zhiwu Qin
Xiang Wang
Deli Zhao
Jingren Zhou
|
+
|
Check, Locate, Rectify: A Training-Free Layout Calibration System for Text-to-Image Generation
|
2023
|
Biao Gong
Siteng Huang
Yutong Feng
Shiwei Zhang
Yuyuan Li
Yu Liu
|
+
|
Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation
|
2023
|
Zhiwu Qing
Shiwei Zhang
Jiayu Wang
Xiang Wang
Yujie Wei
Yingya Zhang
Changxin Gao
Nong Sang
|
+
|
DreamVideo: Composing Your Dream Videos with Customized Subject and Motion
|
2023
|
Yujie Wei
Shiwei Zhang
Zhiwu Qing
Hangjie Yuan
Zhi‐Heng Liu
Yu Liu
Yingya Zhang
Jingren Zhou
Hongming Shan
|
+
|
VideoLCM: Video Latent Consistency Model
|
2023
|
Xiang Wang
Shiwei Zhang
Han Zhang
Yu Liu
Yingya Zhang
Changxin Gao
Nong Sang
|
+
|
InstructVideo: Instructing Video Diffusion Models with Human Feedback
|
2023
|
Hangjie Yuan
Shiwei Zhang
Xiang Wang
Yujie Wei
Tao Feng
Yining Pan
Yingya Zhang
Ziwei Liu
Samuel Albanie
Dong Ni
|
+
|
A Recipe for Scaling up Text-to-Video Generation with Text-free Videos
|
2023
|
Xiang Wang
Shiwei Zhang
Hangjie Yuan
Zhiwu Qing
Biao Gong
Yingya Zhang
Yujun Shen
Changxin Gao
Nong Sang
|
+
|
Accelerating large-scale distributed neural network training with SPMD parallelism
|
2022
|
Shiwei Zhang
Lansong Diao
Chuan Wu
Siyu Wang
Wei Lin
|
+
PDF
Chat
|
Learning from Untrimmed Videos: Self-Supervised Video Representation Learning with Hierarchical Consistency
|
2022
|
Zhiwu Qing
Shiwei Zhang
Ziyuan Huang
Yi Xu
Xiang Wang
Mingqian Tang
Changxin Gao
Rong Jin
Nong Sang
|
+
PDF
Chat
|
TCTrack: Temporal Contexts for Aerial Tracking
|
2022
|
Ziang Cao
Ziyuan Huang
Liang Pan
Shiwei Zhang
Ziwei Liu
Changhong Fu
|
+
PDF
Chat
|
End-to-End Temporal Action Detection With Transformer
|
2022
|
Xiaolong Liu
Qimeng Wang
Yao Hu
Xu Tang
Shiwei Zhang
Song Bai
Xiang Bai
|
+
|
Learning from Untrimmed Videos: Self-Supervised Video Representation Learning with Hierarchical Consistency
|
2022
|
Zhiwu Qing
Shiwei Zhang
Ziyuan Huang
Yi Xu
Xiang Wang
Mingqian Tang
Changxin Gao
Rong Jin
Nong Sang
|
+
|
Hybrid Relation Guided Set Matching for Few-shot Action Recognition
|
2022
|
Xiang Wang
Shiwei Zhang
Zhiwu Qing
Mingqian Tang
Zhengrong Zuo
Changxin Gao
Rong Jin
Nong Sang
|
+
|
TCTrack: Temporal Contexts for Aerial Tracking
|
2022
|
Ziang Cao
Ziyuan Huang
Liang Pan
Shiwei Zhang
Ziwei Liu
Changhong Fu
|
+
|
Open-world Semantic Segmentation for LIDAR Point Clouds
|
2022
|
Jun Cen
Yun Peng
Shiwei Zhang
Junhao Cai
Di Luan
Michael Yu Wang
Ming Liu
Mingqian Tang
|
+
|
MAR: Masked Autoencoders for Efficient Action Recognition
|
2022
|
Zhiwu Qing
Shiwei Zhang
Ziyuan Huang
Xiang Wang
Yuehuan Wang
Yiliang Lv
Changxin Gao
Nong Sang
|
+
|
Learning a Condensed Frame for Memory-Efficient Video Class-Incremental Learning
|
2022
|
Yixuan Pei
Zhiwu Qing
Jun Cen
Xiang Wang
Shiwei Zhang
Yaxiong Wang
Mingqian Tang
Nong Sang
Xueming Qian
|
+
PDF
Chat
|
Open-world Semantic Segmentation for LIDAR Point Clouds
|
2022
|
Jun Cen
Yun Peng
Shiwei Zhang
Junhao Cai
Di Luan
Mingqian Tang
Ming Liu
Michael Yu Wang
|
+
PDF
Chat
|
OadTR: Online Action Detection with Transformers
|
2021
|
Wang Xiang
Shiwei Zhang
Zhiwu Qing
Yuanjie Shao
Zhengrong Zuo
Changxin Gao
Nong Sang
|
+
PDF
Chat
|
Support-Set Based Cross-Supervision for Video Grounding
|
2021
|
Xinpeng Ding
Nannan Wang
Shiwei Zhang
De Cheng
Xiaomeng Li
Ziyuan Huang
Mingqian Tang
Xinbo Gao
|
+
PDF
Chat
|
Self-Supervised Learning for Semi-Supervised Temporal Action Proposal
|
2021
|
Xiang Wang
Shiwei Zhang
Zhiwu Qing
Yuanjie Shao
Changxin Gao
Nong Sang
|
+
PDF
Chat
|
Self-supervised Motion Learning from Static Images
|
2021
|
Ziyuan Huang
Shiwei Zhang
Jianwen Jiang
Mingqian Tang
Rong Jin
Marcelo H. Ang
|
+
|
Self-supervised Motion Learning from Static Images
|
2021
|
Ziyuan Huang
Shiwei Zhang
Jianwen Jiang
Mingqian Tang
Rong Jin
Marcelo H. Ang
|
+
|
Relation Modeling in Spatio-Temporal Action Localization
|
2021
|
Yutong Feng
Jianwen Jiang
Ziyuan Huang
Zhiwu Qing
Xiang Wang
Shiwei Zhang
Mingqian Tang
Yue Gao
|
+
|
Towards Training Stronger Video Vision Transformers for EPIC-KITCHENS-100 Action Recognition
|
2021
|
Ziyuan Huang
Zhiwu Qing
Wang Xiang
Yutong Feng
Shiwei Zhang
Jianwen Jiang
Zhurong Xia
Mingqian Tang
Nong Sang
Marcelo H. Ang
|
+
|
A Stronger Baseline for Ego-Centric Action Detection
|
2021
|
Zhiwu Qing
Ziyuan Huang
Xiang Wang
Yutong Feng
Shiwei Zhang
Jianwen Jiang
Mingqian Tang
Changxin Gao
Marcelo H. Ang
Nong Sang
|
+
|
OadTR: Online Action Detection with Transformers
|
2021
|
Xiang Wang
Shiwei Zhang
Zhiwu Qing
Yuanjie Shao
Zhengrong Zuo
Changxin Gao
|
+
|
Weakly-Supervised Temporal Action Localization Through Local-Global Background Modeling
|
2021
|
Xiang Wang
Zhiwu Qing
Ziyuan Huang
Yutong Feng
Shiwei Zhang
Jianwen Jiang
Mingqian Tang
Yuanjie Shao
Nong Sang
|
+
|
Exploring Stronger Feature for Temporal Action Localization
|
2021
|
Zhiwu Qing
Xiang Wang
Ziyuan Huang
Yutong Feng
Shiwei Zhang
Jianwen Jiang
Mingqian Tang
Changxin Gao
Nong Sang
|
+
|
Proposal Relation Network for Temporal Action Detection
|
2021
|
Xiang Wang
Zhiwu Qing
Ziyuan Huang
Yutong Feng
Shiwei Zhang
Jianwen Jiang
Mingqian Tang
Changxin Gao
Nong Sang
|
+
|
ParamCrop: Parametric Cubic Cropping for Video Contrastive Learning
|
2021
|
Zhiwu Qing
Ziyuan Huang
Shiwei Zhang
Mingqian Tang
Changxin Gao
Marcelo H. Ang
Rong Ji
Nong Sang
|
+
|
Support-Set Based Cross-Supervision for Video Grounding
|
2021
|
Xinpeng Ding
Nannan Wang
Shiwei Zhang
De Cheng
Xiaomeng Li
Ziyuan Huang
Mingqian Tang
Xinbo Gao
|
+
|
OadTR: Online Action Detection With Transformers
|
2021
|
Xiang Wang
Shiwei Zhang
Zhiwu Qing
Yuanjie Shao
Zhengrong Zuo
Changxin Gao
Nong Sang
|
+
|
Discovery-and-Selection: Towards Optimal Multiple Instance Learning for Weakly Supervised Object Detection
|
2021
|
Shiwei Zhang
Wei Ke
Lin Yang
Qixiang Ye
Xiaopeng Hong
Yihong Gong
Tong Zhang
|
+
|
TAda! Temporally-Adaptive Convolutions for Video Understanding
|
2021
|
Ziyuan Huang
Shiwei Zhang
Liang Pan
Zhiwu Qing
Mingqian Tang
Ziwei Liu
Marcelo H. Ang
|
+
|
CBR-Net: Cascade Boundary Refinement Network for Action Detection: Submission to ActivityNet Challenge 2020 (Task 1)
|
2020
|
Xiang Wang
Baiteng Ma
Zhiwu Qing
Yongpeng Sang
Changxin Gao
Shiwei Zhang
Nong Sang
|
+
|
Temporal Fusion Network for Temporal Action Localization:Submission to ActivityNet Challenge 2020 (Task E)
|
2020
|
Zhiwu Qing
Xiang Wang
Yongpeng Sang
Changxin Gao
Shiwei Zhang
Nong Sang
|
+
|
Multi-Level Temporal Pyramid Network for Action Detection
|
2020
|
Xiang Wang
Changxin Gao
Shiwei Zhang
Nong Sang
|
+
PDF
Chat
|
Multi-level Temporal Pyramid Network for Action Detection
|
2020
|
Xiang Wang
Changxin Gao
Shiwei Zhang
Nong Sang
|