Ziheng Wu

All published works
Title Year Authors
+ DualToken-ViT: Position-aware Efficient Vision Transformer with Dual Token Fusion 2024 Zhenzhen Chu
Jiayu Chen
Cen Chen
Chengyu Wang
Ziheng Wu
Jun Huang
Weining Qian
+ Scale-Aware Modulation Meet Transformer 2023 Weifeng Lin
Ziheng Wu
Jiayu Chen
Jun Huang
Lianwen Jin
+ SC-ML: Self-supervised Counterfactual Metric Learning for Debiased Visual Question Answering 2023 Xinyao Shu
Shiyang Yan
Yang Xu
Ziheng Wu
Zhongfeng Chen
Zhenyu Lu
+ DiffSynth: Latent In-Iteration Deflickering for Realistic Video Synthesis 2023 Zhongjie Duan
Lizhou You
Chengyu Wang
Cen Chen
Ziheng Wu
Weining Qian
Jun Huang
Fei Chao
+ DualToken-ViT: Position-aware Efficient Vision Transformer with Dual Token Fusion 2023 Zhenzhen Chu
Jiayu Chen
Cen Chen
Chengyu Wang
Ziheng Wu
Jun Huang
Weining Qian
+ EasyPhoto: Your Smart AI Photo Generator 2023 Ziheng Wu
Jiaqi Xu
Xinyi Zou
Kunzhe Huang
Xing Shi
Jun Huang
+ Hierarchical Side-Tuning for Vision Transformers 2023 Weifeng Lin
Ziheng Wu
Jiayu Chen
Wentao Yang
Mingxin Huang
Jun Huang
Lianwen Jin
+ BeautifulPrompt: Towards Automatic Prompt Engineering for Text-to-Image Synthesis 2023 Tingfeng Cao
Chengyu Wang
Bingyan Liu
Ziheng Wu
Jinhui Zhu
Jun Huang
+ YOLOX-PAI: An Improved YOLOX, Stronger and Faster than YOLOv6 2022 Xinyi Zou
Ziheng Wu
Wenmeng Zhou
Jun Huang
+ Three-dimensional multi-scale model of deformable platelets adhesion to vessel wall in blood flow 2014 Ziheng Wu
Zhiliang Xu
Oleg Kim
Mark Alber
+ Multiscale 3D Model of Platelet-Vessel Wall Interactions in Blood Flow 2013 Ziheng Wu
Zhiliang Xu
Mark Alber
Commonly Cited References
Title Year Authors # of times referenced
+ Very Deep Convolutional Networks for Large-Scale Image Recognition 2014 Karen Simonyan
Andrew Zisserman
1
+ Implementation of on-site velocity boundary conditions for D3Q19 lattice Boltzmann simulations 2010 Martin Hecht
Jens Harting
1
+ Deep Unsupervised Learning using Nonequilibrium Thermodynamics 2015 Jascha Sohl‐Dickstein
Eric A. Weiss
Niru Maheswaranathan
Surya Ganguli
1
+ Rethinking the Inception Architecture for Computer Vision 2016 Christian Szegedy
Vincent Vanhoucke
Sergey Ioffe
Jon Shlens
Zbigniew Wojna
1
+ Deep Residual Learning for Image Recognition 2016 Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
1
+ Generative Adversarial Text to Image Synthesis 2016 Scott Reed
Zeynep Akata
Xinchen Yan
Lajanugen Logeswaran
Bernt Schiele
Honglak Lee
1
+ Aggregated Residual Transformations for Deep Neural Networks 2017 Saining Xie
Ross Girshick
Piotr Dollár
Zhuowen Tu
Kaiming He
1
+ Proximal Policy Optimization Algorithms 2017 John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
1
+ Squeeze-and-Excitation Networks 2018 Jie Hu
Li Shen
Gang Sun
1
+ CBAM: Convolutional Block Attention Module 2018 Sanghyun Woo
Jongchan Park
Joon‐Young Lee
In So Kweon
1
+ Unified Perceptual Parsing for Scene Understanding 2018 Tete Xiao
Yingcheng Liu
Bolei Zhou
Yuning Jiang
Jian Sun
1
+ Res2Net: A New Multi-Scale Backbone Architecture 2019 Shanghua Gao
Ming‐Ming Cheng
Kai Zhao
Xinyu Zhang
Ming-Hsuan Yang
Philip H. S. Torr
1
+ Microsoft COCO: Common Objects in Context 2014 Tsung-Yi Lin
Michael Maire
Serge Belongie
Lubomir Bourdev
Ross Girshick
James Hays
Pietro Perona
Deva Ramanan
C. Lawrence Zitnick
Piotr Dollár
1
+ Non-local Neural Networks 2018 Xiaolong Wang
Ross Girshick
Abhinav Gupta
Kaiming He
1
+ ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices 2018 Xiangyu Zhang
Xinyu Zhou
Mengxiao Lin
Jian Sun
1
+ MobileNetV2: Inverted Residuals and Linear Bottlenecks 2018 Mark Sandler
Andrew Howard
Menglong Zhu
Andrey Zhmoginov
Liang-Chieh Chen
1
+ Focal Loss for Dense Object Detection 2017 Tsung-Yi Lin
Priya Goyal
Ross Girshick
Kaiming He
Piotr Dollár
1
+ Cascade R-CNN: Delving Into High Quality Object Detection 2018 Zhaowei Cai
Nuno Vasconcelos
1
+ Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning 2017 Christian Szegedy
Sergey Ioffe
Vincent Vanhoucke
Alexander A. Alemi
1
+ Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks 2019 Nils Reimers
Iryna Gurevych
1
+ Fine-Tuning Language Models from Human Preferences 2019 Daniel M. Ziegler
Nisan Stiennon
Jeffrey Wu
T. B. Brown
Alec Radford
Dario Amodei
Paul F. Christiano
Geoffrey Irving
1
+ Designing Network Design Spaces 2020 Ilija Radosavovic
Raj Prateek Kosaraju
Ross Girshick
Kaiming He
Piotr Dollár
1
+ Bridging the Gap Between Anchor-Based and Anchor-Free Detection via Adaptive Training Sample Selection 2020 Shifeng Zhang
Cheng Chi
Yongqiang Yao
Zhen Lei
Stan Z. Li
1
+ Denoising Diffusion Probabilistic Models 2020 Jonathan Ho
Ajay N. Jain
Pieter Abbeel
1
+ Zero-Shot Text-to-Image Generation 2021 Aditya Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
1
+ Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions 2021 Wenhai Wang
Enze Xie
Xiang Li
Deng-Ping Fan
Kaitao Song
Ding Liang
Tong Lü
Ping Luo
Ling Shao
1
+ Swin Transformer: Hierarchical Vision Transformer using Shifted Windows 2021 Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng Zhang
Stephen Lin
Baining Guo
1
+ Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding 2021 Pengchuan Zhang
Xiyang Dai
Jianwei Yang
Bin Xiao
Lu Yuan
Lei Zhang
Jianfeng Gao
1
+ CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification 2021 Chun-Fu Richard Chen
Quanfu Fan
Rameswar Panda
1
+ Twins: Revisiting the Design of Spatial Attention in Vision Transformers 2021 Xiangxiang Chu
Zhi Tian
Yuqing Wang
Bo Zhang
Haibing Ren
Xiaolin Wei
Huaxia Xia
Chunhua Shen
1
+ Learning Transferable Visual Models From Natural Language Supervision 2021 Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya Ramesh
Gabriel Goh
Sandhini Agarwal
Girish Sastry
Amanda Askell
Pamela Mishkin
Jack Clark
1
+ Shuffle Transformer: Rethinking Spatial Shuffle for Vision Transformer 2021 Zilong Huang
Youcheng Ben
Guozhong Luo
Pei Cheng
Gang Yu
Bin Fu
1
+ DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification 2021 Yongming Rao
Wenliang Zhao
Benlin Liu
Jiwen Lu
Jie Zhou
Cho‐Jui Hsieh
1
+ LoRA: Low-Rank Adaptation of Large Language Models 2021 J. Edward Hu
Yelong Shen
Phillip Wallis
Zeyuan Allen-Zhu
Yuanzhi Li
Shean Wang
Weizhu Chen
1
+ CoAtNet: Marrying Convolution and Attention for All Data Sizes 2021 Zihang Dai
Hanxiao Liu
Quoc V. Le
Mingxing Tan
1
+ Bottleneck Transformers for Visual Recognition 2021 Aravind Srinivas
Tsung-Yi Lin
Niki Parmar
Jonathon Shlens
Pieter Abbeel
Ashish Vaswani
1
+ Sparse R-CNN: End-to-End Object Detection with Learnable Proposals 2021 Peize Sun
Rufeng Zhang
Yi Jiang
Tao Kong
Chenfeng Xu
Wei Zhan
Masayoshi Tomizuka
Lei Li
Zehuan Yuan
Changhu Wang
1
+ PVT v2: Improved baselines with Pyramid Vision Transformer 2022 Wenhai Wang
Enze Xie
Xiang Li
Deng-Ping Fan
Kaitao Song
Ding Liang
Tong Lü
Ping Luo
Ling Shao
1
+ Focal Self-attention for Local-Global Interactions in Vision Transformers 2021 Jianwei Yang
Chunyuan Li
Pengchuan Zhang
Xiyang Dai
Bin Xiao
Lu Yuan
Jianfeng Gao
1
+ Mobile-Former: Bridging MobileNet and Transformer 2022 Yinpeng Chen
Xiyang Dai
Dongdong Chen
Mengchen Liu
Xiaoyi Dong
Lu Yuan
Zicheng Liu
1
+ CvT: Introducing Convolutions to Vision Transformers 2021 Haiping Wu
Bin Xiao
Noel Codella
Mengchen Liu
Xiyang Dai
Lu Yuan
Lei Zhang
1
+ Co-Scale Conv-Attentional Image Transformers 2021 Weijian Xu
Yifan Xu
Tyler A. Chang
Zhuowen Tu
1
+ Hierarchical Text-Conditional Image Generation with CLIP Latents 2022 Aditya Ramesh
Prafulla Dhariwal
Alex Nichol
Casey Chu
Mark Chen
1
+ BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation 2022 Junnan Li
Dongxu Li
Caiming Xiong
Steven C. H. Hoi
1
+ Focal Modulation Networks 2022 Jianwei Yang
Chunyuan Li
Jianfeng Gao
1
+ Training language models to follow instructions with human feedback 2022 Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
1
+ Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding 2022 Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
Emily Denton
Seyed Kamyar Seyed Ghasemipour
Burcu Karagol Ayan
S. Sara Mahdavi
Rapha Gontijo Lopes
1
+ EfficientFormer: Vision Transformers at MobileNet Speed 2022 Yanyu Li
Geng Yuan
Yang Wen
Eric Hu
Georgios Evangelidis
Sergey Tulyakov
Yanzhi Wang
Jian Ren
1
+ Next-ViT: Next Generation Vision Transformer for Efficient Deployment in Realistic Industrial Scenarios 2022 Jiashi Li
Xin Xia
Wei Li
Huixia Li
Xing Wang
Xuefeng Xiao
Rui Wang
Min Zheng
Xin Pan
1
+ HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions 2022 Yongming Rao
Wenliang Zhao
Yansong Tang
Jie Zhou
Ser-Nam Lim
Jiwen Lu
1