Ran Xu

Follow

Generating author description...

All published works
Action Title Year Authors
+ PDF Chat ProVision: Programmatically Scaling Vision-centric Instruction Data for Multimodal Language Models 2024 Jieyu Zhang
Le Xue
Linxin Song
Jun Wang
Weikai Huang
Manli Shu
An Yan
Zixian Ma
Juan Carlos Niebles
Silvio Savarese
+ PDF Chat xGen-MM-Vid (BLIP-3-Video): You Only Need 32 Tokens to Represent a Video Even in VLMs 2024 Michael S. Ryoo
Honglu Zhou
Shrikant Kendre
Can Qin
Le Xue
Manli Shu
Silvio Savarese
Ran Xu
Caiming Xiong
Juan Carlos Niebles
+ PDF Chat xLAM: A Family of Large Action Models to Empower AI Agent Systems 2024 Jianguo Zhang
Tian Lan
Ming Zhu
Zuxin Liu
Thai Hoang
Shirley Kokane
Weiran Yao
Juntao Tan
Anish Prabhakar
Haolin Chen
+ PDF Chat xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations 2024 Can Qin
Congying Xia
Krithika Ramakrishnan
Michael S. Ryoo
Lihui Tu
Yihao Feng
Manli Shu
Honglu Zhou
Anas Awadalla
Jun Wang
+ PDF Chat MobileAIBench: Benchmarking LLMs and LMMs for On-Device Use Cases 2024 Rithesh Murthy
Liangwei Yang
Juntao Tan
Tulika Awalgaonkar
Yilun Zhou
Shelby Heinecke
Sachin N. Desai
Jason Wu
Ran Xu
Sarah Tan
+ PDF Chat Hierarchical Point Attention for Indoor 3D Object Detection 2024 Manli Shu
Le Xue
Ning Yu
Roberto MartĂ­n-MartĂ­n
Caiming Xiong
Tom Goldstein
Juan Carlos Niebles
Ran Xu
+ PDF Chat Deformer: Dynamic Fusion Transformer for Robust Hand Pose Estimation 2023 Qichen Fu
Xingyu Liu
Ran Xu
Juan Carlos Niebles
Kris Kitani
+ PDF Chat Tackling Data Heterogeneity in Federated Learning with Class Prototypes 2023 Yutong Dai
Zeyuan Chen
Junnan Li
Shelby Heinecke
Lichao Sun
Ran Xu
+ PDF Chat Mask-Free OVIS: Open-Vocabulary Instance Segmentation without Manual Mask Annotations 2023 Vibashan VS
Ning Yu
Xing Chen
Can Qin
Mingfei Gao
Juan Carlos Niebles
Vishal M. Patel
Ran Xu
+ Model-Agnostic Hierarchical Attention for 3D Object Detection 2023 Manli Shu
Le Xue
Ning Yu
Roberto MartĂ­n-MartĂ­n
Juan Carlos Niebles
Caiming Xiong
Ran Xu
+ Neighborhood-Regularized Self-Training for Learning with Few Labels 2023 Ran Xu
Yue Yu
Hejie Cui
Xuan Kan
Yanqiao Zhu
Joyce C. Ho
Chao Zhang
Carl Yang
+ Deformer: Dynamic Fusion Transformer for Robust Hand Pose Estimation 2023 Qichen Fu
Xingyu Liu
Ran Xu
Juan Carlos Niebles
Kris Kitani
+ HIVE: Harnessing Human Feedback for Instructional Visual Editing 2023 Shu Zhang
Xinyi Yang
Yihao Feng
Can Qin
Chia-Chih Chen
Ning Yu
Zeyuan Chen
Huan Wang
Silvio Savarese
Stefano Ermon
+ GlueGen: Plug and Play Multi-modal Encoders for X-to-image Generation 2023 Can Qin
Ning Yu
Xing Chen
Shu Zhang
Zeyuan Chen
Stefano Ermon
Yun Fu
Caiming Xiong
Ran Xu
+ Mask-free OVIS: Open-Vocabulary Instance Segmentation without Manual Mask Annotations 2023 Vibashan VS
Ning Yu
Xing Chen
Can Qin
Mingfei Gao
Juan Carlos Niebles
Vishal M. Patel
Ran Xu
+ ULIP-2: Towards Scalable Multimodal Pre-training for 3D Understanding 2023 Le Xue
Ning Yu
Shu Zhang
Junnan Li
Roberto MartĂ­n-MartĂ­n
Jiajun Wu
Caiming Xiong
Ran Xu
Juan Carlos Niebles
Silvio Savarese
+ UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild 2023 Can Qin
Shu Zhang
Ning Yu
Yihao Feng
Xinyi Yang
Yingbo Zhou
Huan Wang
Juan Carlos Niebles
Caiming Xiong
Silvio Savarese
+ A Survey on Knowledge Graphs for Healthcare: Resources, Applications, and Promises 2023 Hejie Cui
Jiaying Lu
Shi-Yu Wang
Ran Xu
Wenjing Ma
Shaojun Yu
Yue Yu
Xuan Kan
Ling Chen
Joyce C. Ho
+ REX: Rapid Exploration and eXploitation for AI Agents 2023 Rithesh Murthy
Shelby Heinecke
Juan Carlos Niebles
Zhiwei Liu
Le Xue
Weiran Yao
Yihao Feng
Zeyuan Chen
Akash Gokul
Devansh Arpit
+ BOLAA: Benchmarking and Orchestrating LLM-augmented Autonomous Agents 2023 Zhiwei Liu
Weiran Yao
Jianguo Zhang
Le Xue
Shelby Heinecke
Rithesh Murthy
Yihao Feng
Zeyuan Chen
Juan Carlos Niebles
Devansh Arpit
+ Use All The Labels: A Hierarchical Multi-Label Contrastive Learning Framework 2022 Shu Zhang
Ran Xu
Caiming Xiong
Chetan Ramaiah
+ Cold-Start Data Selection for Few-shot Language Model Fine-tuning: A Prompt-Based Uncertainty Propagation Approach 2022 Yue Yu
Rongzhi Zhang
Ran Xu
Jieyu Zhang
Jiaming Shen
Chao Zhang
+ TAG: Boosting Text-VQA via Text-aware Visual Question-answer Generation 2022 Jun Wang
Mingfei Gao
Yuqian Hu
Ramprasaath R. Selvaraju
Chetan Ramaiah
Ran Xu
Joseph F. JĂĄJĂĄ
Larry S. Davis
+ Learning Task-Aware Effective Brain Connectivity for fMRI Analysis with Graph Neural Networks 2022 Yue Yu
Xuan Kan
Hejie Cui
Ran Xu
Yujia Zheng
Xiangchen Song
Yanqiao Zhu
Kun Zhang
Razieh Nabi
Ying Guo
+ Tackling Data Heterogeneity in Federated Learning with Class Prototypes 2022 Yutong Dai
Zeyuan Chen
Junnan Li
Shelby Heinecke
Lichao Sun
Ran Xu
+ ULIP: Learning a Unified Representation of Language, Images, and Point Clouds for 3D Understanding 2022 Le Xue
Mingfei Gao
Xing Chen
Roberto MartĂ­n-MartĂ­n
Jiajun Wu
Caiming Xiong
Ran Xu
Juan Carlos Niebles
Silvio Savarese
+ LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer 2022 Ning Yu
Chia-Chih Chen
Zeyuan Chen
Rui Meng
Gang Wu
Paul Josel
Juan Carlos Niebles
Caiming Xiong
Ran Xu
+ PDF Chat Burn After Reading: Online Adaptation for Cross-domain Streaming Data 2021 Luyu Yang
Mingfei Gao
Zeyuan Chen
Ran Xu
Abhinav Shrivastava
Chetan Ramaiah
+ Value Retrieval with Arbitrary Queries for Form-like Documents 2021 Mingfei Gao
Le Xue
Chetan Ramaiah
Xing Chen
Ran Xu
Caiming Xiong
+ Virtuoso: Video-based Intelligence for real-time tuning on SOCs 2021 Jayoung Lee
Pengcheng Wang
Ran Xu
Venkat Dasari
Noah Weston
Yin Li
Saurabh Bagchi
Somali Chaterji
+ Open Vocabulary Object Detection with Pseudo Bounding-Box Labels 2021 Mingfei Gao
Xing Chen
Juan Carlos Niebles
Junnan Li
Ran Xu
Wenhao Liu
Caiming Xiong
+ Field Extraction from Forms with Unlabeled Data 2021 Mingfei Gao
Zeyuan Chen
Nikhil Naik
Kazuma Hashimoto
Caiming Xiong
Ran Xu
+ PDF Chat WOAD: Weakly Supervised Online Action Detection in Untrimmed Videos 2020 Mingfei Gao
Yingbo Zhou
Ran Xu
Richard Socher
Caiming Xiong
+ JANUS: Benchmarking Commercial and Open-Source Cloud and Edge Platforms for Object and Anomaly Detection Workloads 2020 Karthick Shankar
Pengcheng Wang
Ran Xu
Ashraf Mahgoub
Somali Chaterji
+ Collecting and Annotating the Large Continuous Action Dataset 2015 Daniel Paul Barrett
Ran Xu
Haonan Yu
Jeffrey Mark Siskind
Common Coauthors
Commonly Cited References
Action Title Year Authors # of times referenced
+ PDF Chat Swin Transformer: Hierarchical Vision Transformer using Shifted Windows 2021 Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng Zhang
Stephen Lin
Baining Guo
2
+ PDF Chat Deep Residual Learning for Image Recognition 2016 Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
2
+ PDF Chat SMOTE: Synthetic Minority Over-sampling Technique 2002 Nitesh V. Chawla
Kevin W. Bowyer
Lawrence Hall
W. Philip Kegelmeyer
1
+ PDF Chat Multi-view 3D Object Detection Network for Autonomous Driving 2017 Xiaozhi Chen
Huimin Ma
Ji Wan
Bo Li
Tian Xia
1
+ PDF Chat Learning Deep Features for Discriminative Localization 2016 Bolei Zhou
Aditya Khosla
Àgata Lapedriza
Aude Oliva
Antonio Torralba
1
+ PDF Chat Stacked Hourglass Networks for Human Pose Estimation 2016 Alejandro Newell
Kaiyu Yang
Jia Deng
1
+ PDF Chat Coarse-to-Fine Volumetric Prediction for Single-Image 3D Human Pose 2017 Georgios Pavlakos
Xiaowei Zhou
Konstantinos G. Derpanis
Kostas Daniilidis
1
+ PDF Chat Embodied hands 2017 Javier Romero
Dimitrios Tzionas
Michael J. Black
1
+ PDF Chat FoldingNet: Point Cloud Auto-Encoder via Deep Grid Deformation 2018 Yaoqing Yang
Chen Feng
Yiru Shen
Dong Tian
1
+ PDF Chat OctNet: Learning Deep 3D Representations at High Resolutions 2017 Gernot Riegler
Ali Osman Ulusoy
Andreas Geiger
1
+ PDF Chat PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation 2017 Raffaelli Charles
Hao Su
Kaichun Mo
Leonidas Guibas
1
+ PDF Chat Feature Pyramid Networks for Object Detection 2017 Tsung-Yi Lin
Piotr DollĂĄr
Ross Girshick
Kaiming He
Bharath Hariharan
Serge Belongie
1
+ PDF Chat ScanNet: Richly-Annotated 3D Reconstructions of Indoor Scenes 2017 Angela Dai
Anne Lynn S. Chang
Manolis Savva
Maciej Halber
Thomas Funkhouser
Matthias Nießner
1
+ PDF Chat UntrimmedNets for Weakly Supervised Action Recognition and Detection 2017 Limin Wang
Yuanjun Xiong
Dahua Lin
Luc Van Gool
1
+ Local SGD Converges Fast and Communicates Little 2018 Sebastian U. Stich
1
+ PDF Chat SpiderCNN: Deep Learning on Point Sets with Parameterized Convolutional Filters 2018 Yifan Xu
Tianqi Fan
Mingye Xu
Long Zeng
Yu Qiao
1
+ PDF Chat SPLATNet: Sparse Lattice Networks for Point Cloud Processing 2018 Hang Su
Varun Jampani
Deqing Sun
Subhransu Maji
Evangelos Kalogerakis
Ming–Hsuan Yang
Jan Kautz
1
+ Deep Learning using Rectified Linear Units (ReLU) 2018 Abien Fred Agarap
1
+ PDF Chat Class-Balanced Loss Based on Effective Number of Samples 2019 Yin Cui
Menglin Jia
Tsung-Yi Lin
Yang Song
Serge Belongie
1
+ PDF Chat Focal Loss for Dense Object Detection 2018 Tsung-Yi Lin
Priya Goyal
Ross Girshick
Kaiming He
Piotr DollĂĄr
1
+ BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding 2018 Jacob Devlin
Ming‐Wei Chang
Kenton Lee
Kristina Toutanova
1
+ PDF Chat Parallel Restarted SGD with Faster Convergence and Less Communication: Demystifying Why Model Averaging Works for Deep Learning 2019 Hao Yu
Sen Yang
Shenghuo Zhu
1
+ PDF Chat Deep Learning vs. Traditional Computer Vision 2019 Niall O’Mahony
Sean Campbell
Anderson Carvalho
Suman Harapanahalli
Gustavo Velasco-Hernandez
Lenka KrpĂĄlkovĂĄ
Daniel Riordan
J. L. Walsh
1
+ PDF Chat PointRCNN: 3D Object Proposal Generation and Detection From Point Cloud 2019 Shaoshuai Shi
Xiaogang Wang
Hongsheng Li
1
+ PDF Chat Object Counting and Instance Segmentation With Image-Level Supervision 2019 Hisham Cholakkal
Guolei Sun
Fahad Shahbaz Khan
Ling Shao
1
+ PDF Chat Volumetric and Multi-view CNNs for Object Classification on 3D Data 2016 Charles R. Qi
Hao Su
Matthias NieBner
Angela Dai
Mengyuan Yan
Leonidas Guibas
1
+ PDF Chat Learning to Estimate 3D Hand Pose from Single RGB Images 2017 Christian Zimmermann
Thomas Brox
1
+ PDF Chat Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization 2017 Ramprasaath R. Selvaraju
Michael Cogswell
Abhishek Das
Ramakrishna Vedantam
Devi Parikh
Dhruv Batra
1
+ PDF Chat Modeling Point Clouds With Self-Attention and Gumbel Subset Sampling 2019 Jiancheng Yang
Qiang Zhang
Bingbing Ni
Linguo Li
Jinxian Liu
Mengdie Zhou
Qi Tian
1
+ PDF Chat Character-Level Language Modeling with Deeper Self-Attention 2019 Rami Al‐Rfou
Dokook Choe
Noah Constant
Mandy Guo
Llion Jones
1
+ PDF Chat Dynamic Edge-Conditioned Filters in Convolutional Neural Networks on Graphs 2017 Martin Simonovsky
Nikos Komodakis
1
+ PDF Chat 3D Human Pose Estimation Using Convolutional Neural Networks with 2D Pose Information 2016 Sungheon Park
Jihye Hwang
Nojun Kwak
1
+ PDF Chat 3D Semantic Segmentation with Submanifold Sparse Convolutional Networks 2018 Benjamin Graham
Martin Engelcke
Laurens van der Maaten
1
+ Large-Scale Point Cloud Semantic Segmentation with Superpoint Graphs 2018 LoĂŻc Landrieu
Martin Simonovsky
1
+ PDF Chat Weakly Supervised Instance Segmentation Using Class Peak Response 2018 Yanzhao Zhou
Yi Zhu
Qixiang Ye
Qiang Qiu
Jianbin Jiao
1
+ PDF Chat Focal Loss for Dense Object Detection 2017 Tsung-Yi Lin
Priya Goyal
Ross Girshick
Kaiming He
Piotr DollĂĄr
1
+ PDF Chat Weakly Supervised Deep Detection Networks 2016 Hakan Bilen
Andrea Vedaldi
1
+ PDF Chat Tell Me Where to Look: Guided Attention Inference Network 2018 Kunpeng Li
Ziyan Wu
Kuan-Chuan Peng
Jan Ernst
Yun Fu
1
+ Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation 2014 Kyunghyun Cho
Bart van MerriĂŤnboer
Çaǧlar Gülçehre
Dzmitry Bahdanau
Fethi Bougares
Holger Schwenk
Yoshua Bengio
1
+ PDF Chat VoxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection 2018 Yin Zhou
Oncel Tuzel
1
+ PDF Chat Convolutional Mesh Regression for Single-Image Human Shape Reconstruction 2019 Nikos Kolotouros
Georgios Pavlakos
Kostas Daniilidis
1
+ PDF Chat Zero-Shot Object Detection 2018 Ankan Bansal
Karan Sikka
Gaurav Sharma
Rama Chellappa
Ajay Divakaran
1
+ PDF Chat Cost-Sensitive Learning of Deep Feature Representations From Imbalanced Data 2017 Salman Khan
Munawar Hayat
Mohammed Bennamoun
Ferdous Sohel
Roberto Togneri
1
+ PDF Chat Frustum PointNets for 3D Object Detection from RGB-D Data 2018 Charles R. Qi
Wei Liu
Chenxia Wu
Hao Su
Leonidas Guibas
1
+ PDF Chat Pushing the Envelope for RGB-Based Dense 3D Hand Pose Estimation via Neural Rendering 2019 Seungryul Baek
Kwang In Kim
Tae‐Kyun Kim
1
+ PDF Chat MaskLab: Instance Segmentation by Refining Object Detection with Semantic and Direction Features 2018 Liang-Chieh Chen
Alexander Hermans
George Papandreou
Florian Schroff
Peng Wang
Hartwig Adam
1
+ PDF Chat Dissimilarity Coefficient Based Weakly Supervised Object Detection 2019 Aditya Arun
C. V. Jawahar
Manish Kumar
1
+ ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks 2019 Jiasen Lu
Dhruv Batra
Devi Parikh
Stefan Lee
1
+ VisualBERT: A Simple and Performant Baseline for Vision and Language 2019 Liunian Harold Li
Mark Yatskar
Da Yin
Cho‐Jui Hsieh
Kai-Wei Chang
1
+ Distilling the Knowledge in a Neural Network 2015 Geoffrey E. Hinton
Oriol Vinyals
Jay B. Dean
1