Delving into Multimodal Prompting for Fine-grained Visual Classification

Type: Preprint

Publication Date: 2023-01-01

Citations: 0

DOI: https://doi.org/10.48550/arxiv.2309.08912

Locations

  • arXiv (Cornell University) - View
  • DataCite API - View

Similar Works

Action Title Year Authors
+ PDF Chat Delving into Multimodal Prompting for Fine-Grained Visual Classification 2024 Xin Jiang
Hao Tang
Junyao Gao
Xiaoyu Du
Shengfeng He
Zechao Li
+ Dual Modality Prompt Tuning for Vision-Language Pre-Trained Model 2022 Yinghui Xing
Qirui Wu
De Cheng
Shizhou Zhang
Guoqiang Liang
Yanning Zhang
+ PDF Chat Dual Modality Prompt Tuning for Vision-Language Pre-Trained Model 2023 Yinghui Xing
Qirui Wu
De Cheng
Shizhou Zhang
Guoqiang Liang
Peng Wang
Yanning Zhang
+ PDF Chat Multi-modal Attribute Prompting for Vision-Language Models 2024 Xin Liu
Jiamin Wu
Wenfei Yang
Xu Zhou
Tianzhu Zhang
+ PDF Chat Multi-modal Attribute Prompting for Vision-Language Models 2024 Xin Liu
Jiamin Wu
Anne En-Tzu Yang
Xu Zhou
Tianzhu Zhang
+ Tuning Multi-mode Token-level Prompt Alignment across Modalities 2023 Dongsheng Wang
Miaoge Li
Xinyang Liu
Mingsheng Xu
Bo Chen
Hanwang Zhang
+ PDF Chat Progressive Multi-modal Conditional Prompt Tuning 2024 Xiaoyu Qiu
Hao Feng
Yuechen Wang
Wengang Zhou
Houqiang Li
+ PDF Chat MePT: Multi-Representation Guided Prompt Tuning for Vision-Language Model 2024 Xinyang Wang
Yi Yang
Minfeng Zhu
Kecheng Zheng
Shi Liu
Wei Chen
+ Task-Oriented Multi-Modal Mutual Leaning for Vision-Language Models 2023 Sifan Long
Zhen Zhao
Junkun Yuan
Zichang Tan
Jiangjiang Liu
Luping Zhou
Shengsheng Wang
Jingdong Wang
+ Progressive Multi-modal Conditional Prompt Tuning 2024 Xiaoyu Qiu
Hao Feng
Yuechen Wang
Wengang Zhou
Houqiang Li
+ APLe: Token-Wise Adaptive for Multi-Modal Prompt Learning 2024 Guiming Cao
Kaize Shi
Hong Fu
Huaiwen Zhang
Guandong Xu
+ PDF Chat TextRefiner: Internal Visual Feature as Efficient Refiner for Vision-Language Models Prompt Tuning 2024 Jingjing Xie
Yuan‐Ting Zhang
Jun Peng
Zhaohong Huang
Liujuan Cao
+ CLIP-Adapter: Better Vision-Language Models with Feature Adapters 2021 Peng Gao
Shijie Geng
Renrui Zhang
Teli Ma
Rongyao Fang
Yongfeng Zhang
Hongsheng Li
Yu Qiao
+ Adapting Pre-trained Language Models to Vision-Language Tasks via Dynamic Visual Prompting 2023 Shubin Huang
Qiong Wu
Yiyi Zhou
Weijie Chen
Rongsheng Zhang
Xiaoshuai Sun
Rongrong Ji
+ Q-Boost: On Visual Quality Assessment Ability of Low-level Multi-Modality Foundation Models 2023 Zicheng Zhang
Haoning Wu
Zhongpeng Ji
Chunyi Li
Erli Zhang
Wei Sun
Xiaohong Liu
Xiongkuo Min
Fengyu Sun
Shangling Jui
+ PDF Chat Multi-Modal Prompt Learning on Blind Image Quality Assessment 2024 Wensheng Pan
Timin Gao
Yan Zhang
Runze Hu
Xiawu Zheng
Enwei Zhang
Yuting Gao
Yutao Liu
Yunhang Shen
Ke Li
+ PDF Chat Deeply Coupled Cross-Modal Prompt Learning 2023 Xuejing Liu
Wei Tang
Jinghui Lu
Rui Zhao
Zhaojun Guo
Fei Tan
+ PDF Chat DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception 2024 Xiaotong Li
Fan Zhang
Haiwen Diao
Yueze Wang
Xinlong Wang
Ling‐Yu Duan
+ DPL: Decoupled Prompt Learning for Vision-Language Models 2023 Xu Chen
Yuhan Zhu
Guozhen Zhang
Haocheng Shen
Yixuan Liao
Xiaoxin Chen
Gangshan Wu
Limin Wang
+ Concept-Guided Prompt Learning for Generalization in Vision-Language Models 2024 Yi Zhang
Ce Zhang
Ke Yu
Yushun Tang
Zhihai He

Works That Cite This (0)

Action Title Year Authors

Works Cited by This (0)

Action Title Year Authors