Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks
and Algorithms
Modern vision models are trained on very large, noisy datasets. While these models acquire strong capabilities, they may not follow the user's intent and may fail to output the desired results in certain aspects, e.g., visual aesthetics, preferred style, and responsibility. In this paper, we target the realm of visual aesthetics and aim …