PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language
Instructions
PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language
Instructions
This paper presents a versatile image-to-image visual assistant, PixWizard, designed for image generation, manipulation, and translation based on free-from language instructions. To this end, we tackle a variety of vision tasks into a unified image-text-to-image generation framework and curate an Omni Pixel-to-Pixel Instruction-Tuning Dataset. By constructing detailed instruction templates in …