Visual Instruction Tuning with Polite Flamingo

Delong Chen, Jianfeng Liu, Wenliang Dai, Baoyuan Wang

June, 2023

Abstract

During visual instruction tuning of multi-modal LLM, we introduced a multi-modal response rewriter called "Polite Flamingo" to address the degeneration of response politness, which is a typical instance of the "multi-modal alignment tax.

Type

Preprint

Publication

ArXiv Preprint [arXiv]