Visual Instruction Tuning with Polite Flamingo
Delong Chen, Jianfeng Liu, Wenliang Dai, Baoyuan Wang
June, 2023
Abstract
During visual instruction tuning of multi-modal LLM, we introduced a multi-modal response rewriter called "Polite Flamingo" to address the degeneration of response politness, which is a typical instance of the "multi-modal alignment tax.
![Delong Chen](/authors/admin/avatar_hu0a6e32c613742b32fb538008922dd091_193279_270x270_fill_q75_lanczos_center.jpeg)
Delong Chen
PhD Student
PhD Student at HKUST