Contents

Instruct Tuning

arXiv Hugging Face

Motivation

Qwen-VL-Chat is the instruction-tuned vision-language chatbot based on Qwen-VL. As shown in Fig. 2, Qwen-VL-Chat is able to interact with users and perceive the input images following the intention of users. aligned with user intent through fine-tuning instructions, showcasing strong interactive capabilities

Contribution

Method

Model Architecture

Experiment

Reference

Question