Hello!
First and foremost, I'd like to congratulate you on this incredible work. I have a question about how the dataset was created for the In-Context Captioning and Interleaved Image-Text Analysis dimensions. How were the in-context examples chosen?