I see some discrepancies between the model training script and the training details in the paper. It would be really helpful if someone from the team could clarify these:
- The script uses `qkv_proj,o_proj,gate_up_proj,down_proj,k_proj,q_proj,out_proj,v_proj` as the target LoRA modules, but Qwen2 does not have `qkv_proj` or `gate_up_proj`; instead it has separate `q_proj`, `k_proj`, `v_proj`, `gate_proj`, and `up_proj` modules. Is this a typo? What exact modules were trained with LoRA? I wish to reproduce the result and I am running into some issues with it.
- The script sets the LoRA scaling $\alpha$ to 64 (the default), but the paper mentions it as 32.
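For reference, this is a sketch of the LoRA target configuration I am assuming while trying to reproduce, using Qwen2's unfused projection names (the rank value here is my own guess, not from the paper):

```python
# Assumed LoRA targets for Qwen2: attention uses separate q/k/v projections
# (no fused qkv_proj), and the MLP uses gate_proj/up_proj (no fused gate_up_proj).
QWEN2_LORA_TARGETS = [
    "q_proj", "k_proj", "v_proj", "o_proj",   # attention projections
    "gate_proj", "up_proj", "down_proj",      # MLP projections
]

lora_config = {
    "target_modules": QWEN2_LORA_TARGETS,
    "r": 16,            # rank: hypothetical, not stated in the script or paper
    "lora_alpha": 32,   # paper's value; the script default is 64
}
```

Please let me know if these module names or the $\alpha$ value differ from what was actually used in training.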