[Model] Update Molmo Eval to Match Official Implementation #648

jamespark3922 · 2024-12-05T00:02:53Z

Changes:

Process up to max 36 crops per image instead of 12 crops in default setting.
Integrate prompts from the official Molmo implementation to match the reported numbers in the paper. (https://molmo.allenai.org/paper.pdf).

jamespark3922 · 2024-12-13T20:44:54Z

@kennymckormick Could you take a look at this PR?

kennymckormick · 2024-12-19T10:17:26Z

Hi, @jamespark3922 ,
Will merge this PR after validating the implementation with 1 model.

kennymckormick · 2024-12-30T07:06:08Z

TODO: Re-evaluate Molmo Series

kennymckormick · 2025-01-01T12:48:53Z

Hi, @jamespark3922 ,
I find that when using this piece of codes for evaluation, the scores of Molmo-7B-O and Molmo-7B-D on MMVet dropped by around 10% (absolute). Would you please evaluate these two models on MMVet to double check?

…ass#648) * add molmo prompts * fix lint format

* update vlrewardbench * pre-commit fix * formatter * [Improvement] Better `AUTO_SPLIT` and model split for InternVL2 * [Minor] Improve CC-OCR Import * [Model] Support QVQ * [Model] Update Molmo Eval to Match Official Implementation (#648) * add molmo prompts * fix lint format * [Fix] Refine Qwen-VL2 device assignment * [Fix] Fix RealWorldQA md5 * update MMMU_DEV_VAL tsv * [Fix] Fix confusing image width&height (#704) Co-authored-by: Yuan Ye <yuany2@chinatelecom.cn> * Update llama_vision.py (#705) * [Fix] Fix Lint * Fix Lint * Fix Lint --------- Co-authored-by: kennymckormick <dhd.efz@gmail.com> Co-authored-by: jamespark3922 <jspark96@cs.washington.edu> Co-authored-by: CMeteor <CMeteor@users.noreply.github.com> Co-authored-by: Yuan Ye <yuany2@chinatelecom.cn> Co-authored-by: Guowei Xu <113534787+XuGW-Kevin@users.noreply.github.com>

jamespark3922 added 2 commits December 4, 2024 23:39

add molmo prompts

b9c3fbe

fix lint format

46ff5f3

kennymckormick approved these changes Dec 30, 2024

View reviewed changes

kennymckormick merged commit c08ab64 into open-compass:main Dec 30, 2024
1 check passed

kennymckormick pushed a commit to TobiasLee/VLMEvalKit that referenced this pull request Jan 1, 2025

[Model] Update Molmo Eval to Match Official Implementation (open-comp…

40bbc75

…ass#648) * add molmo prompts * fix lint format

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Model] Update Molmo Eval to Match Official Implementation #648

[Model] Update Molmo Eval to Match Official Implementation #648

jamespark3922 commented Dec 5, 2024

jamespark3922 commented Dec 13, 2024

kennymckormick commented Dec 19, 2024

kennymckormick commented Dec 30, 2024

kennymckormick commented Jan 1, 2025

[Model] Update Molmo Eval to Match Official Implementation #648

[Model] Update Molmo Eval to Match Official Implementation #648

Conversation

jamespark3922 commented Dec 5, 2024

jamespark3922 commented Dec 13, 2024

kennymckormick commented Dec 19, 2024

kennymckormick commented Dec 30, 2024

kennymckormick commented Jan 1, 2025