
Fix NoneType Error in flatten Function for Text-Only Tasks in LLAVA Models #501

Merged

2 commits merged into EvolvingLMMs-Lab:main on Jan 17, 2025

Conversation

bibisbar (Contributor)

PR Description

This PR resolves a NoneType iteration error (TypeError: 'NoneType' object is not iterable) raised by the flatten function when handling text-only tasks (e.g., MMLU or HellaSwag) in LLaVA-family models.

Problem

The flatten function flattens batched visual tokens, but in text-only tasks it receives None or empty inputs; the original implementation then tries to iterate over None and fails.
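For reference, the unguarded helper follows this common pattern (a paraphrase of what the PR describes, not the verbatim lmms_eval source), which fails as soon as the outer loop touches None:

```python
def flatten(nested):
    # Unguarded pattern: assumes `nested` is always a list of lists.
    flat = []
    for sub in nested:    # TypeError: 'NoneType' object is not iterable
        for item in sub:  # ...when `nested` (or any `sub`) is None
            flat.append(item)
    return flat

flatten(None)  # what a text-only task effectively triggers
```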

Solution

  1. Added input validation to flatten:
    • Checks if the input is None or empty.
    • Returns an empty list if validation fails.
  2. Ensured sub-elements are validated before iteration.

This update ensures compatibility with text-only tasks while maintaining functionality for visual tasks.
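
A minimal sketch of the guarded version described above (illustrative; the actual method lives on the model classes in lmms_eval):

```python
def flatten(nested):
    # 1. Input validation: None or an empty batch yields an empty list.
    if not nested:
        return []
    flat = []
    for sub in nested:
        # 2. Validate each sub-element before iterating over it.
        if not sub:
            continue
        for item in sub:
            flat.append(item)
    return flat

assert flatten(None) == []                        # text-only task, no visuals
assert flatten([[1, 2], None, [3]]) == [1, 2, 3]  # mixed batch still flattens
```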

bibisbar (Contributor, Author)

New PR: Fix and Refactor Text-Only and Multimodal Input Handling

Description

This PR refactors the code to properly handle both text-only and multimodal inputs during evaluation. The following changes were made:

  1. Text-Only Task Handling:
    • For tasks without image input, the model now processes input_ids and labels without passing any image data.
  2. Multimodal Task Handling:
    • When image inputs are provided, image and image_sizes are passed to the model along with input_ids and labels.
    • Added conditional checks to differentiate between text-only and multimodal evaluation.
  3. Shape Adjustments for Input and Labels:
    • Ensured that the input_ids and labels tensors have the correct shape, (batch_size, seq_len).
    • The context part of the labels is set to -100 to exclude it from the loss calculation (see the sketch after this list).
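
A rough sketch of that branching logic; build_forward_kwargs, context_len, and the keyword names are illustrative stand-ins, not the exact lmms_eval code:

```python
import torch

IGNORE_INDEX = -100  # labels with this value are excluded from the loss

def build_forward_kwargs(input_ids, labels, context_len, image=None, image_sizes=None):
    # Shape adjustment: ensure both tensors are (batch_size, seq_len).
    if input_ids.dim() == 1:
        input_ids = input_ids.unsqueeze(0)
        labels = labels.unsqueeze(0)

    # Mask the context prefix so it does not contribute to the loss.
    labels = labels.clone()
    labels[:, :context_len] = IGNORE_INDEX

    kwargs = {"input_ids": input_ids, "labels": labels}
    if image is not None:
        # Multimodal task: pass the image tensors through as well.
        kwargs["images"] = image
        kwargs["image_sizes"] = image_sizes
    return kwargs  # text-only tasks get no image keys at all
```

Text-only calls simply omit image, so the model never sees image arguments; multimodal calls carry both images and image_sizes alongside input_ids and labels.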

Changes

  • Refactored the input preprocessing and tensor handling logic to support both text-only and multimodal inputs.
  • Added checks for image and handled each case appropriately.
  • Ensured tensor shapes are consistent across both text and multimodal tasks.

How to Test

  1. Run the evaluation script with text-only tasks and check that the model processes them correctly (no NoneType error).
  2. Run the evaluation script with multimodal tasks (i.e., tasks that include both text and images) and verify that the image input is correctly handled.
  3. Check the shape of input_ids and labels tensors to ensure they are in the correct format for both text-only and multimodal tasks.
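
For step 3, a quick sanity check along these lines can be dropped into the eval loop (assert_eval_shapes is a hypothetical helper for illustration):

```python
import torch

def assert_eval_shapes(input_ids: torch.Tensor, labels: torch.Tensor, context_len: int) -> None:
    # Both tensors should be 2-D and aligned: (batch_size, seq_len).
    assert input_ids.dim() == 2, f"expected (batch_size, seq_len), got {tuple(input_ids.shape)}"
    assert labels.shape == input_ids.shape, "labels must match input_ids"
    # The context prefix should be masked out of the loss calculation.
    assert (labels[:, :context_len] == -100).all(), "context tokens should be -100"
```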

Motivation and Context

The goal of this PR is to ensure that both text-only and multimodal inputs are handled correctly during model evaluation. This change improves the flexibility of the evaluation pipeline and prepares it for different task types.

Checklist

  • Code compiles and runs as expected.
  • All tests pass (if applicable).
  • Code style and formatting are consistent.
  • The PR description is clear and includes sufficient details about the changes made.

pufanyi self-requested a review on January 17, 2025, 02:27
pufanyi (Collaborator) commented on Jan 17, 2025:

Hi!!! Thank you so much for your contribution!!! Can you try to run pre-commit run --all-files so that it can format the code to pass the lint check?

bibisbar (Contributor, Author)

It should work now. Looks like it was accidentally deleted during one of the commits, but I’ve restored it.

pufanyi (Collaborator) commented on Jan 17, 2025:

Looks good! Thank you so much!

Review thread on lmms_eval/models/tinyllava.py (outdated, resolved)
pufanyi merged commit b1fbf55 into EvolvingLMMs-Lab:main on Jan 17, 2025. 1 check passed.