fix: document retrieval metrics for non-document_id document_relevance_criteria #3885
Conversation
@tstadel please rebase to
```diff
@@ -1690,6 +1690,7 @@ def _build_eval_dataframe(
     df_docs.map_rows = partial(df_docs.apply, axis=1)
     df_docs.rename(columns={"id": "document_id", "content": "context"}, inplace=True)
     df_docs["gold_document_ids"] = [gold_document_ids] * len(df_docs)
+    df_docs["gold_answers"] = [gold_answers] * len(df_docs)
```
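For orientation, here is a minimal, self-contained sketch of what this broadcasting does. The data and variable values are hypothetical, not taken from the Haystack test suite:

```python
import pandas as pd

# Hypothetical retrieval result and labels for a single query.
retrieved = [
    {"id": "d1", "content": "Paris is the capital of France."},
    {"id": "d2", "content": "Berlin is the capital of Germany."},
]
gold_document_ids = ["d1"]
gold_answers = ["Paris"]

df_docs = pd.DataFrame(retrieved)
df_docs.rename(columns={"id": "document_id", "content": "context"}, inplace=True)
# Every row carries the full per-query label lists, so each retrieved
# document can later be checked against all gold ids/answers.
df_docs["gold_document_ids"] = [gold_document_ids] * len(df_docs)
df_docs["gold_answers"] = [gold_answers] * len(df_docs)
```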
Added for easier analysis, e.g. if document_relevance_criterion="answer".
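As a hypothetical illustration of that kind of analysis (the matches_gold_answer column is made up, building on the sketch above, and is not part of the actual eval dataframe schema):

```python
# With gold_answers present on every row, a quick relevance check under the
# "answer" criterion becomes a one-liner over the dataframe.
df_docs["matches_gold_answer"] = df_docs.apply(
    lambda row: any(answer in row["context"] for answer in row["gold_answers"]),
    axis=1,
)
print(df_docs[["document_id", "matches_gold_answer"]])
```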
Just to clarify: this won't be a problem if there are no gold_answers in the eval set? For example, in the case where we use context for the relevance criterion, gold_answers may not be available.
Exactly. I've added tests for this case.
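The actual tests live in the PR; as a rough, self-contained sketch of the property being covered (all names hypothetical, not the Haystack API), a pytest-style check that context-based matching works without gold answers could look like:

```python
# Hypothetical sketch: relevance matching must not require gold_answers
# when the criterion is "context".
def is_relevant(doc_context, gold_contexts, gold_answers, criterion):
    if criterion == "context":
        return any(gold in doc_context for gold in gold_contexts)
    if criterion == "answer":
        return any(answer in doc_context for answer in gold_answers)
    raise ValueError(f"unknown criterion: {criterion}")


def test_context_criterion_without_gold_answers():
    assert is_relevant(
        doc_context="Berlin is the capital of Germany.",
        gold_contexts=["Berlin is the capital"],
        gold_answers=[],  # no gold answers in the eval set
        criterion="context",
    )
```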
Thanks @tstadel this looks good to me! I don't fully understand how the code is structured, but I appreciate that the tests for the metrics look correct now.
I agree, the current code could be better structured. However, since nothing else uses these parts besides calculate_metrics, refactoring it now wouldn't bring much value. Let's refactor in another PR, probably connecting it to a feature to support custom metrics.
fix: document retrieval metrics for non-document_id document_relevance_criteria (#3885)
* fix document retrieval metrics for all document_relevance_criteria
* fix tests
* fix eval_batch metrics
* small refactorings
* evaluate metrics on label level
* document retrieval tests added
* fix pylint
* fix test
* support file retrieval
* add comment about threshold
* rename test
Related Issues
Proposed Changes:
- fix document retrieval metrics when document_relevance_criteria does not contain document_id
- fix document retrieval metrics when document_relevance_criteria does not contain document_id and there are multiple labels (see the sketch below)
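A rough sketch (hypothetical function, not the Haystack implementation) of why multiple labels matter here: a retrieved document should count as relevant if it satisfies the criterion for any of the query's labels, not only the first one:

```python
# Hypothetical: single-hit recall where relevance is decided per criterion
# across ALL labels of a query.
def recall_single_hit(retrieved_contexts, labels, criterion="answer"):
    def matches(context, label):
        if criterion == "answer":
            return label["answer"] in context
        if criterion == "context":
            return label["context"] in context
        raise ValueError(f"unknown criterion: {criterion}")

    return float(
        any(matches(c, label) for c in retrieved_contexts for label in labels)
    )


labels = [
    {"answer": "Paris", "context": "Paris is the capital of France."},
    {"answer": "Berlin", "context": "Berlin is the capital of Germany."},
]
# Relevant via the second label only; must still count as a hit.
print(recall_single_hit(["Berlin is the capital of Germany."], labels))  # 1.0
```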
How did you test it?
Notes for the reviewer
Checklist
- The PR title follows the conventional commit format, using one of the prefixes: fix:, feat:, build:, chore:, ci:, docs:, style:, refactor:, perf:, test:.