Remove wrong retriever top_1 metrics from print_eval_report
#2510
Conversation
Looks good to me! 👍 Before merging, could you please have another look at test_eval.py to check whether there are other tests where TransformersReader can be excluded?
```diff
@@ -785,6 +785,7 @@ def test_extractive_qa_eval_wrong_examples(reader, retriever_with_docs):

 @pytest.mark.parametrize("retriever_with_docs", ["tfidf"], indirect=True)
 @pytest.mark.parametrize("document_store_with_docs", ["memory"], indirect=True)
+@pytest.mark.parametrize("reader", ["farm"], indirect=True)
```
This excludes the TransformersReader from this test, which is a good change in my opinion. I was just surprised to find it here. While you are at it, please also check the other tests in test_eval.py. For example, I see that the same change could be made to test_extractive_qa_eval_translation.
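If it helps, applying the same restriction there would look roughly like this (the decorator values are assumed to mirror the hunk above; the test body is elided):

```python
import pytest

@pytest.mark.parametrize("retriever_with_docs", ["tfidf"], indirect=True)
@pytest.mark.parametrize("document_store_with_docs", ["memory"], indirect=True)
@pytest.mark.parametrize("reader", ["farm"], indirect=True)
def test_extractive_qa_eval_translation(reader, retriever_with_docs):
    ...
```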
done.
LGTM! 👍
Currently, top_1 metrics are shown for the Retriever too, although they were calculated with simulated_top_k_reader=1. This is confusing and plain wrong. We should not show top_1 Retriever metrics at all.

Proposed changes:
- remove the top_1 Retriever metrics from print_eval_report
- skip printing wrong examples when n_wrong_examples is 0
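A minimal sketch of the intended behavior (apart from print_eval_report, n_wrong_examples, and simulated_top_k_reader, all names and the result structure here are assumptions, not the actual Haystack implementation):

```python
def print_eval_report(node_metrics: dict, n_wrong_examples: int = 3) -> None:
    """Sketch: print per-node metrics, hiding top_1 metrics for retriever
    nodes and omitting the wrong-examples section when it would be empty."""
    for node_name, metrics in node_metrics.items():       # assumed structure
        is_retriever = "retriever" in node_name.lower()   # assumed heuristic
        for metric_name, value in metrics.items():
            # top_1 metrics stem from simulated_top_k_reader=1, which is
            # meaningless for a retriever, so they are not printed there.
            if is_retriever and metric_name.startswith("top_1"):
                continue
            print(f"{node_name} | {metric_name}: {value:.4f}")
    if n_wrong_examples > 0:
        ...  # print the worst n_wrong_examples queries; skipped entirely at 0
```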
@Timoeller @neo-alex