EvaluationSetClient for deepset cloud to fetch evaluation sets and la… #2345

FHardow · 2022-03-22T14:22:54Z

…bels for one specific evaluation set

Proposed changes:
This PR adds an EvaluationSetClient class that handles communication with deepset Cloud, so that the labels of an evaluation set uploaded to deepset Cloud can be fetched.
New functionality:

list all labels for a given index name
fetch the number of labels for a given index
fetch index names from deepset cloud

Status (please check what you already did):

First draft (up for discussions & feedback)
Final code
Added tests
Updated documentation

closes #1985

tstadel

I can't find the tests either, so please recheck for them.
The code already looks pretty good, with a few places where it can be improved.

FHardow

Tests magically appeared again 🎉

tstadel · 2022-03-25T09:02:03Z

haystack/utils/deepsetcloud.py

+        for response in self.client.get_with_auto_paging(
+            url=evaluation_set_url, query_params={"name": evaluation_set_name}
+        ):
+            return response.json().get("data", [])


That doesn't seem to fit together. get_with_auto_paging returns a generator of the objects within the "data" property.

Ahh true, let me change that :)
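
To illustrate the mismatch, here is a minimal sketch. The `get_with_auto_paging` below is a hypothetical in-memory stand-in, not the haystack implementation; it only assumes what the review comment states, namely that the helper yields the objects found under each page's "data" property. `FAKE_PAGES`, the `has_more` flag, `fetch_buggy`, and `fetch_fixed` are illustrative names.

```python
from typing import Any, Dict, Generator, List

# Hypothetical in-memory pages standing in for paged HTTP responses.
FAKE_PAGES: List[Dict[str, Any]] = [
    {"data": [{"name": "set-a"}, {"name": "set-b"}], "has_more": True},
    {"data": [{"name": "set-c"}], "has_more": False},
]


def get_with_auto_paging(pages: List[Dict[str, Any]]) -> Generator[Dict[str, Any], None, None]:
    """Stand-in paging helper: yields every object found under "data"."""
    for page in pages:
        yield from page.get("data", [])
        if not page.get("has_more"):
            break


def fetch_buggy(pages: List[Dict[str, Any]]) -> Any:
    # Mirrors the reviewed snippet: each yielded item is treated as an HTTP
    # response, but it is already a parsed object, so .json() does not exist.
    for response in get_with_auto_paging(pages):
        return response.json().get("data", [])


def fetch_fixed(pages: List[Dict[str, Any]]) -> List[Dict[str, Any]]:
    # The generator already yields the parsed objects, so just collect them.
    return list(get_with_auto_paging(pages))


print([item["name"] for item in fetch_fixed(FAKE_PAGES)])
```

Even if the yielded items did have a `.json()` method, the `return` inside the loop would stop after the first item, so later pages would never be read.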

tstadel · 2022-03-25T09:03:11Z

haystack/utils/deepsetcloud.py

@@ -5,6 +5,8 @@
 import time
 from typing import Any, Dict, Generator, List, Optional, Tuple, Union

+from haystack import Label, Document, Answer


I think this import introduces a cyclic dependency causing all tests to fail. Could you please try from haystack.schema import ...?
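
The general mechanism behind such a cycle can be reproduced with a throwaway package. This is a sketch under assumptions: the packages `demo_cyclic` and `demo_fixed` are hypothetical, and the import order in their `__init__.py` is chosen to trigger the problem; it is not taken from haystack's actual `__init__.py`.

```python
import importlib
import sys
import tempfile
from pathlib import Path


def make_package(root: Path, name: str, utils_import: str) -> None:
    """Write a tiny package whose __init__ imports utils before schema."""
    pkg = root / name
    pkg.mkdir()
    (pkg / "__init__.py").write_text(
        f"from {name}.utils import helper\n"
        f"from {name}.schema import Label\n"
    )
    (pkg / "schema.py").write_text("class Label:\n    pass\n")
    (pkg / "utils.py").write_text(
        f"{utils_import}\n\n\ndef helper():\n    return Label\n"
    )


def try_import(name: str) -> str:
    """Return "ok" if the package imports cleanly, else "ImportError"."""
    try:
        importlib.import_module(name)
        return "ok"
    except ImportError:
        return "ImportError"


root = Path(tempfile.mkdtemp())
# utils imports from the package root, which is only partially initialized.
make_package(root, "demo_cyclic", "from demo_cyclic import Label")
# utils imports from the submodule that defines the class.
make_package(root, "demo_fixed", "from demo_fixed.schema import Label")
sys.path.insert(0, str(root))
importlib.invalidate_caches()

cyclic_result = try_import("demo_cyclic")
fixed_result = try_import("demo_fixed")
print(cyclic_result, fixed_result)
```

Importing from the defining submodule works because the submodule can be loaded on its own while the package root is still executing its `__init__.py`.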

tstadel

Sorry for my confusion, but let's use label_index in DeepsetCloudDocumentStore again. Besides that, the imports in the deepsetcloud module seem to cause a cyclic dependency, and we should return all the info about the data sets, not only the names.

ArzelaAscoIi

Nice ! :) Let's add the requested changes from @tstadel and merge it :)

tstadel

Some more things need to be improved: get_labels and get_label_count need to match the signature of the base class's abstract method. Additionally, label_index should work as in any other DocumentStore and be an init param. And I think the URL in the responses definition of test_DeepsetCloudDocumentStore_fetches_lables_for_evaluation_set is not correct.

tstadel · 2022-03-25T14:12:25Z

haystack/document_stores/deepsetcloud.py

    def get_all_labels(
        self,
-        index: Optional[str] = None,
+        label_index: Optional[str] = None,


I think this needs to be index to match the abstractmethod's signature. Besides that, I think we should accept label_index as an init-param too and take that if no index is being passed to these methods. This is actually how all the other DocumentStores work.

I've switched it to use the index parameter from the init function, see line 68.
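
The pattern suggested in the review comment can be sketched as follows. `BaseStore`, `CloudStore`, `_resolve_index`, and the in-memory label data are hypothetical stand-ins for illustration; they are not the haystack classes, and the merged code may differ.

```python
from abc import ABC, abstractmethod
from typing import Dict, List, Optional


class BaseStore(ABC):
    """Hypothetical stand-in for a document store base class."""

    @abstractmethod
    def get_all_labels(self, index: Optional[str] = None) -> List[str]:
        ...

    @abstractmethod
    def get_label_count(self, index: Optional[str] = None) -> int:
        ...


class CloudStore(BaseStore):
    def __init__(self, label_index: Optional[str] = None):
        # Default used whenever a method is called without an explicit index.
        self.label_index = label_index
        # Illustrative in-memory data instead of a remote service.
        self._labels: Dict[str, List[str]] = {
            "default-set": ["label-1", "label-2"],
            "other-set": ["label-3"],
        }

    def _resolve_index(self, index: Optional[str]) -> str:
        resolved = index or self.label_index
        if resolved is None:
            raise ValueError("No index passed and no label_index configured.")
        return resolved

    # Parameter is named `index` so the override matches the abstract method.
    def get_all_labels(self, index: Optional[str] = None) -> List[str]:
        return list(self._labels.get(self._resolve_index(index), []))

    def get_label_count(self, index: Optional[str] = None) -> int:
        return len(self.get_all_labels(index))


store = CloudStore(label_index="default-set")
print(store.get_all_labels(), store.get_label_count(index="other-set"))
```

Keeping the parameter name `index` means callers that work against the base class can pass `index=...` as a keyword to any store, while `label_index` only sets the per-instance default.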

tstadel

Just some minor changes left to make it ready to merge.

tstadel

LGTM! Found some minor docstrings that we should improve. Besides that: ready to merge!

ArzelaAscoIi

Looks good ! 🚀 Nice PR!

FHardow added 2 commits March 22, 2022 15:11

EvaluationSetClient for deepset cloud to fetch evaluation sets and la…

d0106db

…bels for one specific evaluation set

make DeepsetCloudDocumentStore able to fetch uploaded evaluation set …

382c2fb

…names

FHardow requested review from ArzelaAscoIi and tstadel March 22, 2022 14:22

FHardow added the type:feature and topic:document_store labels Mar 22, 2022

FHardow added 3 commits March 22, 2022 15:35

fix missing renaming of get_evaluation_set_names in DeepsetCloudDocum…

45fcf05

…entStore

update documentation for evaluation set functionality in deepset clou…

1534666

…d document store

DeepsetCloudDocumentStore tests for evaluation set functionality

3f559f0

tstadel requested changes Mar 22, 2022

FHardow commented Mar 23, 2022

FHardow added 3 commits March 23, 2022 09:27

rename index to evaluation_set_name for DeepsetCloudDocumentStore eva…

21de609

…luation set functionality

raise DeepsetCloudError when no labels were found for evaluation set

7168807

make use of .get_with_auto_paging in EvaluationSetClient

8e68bff

tstadel reviewed Mar 25, 2022

tstadel requested changes Mar 25, 2022

ArzelaAscoIi reviewed Mar 25, 2022

FHardow and others added 6 commits March 25, 2022 14:12

Return result of get_with_auto_paging() as it parses the response alr…

dc16792

…eady

Make schema import source more specific

2d5219b

fetch all evaluation sets for a workspace in deepset Cloud

9229110

Rename evaluation_set_name to label_index

164b8ec

make use of generator functionality for fetching labels

ab22631

Update Documentation & Code Style

68d7fbb

FHardow requested review from tstadel and ArzelaAscoIi March 25, 2022 13:33

FHardow marked this pull request as ready for review March 25, 2022 13:36

tstadel requested changes Mar 25, 2022

FHardow added 3 commits March 25, 2022 16:04

Adjust function input for DeepsetCloudDocumentStore.get_all_labels, a…

fdefaa5

…djust tests for it, fix typos, make linter happy

Merge branch 'feature/fetch-evaluation-set-from-dc' of github.com:dee…

4487b53

…pset-ai/haystack into feature/fetch-evaluation-set-from-dc

Match error message with pytest.raises

56bdb6c

github-actions bot and others added 10 commits March 25, 2022 15:12

Update Documentation & Code Style

23bee8d

DeepsetCloudDocumentStore.get_labels_count raises DeepsetCloudError w…

e2154b2

…hen no evaluation set was found to count labels on

Merge branch 'feature/fetch-evaluation-set-from-dc' of github.com:dee…

ab138a3

…pset-ai/haystack into feature/fetch-evaluation-set-from-dc

remove unneeded import in tests

6b6b8bf

DeepsetCloudDocumentStore tests, make reponse bodies a string through…

9604991

… json.dumps

Merge branch 'master' of github.com:deepset-ai/haystack into feature/…

86ee3f2

…fetch-evaluation-set-from-dc

DeepsetcloudDocumentStore.get_label_count - move raise to return

f3b03bc

stringify uuid before json.dump as uuid is not serilizable

2e7059c

DeepsetcloudDocumentStore - adjust response mocking in tests

c75d5e6

DeepsetcloudDocumentStore - json dump response body in test

5f3e4bb

FHardow requested a review from tstadel March 29, 2022 14:56

FHardow and others added 5 commits March 30, 2022 09:40

DeepsetCloudDocumentStore introduce label_index, EvaluationSetClient …

32b901a

…rename label_index to evaluation_set

Update Documentation & Code Style

2553129

DeepsetCloudDocumentStore rename evaluation_set to evaluation_set_res…

eb1054d

…ponse as there is a name clash with the input variable

Merge branch 'feature/fetch-evaluation-set-from-dc' of github.com:dee…

805b9ca

…pset-ai/haystack into feature/fetch-evaluation-set-from-dc

DeepsetCloudDocumentStore - rename missed variable in test

f539c7c

tstadel requested changes Mar 30, 2022

FHardow and others added 2 commits March 30, 2022 16:03

DeepsetCloudDocumentStore - rename missed label_index to index in doc…

a8b33aa

… string, rename label_index to evaluation_set in EvaluationSetClient

Update Documentation & Code Style

a93e7f3

tstadel approved these changes Mar 30, 2022

FHardow added 2 commits March 30, 2022 17:02

DeepsetCloudDocumentStore - update docstrings for EvaluationSetClient

10eff9c

Merge branch 'feature/fetch-evaluation-set-from-dc' of github.com:dee…

6638585

…pset-ai/haystack into feature/fetch-evaluation-set-from-dc

tstadel reviewed Mar 30, 2022

DeepsetCloudDocumentStore - fix typo in doc string

709ac21

ArzelaAscoIi approved these changes Mar 30, 2022

FHardow merged commit a273c3a into master Mar 31, 2022

FHardow deleted the feature/fetch-evaluation-set-from-dc branch March 31, 2022 06:59
