Adding adversarialQA dataset #1714

maxbartolo · 2021-01-08T21:46:09Z

Adding the adversarialQA dataset (https://adversarialqa.github.io/) from Beat the AI (https://arxiv.org/abs/2002.00293)

thomwolf · 2021-01-08T22:08:08Z

Oh that's a really cool one, we'll review/merge it soon!

In the meantime, do you have any specific positive/negative feedback on the process of adding a dataset, Max?
Did you follow the instructions in the detailed step-by-step guide?

maxbartolo · 2021-01-09T00:04:50Z

Thanks Thom, been a while, hope all is well!

Yes, I followed the step by step instructions and found them pretty straightforward. The only things I wasn't sure of were what should go into the YAML tags field for the dataset card, and whether there was a list of options somewhere (maybe akin to the metrics?) of the possible supported tasks. I found the rest very intuitive and the automated metadata and dummy data generation very handy. Thanks!

thomwolf · 2021-01-09T09:35:28Z

Good point! pinging @yjernite here so he can improve this part!

yjernite · 2021-01-11T14:01:46Z

@maxbartolo cool addition!

For the YAML tag, you should use the tagging app we provide to choose from a drop-down menu:
https://github.com/huggingface/datasets-tagging

The process is described toward the end of the step-by-step guide. Do you have any suggestions for making it easier to find?

Otherwise, the dataset card is really cool, thanks for making it so complete!
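For readers who have not seen the YAML tags discussed above, here is a minimal sketch of the kind of header the tagging app produces at the top of a dataset card. The field names follow the tag header used in dataset cards around the time of this PR; the values are assumed examples for illustration, not necessarily the tags that were committed here, and `render_yaml_header` is a hypothetical helper rather than part of the `datasets` library.

```python
# Illustrative only: the values below are assumed examples,
# not necessarily the tags committed in this pull request.
EXAMPLE_TAGS = {
    "annotations_creators": ["crowdsourced"],
    "languages": ["en"],
    "multilinguality": ["monolingual"],
    "task_categories": ["question-answering"],
    "task_ids": ["extractive-qa"],
}


def render_yaml_header(tags):
    """Render a dict of tag lists as a YAML front-matter block (hypothetical helper)."""
    lines = ["---"]
    for key, values in tags.items():
        lines.append(f"{key}:")
        # Each tag value becomes one item of a YAML block sequence.
        lines.extend(f"- {value}" for value in values)
    lines.append("---")
    return "\n".join(lines)


print(render_yaml_header(EXAMPLE_TAGS))
```

The sketch does no quoting or escaping, so it only suits simple tag values; in practice the tagging app generates this block for you.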

maxbartolo · 2021-01-11T14:45:50Z

@yjernite

Thanks, YAML tags added. I think my main issue was with the flow of the step-by-step guide. For example, the card creator is introduced in Step 4, right after creating an empty directory for your dataset. The first fields it requires are the YAML tags, which (at least for me) came at the very end of the process.

I'd suggest having the guide structured in the same order as the creation process. For me it was something like:

Step 1: Preparing your env
Step 2: Write the loading/processing code
Step 3: Automatically generate dummy data and dataset_infos.json
Step 4: Tag the dataset
Step 5: Write the dataset card using the card creator
Step 6: Open a Pull Request on the main HuggingFace repo and share your work!!

Thanks again!
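As a rough illustration of Step 2 in the list above (writing the loading/processing code), the core of such a script is a generator that turns raw files into examples. The sketch below assumes the data is SQuAD v1.1-style JSON; that layout, the function name `generate_examples`, and the sample record are assumptions for illustration, not code taken from this PR.

```python
import json


def generate_examples(squad_json):
    """Yield (key, example) pairs from a SQuAD-style JSON string (assumed layout)."""
    data = json.loads(squad_json)
    for article in data["data"]:
        title = article.get("title", "")
        for paragraph in article["paragraphs"]:
            context = paragraph["context"]
            for qa in paragraph["qas"]:
                # Collect all answers for a question into parallel lists.
                yield qa["id"], {
                    "id": qa["id"],
                    "title": title,
                    "context": context,
                    "question": qa["question"],
                    "answers": {
                        "text": [a["text"] for a in qa["answers"]],
                        "answer_start": [a["answer_start"] for a in qa["answers"]],
                    },
                }


# Tiny hand-written sample, not real dataset content.
SAMPLE = json.dumps({
    "data": [{
        "title": "France",
        "paragraphs": [{
            "context": "Paris is the capital of France.",
            "qas": [{
                "id": "q1",
                "question": "What is the capital of France?",
                "answers": [{"text": "Paris", "answer_start": 0}],
            }],
        }],
    }]
})

for key, example in generate_examples(SAMPLE):
    print(key, example["question"], example["answers"]["text"])
```

A real loading script would wrap a generator like this in a dataset builder class and read from the downloaded files rather than from an in-memory string.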

lhoestq

Looks all good to me !
Thank you for adding it :) and good dataset card as well

I just did one minor change in the dummy_data.zip files: I removed unused files to make them lighter.

* Adding adversarialQA dataset
* Added YAML tags
* reduce dummy_data.zip files sizes

Co-authored-by: Quentin Lhoest <lhoest.q@gmail.com>

Adding adversarialQA dataset

b5abd5b

maxbartolo added 2 commits January 11, 2021 15:21

Added YAML tags

2e2fd8d

Merge remote-tracking branch 'upstream/master' into add_dataset/adversarial_qa

93df98b

reduce dummy_data.zip files sizes

b3aba20

lhoestq approved these changes Jan 13, 2021

lhoestq merged commit 0d4a686 into huggingface:master Jan 13, 2021

eusip pushed a commit to eusip/datasets that referenced this pull request Jan 21, 2021

Adding adversarialQA dataset (huggingface#1714)

e46ad30
