[METRICS] Various improvements on metrics #466

thomwolf · 2020-08-01T11:03:45Z

Disallow the use of positional arguments to avoid predictions vs references mistakes
Allow to directly feed numpy/pytorch/tensorflow/pandas objects in metrics

lhoestq · 2020-08-12T15:53:04Z

The cast function is now called inside features.encode_example.
I also added encode_batch that was missing.

Moreover I used the cast function in Dataset.map to support torch/tensorflow tensors or numpy arrays inputs.

There are tests for tensors inputs in metrics and in .map

thomwolf

This is really cool and it's really nice to start seeing some test on metrics :)

A few comments

src/nlp/metric.py

thomwolf · 2020-08-12T17:52:03Z

src/nlp/utils/file_utils.py

@@ -89,6 +90,38 @@
 INCOMPLETE_SUFFIX = ".incomplete"


+@contextmanager
+def temp_seed(seed: int, set_pytorch=False, set_tensorflow=False):


Here there is an error with the tensorflow seed setter that I didn't have time to debug before my time off, maybe you can take a look? You can see the error simply by running the colab at https://colab.research.google.com/drive/1I_B1mcX0cOzOskr0rJN8u8xh_0CIken-?usp=sharing and using set_tensorflow=True in its call of temp_seed.

I fixed tensorflow's rng for eager mode.
Not sure how to do it when eager mode is off. I tried a few things but it didn't work

thomwolf

Awesome!

lhoestq

I think it's all good :)
Did you want to add something else @thomwolf ?

thomwolf · 2020-08-17T15:14:54Z

I think we can merge

* disallow positional arguments in metrics methods * allow to use numpy/pytorch/tf/pandas objects in metrics * clean up GLUE dataset and metric doc * better checks to avoid positional arguments * style and quality * temp seed for everyone * fixes * more control * move cast to python in encode_example + add encode_batch * add metrics tests * cast to python objects in map * test cast to python objects in map * style * quality * remove kwargs in add and add_batch * fix download issue * add local + aws signature test on metrics * fix temp_seed for TF + add tests * better test Co-authored-by: Quentin Lhoest <lhoest.q@gmail.com>

thomwolf added 8 commits August 1, 2020 12:53

disallow positional arguments in metrics methods

b607c1b

allow to use numpy/pytorch/tf/pandas objects in metrics

96149ea

clean up GLUE dataset and metric doc

3749989

better checks to avoid positional arguments

4ab9b1f

style and quality

e8f0b7b

temp seed for everyone

9130709

fixes

cc0481d

more control

5d11300

lhoestq force-pushed the master branch from db3f399 to 21e8091 Compare August 3, 2020 17:24

lhoestq added 7 commits August 12, 2020 16:21

Merge branch 'master' into improve-metrics

11b475b

move cast to python in encode_example + add encode_batch

18ba5c6

add metrics tests

5d55d54

cast to python objects in map

3f19baf

test cast to python objects in map

951cfb8

style

303e654

quality

48a4f7f

lhoestq marked this pull request as ready for review August 12, 2020 15:46

thomwolf commented Aug 12, 2020

View reviewed changes

lhoestq added 5 commits August 14, 2020 11:54

remove kwargs in add and add_batch

2a58801

fix download issue

43e1a4c

add local + aws signature test on metrics

b29115d

fix temp_seed for TF + add tests

3daa5a8

better test

e59f778

thomwolf commented Aug 14, 2020

View reviewed changes

lhoestq approved these changes Aug 17, 2020

View reviewed changes

thomwolf merged commit 1840f3f into master Aug 17, 2020

thomwolf deleted the improve-metrics branch August 17, 2020 15:15

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[METRICS] Various improvements on metrics #466

[METRICS] Various improvements on metrics #466

thomwolf commented Aug 1, 2020

lhoestq commented Aug 12, 2020

thomwolf left a comment

thomwolf Aug 12, 2020

lhoestq Aug 14, 2020

thomwolf left a comment

lhoestq left a comment

thomwolf commented Aug 17, 2020

[METRICS] Various improvements on metrics #466

[METRICS] Various improvements on metrics #466

Conversation

thomwolf commented Aug 1, 2020

lhoestq commented Aug 12, 2020

thomwolf left a comment

Choose a reason for hiding this comment

thomwolf Aug 12, 2020

Choose a reason for hiding this comment

lhoestq Aug 14, 2020

Choose a reason for hiding this comment

thomwolf left a comment

Choose a reason for hiding this comment

lhoestq left a comment

Choose a reason for hiding this comment

thomwolf commented Aug 17, 2020