-
Notifications
You must be signed in to change notification settings - Fork 10
Taken Down in the Dynabench for Large Track #90
Comments
Hi, I see that you uploaded a new model in the meantime that seems to work correctly is that correct ? Here are the log for your older model.
|
Hi, after I fixed the environment setup, I successfully submit the two models "https://dynabench.org/models/121" (Base M2M: Both have scores on devtest and dev sets.) and "https://dynabench.org/models/123" (Large M2M: Only has BLEU points on the devtest set). But I find that "https://dynabench.org/models/123" shows "taken down" but has BLEU points on devtest set and failed on the dev set. Could you please further help me see the detailed log to solve the problem? Is there a time limit, which caused the failure on the dev set? Or maybe Out Of Memory caused the failure? |
hmm it seems you are pushing a bit the limit of the system, we may need to revise some constraints. I'll increase the timeout in the meantime, but we will need to redeploy the evaluation servers and I don't have access to it. So it won't happen before tomorrow. In the meantime can you try an intermediary batch size ? 96 ? |
@gwenzek Hi, I try to set the batchsize=96 but also find "Taken Down (https://dynabench.org/models/134)". Could you please help check the detailed log why the task failed? Could you please tell us the detailed time limit such as 20 min or 40 min, which ensures that the inference time of our model will not exceed the limited time? Thanks very much! |
I did not see a model with batch size of 96, only 64 and 128. And it seems that your last model with batchsize of 128 succeeded. |
The timeout increase is live, you can try re-upload your model. Thanks for your patience. |
Thanks very much @gwenzek ! I have successfully upload the model. Moreover, I want to enable the same translation direction in one minibatch (Minibatch=64) (Such as en->fr in one batch Not en->fr cs->de in one batch). Would you mind giving some suggestions help us realize this goal? |
so the samples arrive grouped by language IIRC, first all the en->de then all the en->fr, ... |
Thanks very much! I also tried submitting the model to the large track (https://dynabench.org/models/157). But it shows "Taken Down". Could please help me check the reason? What is the detailed time limit of the large track (nearly 10000 translation directions)? |
sorry, the large track is currently not working. I should have a fix for next week. |
Hi, I think I think I've fixed the issues with large track. The large track was hitting several limits on Dynabench design, and I had to push a few walls :-) I think your two models for the large track have started their evaluation, I will keep you updated. |
Your model failed, it seems that your model is missing language "zt".
|
@gwenzek , thanks for your efforts! We have trying uploading base and large models for FLORES-FULL track (https://www.dynabench.org/models/312 and https://www.dynabench.org/models/315). We find that these two jobs have shown "taken down" condition. Could you please help provide the detailed log? The deadline will come, we haven't uploaded a model to the full track successfully. Thanks very much! |
Hi,
I passed the local and integrated tests following the model submission workflow on GitHub and submitted my model. But I notice that "Your model t1 has been successfully deployed. You can find and publish the model at https://dynabench.org/models/119. (python handler.py, dynalab-cli test --local, dynalab-cli test -n all successfully passed the test on our local server)
"。 But the status shows that "Taken Down". Would you mind sending the detailed log information to me for debugging our code? Thanks very much !
The text was updated successfully, but these errors were encountered: