-
Notifications
You must be signed in to change notification settings - Fork 2.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
dataset(aslg_pc12): initial loading script #731
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for adding this one :)
To fix the CI you'll need to add the encodings in the open
calls, and also add the dummy data.
Thanks @lhoestq |
Yes the idea is just to have a few examples to properly test the script and make sure it keeps working in the long run. And FYI there's a command to help you name the dummy data files correctly. More info in the documentation here |
@lhoestq passes all tests |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Awesome thanks !
This contains the only current public part of this corpus.
The rest of the corpus is not yet been made public, but this sample is still being used by researchers.