[Datasets] Improve Covost 2 #3281
Cool thanks !
I removed some leftover dummy data files. I also tried to remove the file you added in the .hypothesis
directory, but for some reason GitHub doesn't allow me to remove it in the browser.
Feel free to delete this file and merge :)
I am trying to use
Steps I have followed:
1. untar:
2. load data:
0 rows are loading as shown below:
Can you please provide a working code example to load the dataset?
Hi! I think it only works with the subsets of Common Voice Corpus 4, not Common Voice Corpus 1.
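For reference, a minimal sketch of what loading could look like, not a confirmed recipe. It assumes the extracted Common Voice language folder contains a `validated.tsv` file and a `clips` directory, and that the dataset is loaded as `covost2` with a language-pair config such as `en_de` and the folder passed as `data_dir`. The helper names `find_missing_cv_entries` and `load_covost2` are hypothetical and only illustrate the idea of checking the folder before loading.

```python
import os


def find_missing_cv_entries(data_dir):
    """Return the expected entries that are absent from data_dir.

    Assumption: an extracted Common Voice language folder holds
    a `validated.tsv` file and a `clips` directory.
    """
    expected = ["validated.tsv", "clips"]
    return [
        name for name in expected
        if not os.path.exists(os.path.join(data_dir, name))
    ]


def load_covost2(data_dir, config="en_de"):
    """Check the folder layout, then load the dataset from it.

    Assumption: `config` is a "<source>_<target>" language pair and
    `data_dir` points at the extracted folder for the source language.
    """
    missing = find_missing_cv_entries(data_dir)
    if missing:
        raise FileNotFoundError(
            f"{data_dir} is missing: {', '.join(missing)}"
        )
    # Imported lazily so the layout check works without the library installed.
    from datasets import load_dataset

    return load_dataset("covost2", config, data_dir=data_dir)
```

Note that this check only looks at the folder layout. It would not catch a folder extracted from a different Common Voice release, which may be what leads to 0 rows being loaded.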
The manual data download instructions for Covost are currently quite confusing and not very user-friendly.
Currently the user has to:
.tar
downloaded file)
This PR improves this to:
Note: This PR is not at all time-critical