bug in gpt2 notebook (in tensorflow) #13332
Comments
summoning the masters @LysandreJik @sgugger @Rocketknight1 💯
Hey! There are a couple of issues here. The first is that we're trying to move away from TFTrainer towards Keras - there'll be a new version of that notebook coming soon, like I promised! In the meantime, your approach should work, though. The error you're getting is because |
Thanks @Rocketknight1 ! Actually I was getting the same error even when I was using a |
The good news is I'm moving to working on those TF notebooks right now, so hopefully I'll have a proper example to show you soon. However, the official launch of the new notebooks might depend on the PR at huggingface/datasets#2731 being accepted and making it to release, since I'm planning to use that new method in a lot of them. Still, I'll make sure to ping you as soon as I have a LM example ready - just be aware that you might have to install a pre-release version of |
got it. happy to try out the beta version of them at my risk and peril ;-)
This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread. Please note that issues that do not follow the contributing guidelines are likely to be ignored.
Same question at year 2023 for |
Hello there!
I tried to use the language-modeling-from-scratch notebook https://colab.research.google.com/github/huggingface/notebooks/blob/master/examples/language_modeling_from_scratch.ipynb#scrollTo=JEA1ju653l-p
More specifically, I need to run it using tensorflow. With the simple strategy of using the TF versions of the huggingface functions, everything seems to work correctly until I reach the trainer step, and then I get a mysterious cardinality issue.
This looks like a bug... Can you please have a look at the code below?
What do you think?
Thanks!
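The code from the original post is not reproduced in this thread. For readers unfamiliar with the term, here is a minimal sketch of what "cardinality" means for a tf.data dataset. It is an illustration only and is not confirmed as the cause of the error reported here. The generator `toy_examples` and the size of 3 are hypothetical stand-ins, not taken from the notebook.

```python
import tensorflow as tf


def toy_examples():
    # Hypothetical stand-in for tokenized training examples.
    for i in range(3):
        yield i


# A generator-backed dataset cannot report how many elements it holds,
# so its cardinality is the "unknown" sentinel (a negative value).
dataset = tf.data.Dataset.from_generator(
    toy_examples,
    output_signature=tf.TensorSpec(shape=(), dtype=tf.int32),
)
unknown = int(dataset.cardinality().numpy())

# Asserting the size makes the cardinality visible to downstream code
# that needs to know the number of examples up front.
sized = dataset.apply(tf.data.experimental.assert_cardinality(3))
known = int(sized.cardinality().numpy())

print("before:", unknown, "after:", known)
```

If a training loop needs the number of examples in advance, a dataset with unknown cardinality may be rejected, and asserting the size as above is one possible workaround.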