Minor fixes so PretrainedTransformerIndexer works with roberta #3203

matt-gardner · 2019-08-26T22:59:38Z

You currently need to manually install pytorch-transformers from master, because there is no released version that includes roberta in the AutoTokenizer and such. But this makes the indexer compatible with roberta.

nelson-liu · 2019-08-27T02:12:46Z

allennlp/data/token_indexers/pretrained_transformer_indexer.py

@@ -51,6 +51,8 @@ def __init__(self,
        self.tokenizer = AutoTokenizer.from_pretrained(model_name, do_lower_case=do_lowercase)


So I'm trying this PR out, and in the case where model name isn't loadable from AutoTokenizer.from_pretrained, it returns None. Maybe consider catching that and throwing a more informative error message here? (instead, it errors when you try to use self.tokenizer to get the id of the token.

SGTM, PR welcome.

…ai#3203)

Minor fixes so PretrainedTransformerIndexer works with roberta

0b244fe

matt-gardner mentioned this pull request Aug 26, 2019

Configuration of using RoBERTa #3161

Closed

matt-gardner requested a review from joelgrus August 26, 2019 23:32

joelgrus approved these changes Aug 27, 2019

View reviewed changes

matt-gardner merged commit 993034f into allenai:master Aug 27, 2019

matt-gardner deleted the roberta branch August 27, 2019 00:21

nelson-liu reviewed Aug 27, 2019

View reviewed changes

reiyw pushed a commit to reiyw/allennlp that referenced this pull request Nov 12, 2019

Minor fixes so PretrainedTransformerIndexer works with roberta (allen…

9baf151

…ai#3203)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Minor fixes so PretrainedTransformerIndexer works with roberta #3203

Minor fixes so PretrainedTransformerIndexer works with roberta #3203

matt-gardner commented Aug 26, 2019

nelson-liu Aug 27, 2019

matt-gardner Aug 27, 2019

		@@ -51,6 +51,8 @@ def __init__(self,
		self.tokenizer = AutoTokenizer.from_pretrained(model_name, do_lower_case=do_lowercase)

Minor fixes so PretrainedTransformerIndexer works with roberta #3203

Minor fixes so PretrainedTransformerIndexer works with roberta #3203

Conversation

matt-gardner commented Aug 26, 2019

nelson-liu Aug 27, 2019

Choose a reason for hiding this comment

matt-gardner Aug 27, 2019

Choose a reason for hiding this comment