
T5_pretraining_finetuning

This project explores the T5 model from Hugging Face (https://huggingface.co/docs/transformers/model_doc/t5): T5's masked language modeling (MLM) pre-training is continued on verbalized ConceptNet triplets, and the resulting model is then fine-tuned for commonsense question answering.

ConceptNet

Download the ConceptNet assertions dump (/~https://github.com/commonsense/conceptnet5/wiki/Downloads) and save it in the "data" folder.

Pre-processing steps (a sketch of these steps follows the list):

  1. Filter out all non-English assertions
  2. Randomly select 5 triplets for each subject
  3. Randomly mask 2 words in each verbalized sentence
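
A minimal sketch of the pre-processing, assuming the tab-separated conceptnet-assertions-*.csv.gz dump (where English nodes start with /c/en/) and a deliberately naive verbalizer; the file path, relation phrasing, and masking details are illustrative, not the exact scripts in this repo:

```python
import gzip
import random
from collections import defaultdict

def load_english_triplets(path):
    """Keep assertions whose start and end nodes are both English."""
    by_subject = defaultdict(list)
    with gzip.open(path, "rt", encoding="utf-8") as f:
        for line in f:
            # Each row: assertion URI, relation, start node, end node, metadata JSON
            _uri, rel, start, end, _meta = line.rstrip("\n").split("\t")
            if start.startswith("/c/en/") and end.startswith("/c/en/"):
                by_subject[start].append((start, rel, end))
    return by_subject

def verbalize(triplet):
    """Turn (/c/en/dog, /r/IsA, /c/en/animal) into rough natural text."""
    start, rel, end = triplet
    subj = start.split("/")[3].replace("_", " ")
    obj = end.split("/")[3].replace("_", " ")
    return f"{subj} {rel.split('/')[-1]} {obj}"  # e.g. "dog IsA animal"

def mask_words(sentence, n_masks=2):
    """Replace n random words with T5 sentinel tokens; return
    (corrupted input, target) in T5's span-corruption format."""
    words = sentence.split()
    idxs = sorted(random.sample(range(len(words)), min(n_masks, len(words))))
    target = []
    for i, idx in enumerate(idxs):
        target.append(f"<extra_id_{i}> {words[idx]}")
        words[idx] = f"<extra_id_{i}>"
    target.append(f"<extra_id_{len(idxs)}>")
    return " ".join(words), " ".join(target)

by_subject = load_english_triplets("data/conceptnet-assertions-5.7.0.csv.gz")
examples = []
for subject, triplets in by_subject.items():
    for t in random.sample(triplets, min(5, len(triplets))):
        examples.append(mask_words(verbalize(t)))
```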

Pre-training

Pre-training follows the Hugging Face T5 training example (https://huggingface.co/docs/transformers/model_doc/t5#training): Masked Language Modeling (MLM) is continued on the pre-processed ConceptNet triplets.
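
Sketched below is one way to run that continued MLM step over the (input, target) pairs produced above, assuming the t5-small checkpoint, an unbatched loop, and an illustrative learning rate and output path; a real run would batch, pad, and shuffle:

```python
from torch.optim import AdamW
from transformers import T5ForConditionalGeneration, T5Tokenizer

tokenizer = T5Tokenizer.from_pretrained("t5-small")
model = T5ForConditionalGeneration.from_pretrained("t5-small")
optimizer = AdamW(model.parameters(), lr=1e-4)

model.train()
for source, target in examples:  # pairs from the pre-processing step
    # T5 computes the cross-entropy loss itself when labels are passed in
    input_ids = tokenizer(source, return_tensors="pt").input_ids
    labels = tokenizer(target, return_tensors="pt").input_ids
    loss = model(input_ids=input_ids, labels=labels).loss
    loss.backward()
    optimizer.step()
    optimizer.zero_grad()

model.save_pretrained("models/t5-conceptnet")
tokenizer.save_pretrained("models/t5-conceptnet")
```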

Fine-tuning

In the end, the T5 model is fine-tuned on the TellMeWhy dataset (https://stonybrooknlp.github.io/tellmewhy/) for the question-answering task.
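
A minimal fine-tuning sketch in the same text-to-text style, assuming the dataset is available on the Hugging Face Hub as "StonyBrookNLP/tellmewhy" with narrative/question/answer fields (the dataset id, field names, prompt format, and learning rate are all assumptions to adjust against the actual release):

```python
from datasets import load_dataset
from torch.optim import AdamW
from transformers import T5ForConditionalGeneration, T5Tokenizer

dataset = load_dataset("StonyBrookNLP/tellmewhy", split="train")

# Continue from the checkpoint saved after ConceptNet pre-training
tokenizer = T5Tokenizer.from_pretrained("models/t5-conceptnet")
model = T5ForConditionalGeneration.from_pretrained("models/t5-conceptnet")
optimizer = AdamW(model.parameters(), lr=3e-5)

model.train()
for sample in dataset:
    # Frame Q&A as text-to-text: the "why" question plus its story as input,
    # the free-form answer as the target
    source = f"question: {sample['question']} context: {sample['narrative']}"
    input_ids = tokenizer(source, return_tensors="pt", truncation=True).input_ids
    labels = tokenizer(sample["answer"], return_tensors="pt").input_ids
    loss = model(input_ids=input_ids, labels=labels).loss
    loss.backward()
    optimizer.step()
    optimizer.zero_grad()
```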
