This script evaluates the multitask pre-trained checkpoint for t5-base (see the T5 paper) on the CNN/Daily Mail test dataset. Please note that the results in the paper were attained with a model fine-tuned on summarization, so the results here will be lower by approx. 0.5 ROUGE points.
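For orientation, the sketch below shows roughly how t5-base produces a summary through the transformers API. It is a simplified illustration, not the exact logic of evaluate_cnn.py; the generation parameters (num_beams, max_length) are assumptions, not the script's settings.

from transformers import T5ForConditionalGeneration, T5Tokenizer

tokenizer = T5Tokenizer.from_pretrained("t5-base")
model = T5ForConditionalGeneration.from_pretrained("t5-base")

article = "..."  # one CNN/Daily Mail article

# T5 is a text-to-text model, so the task is selected with a text prefix.
input_ids = tokenizer.encode(
    "summarize: " + article, return_tensors="pt", max_length=512, truncation=True
)
summary_ids = model.generate(
    input_ids, num_beams=4, max_length=142, early_stopping=True
)
print(tokenizer.decode(summary_ids[0], skip_special_tokens=True))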

Get the CNN Data

First, you need to download the CNN data. It is about 400 MB and can be downloaded by running:

python download_cnn_daily_mail.py cnn_articles_input_data.txt cnn_articles_reference_summaries.txt

You should confirm that each file has 11490 lines:

wc -l cnn_articles_input_data.txt # should print 11490
wc -l cnn_articles_reference_summaries.txt # should print 11490
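
If you prefer not to use the provided script, roughly equivalent files can be produced with the Hugging Face datasets library. This is a hedged alternative, not what download_cnn_daily_mail.py actually does; it assumes the standard cnn_dailymail 3.0.0 schema with "article" and "highlights" fields.

from datasets import load_dataset

# Assumes the cnn_dailymail 3.0.0 test split (11490 examples).
test = load_dataset("cnn_dailymail", "3.0.0", split="test")

with open("cnn_articles_input_data.txt", "w") as src, \
     open("cnn_articles_reference_summaries.txt", "w") as ref:
    for example in test:
        # One article / one reference summary per line.
        src.write(example["article"].replace("\n", " ") + "\n")
        ref.write(example["highlights"].replace("\n", " ") + "\n")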

Generating Summaries

To create summaries for each article in the dataset, run:

python evaluate_cnn.py cnn_articles_input_data.txt cnn_generated_articles_summaries.txt cnn_articles_reference_summaries.txt rouge_score.txt

The default batch size of 8 fits in 16 GB of GPU memory but may need to be adjusted for your system. The ROUGE scores rouge1, rouge2, and rougeL are computed automatically and saved to rouge_score.txt.
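The scoring step corresponds roughly to the sketch below, written against the rouge_score package; the exact aggregation in evaluate_cnn.py may differ, and the averaging of F1 values here is an assumption.

from rouge_score import rouge_scorer

# Score each generated summary against its reference and average the F1 values.
scorer = rouge_scorer.RougeScorer(["rouge1", "rouge2", "rougeL"], use_stemmer=True)

with open("cnn_generated_articles_summaries.txt") as hyp_f, \
     open("cnn_articles_reference_summaries.txt") as ref_f:
    hyps, refs = hyp_f.readlines(), ref_f.readlines()

totals = {k: 0.0 for k in ["rouge1", "rouge2", "rougeL"]}
for hyp, ref in zip(hyps, refs):
    scores = scorer.score(ref, hyp)
    for k in totals:
        totals[k] += scores[k].fmeasure

with open("rouge_score.txt", "w") as out:
    for k, total in totals.items():
        out.write(f"{k}: {total / len(hyps):.4f}\n")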

Finetuning

Pass model_type=t5 and a model name to examples/summarization/bart/finetune.py.
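
finetune.py handles the full training setup; the core idea, sketched below as a single plain PyTorch gradient step (a simplified illustration, not the script itself), is to feed the prefixed article as input and the reference summary as labels. The learning rate and length limits are illustrative assumptions.

import torch
from transformers import T5ForConditionalGeneration, T5Tokenizer

tokenizer = T5Tokenizer.from_pretrained("t5-base")
model = T5ForConditionalGeneration.from_pretrained("t5-base")
optimizer = torch.optim.AdamW(model.parameters(), lr=3e-5)

article = "..."    # a training article
reference = "..."  # its reference summary

inputs = tokenizer("summarize: " + article, return_tensors="pt",
                   max_length=512, truncation=True)
labels = tokenizer(reference, return_tensors="pt",
                   max_length=142, truncation=True).input_ids

# One gradient step: the model computes the cross-entropy loss internally
# when labels are provided.
loss = model(input_ids=inputs.input_ids,
             attention_mask=inputs.attention_mask,
             labels=labels).loss
loss.backward()
optimizer.step()
optimizer.zero_grad()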