Model Card for t5_small Summarization Model

Model Details

Dataset: The model was fine-tuned on the CNN/DailyMail dataset.
Dataset Details:
- Contains over 287,000 news articles paired with human-written summaries.
- Content spans global news, current events, and articles on a wide range of topics.
Preprocessing:
- Text normalization and tokenization were performed using the T5 tokenizer.
- Input articles were truncated or padded to a maximum length of 512 tokens.
- Summaries were truncated or padded to a maximum length of 150 tokens.
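
The preprocessing steps above can be sketched roughly as follows. This is a minimal illustration, not the exact training script: the "summarize: " prefix, the replacement of padding tokens with -100 in the labels, and the function name `preprocess` are assumptions that the card does not state. The column names "article" and "highlights" are assumed to follow the Hugging Face release of CNN/DailyMail.

```python
MAX_INPUT_LEN = 512   # maximum article length, from the card
MAX_TARGET_LEN = 150  # maximum summary length, from the card


def preprocess(batch, tokenizer, prefix="summarize: "):
    """Tokenize a batch of articles and summaries to fixed lengths.

    `batch` is a dict of lists with "article" and "highlights" keys
    (assumed column names). `tokenizer` is any Hugging Face-style
    tokenizer, e.g. the T5 tokenizer. `prefix` is an assumption.
    """
    inputs = [prefix + article for article in batch["article"]]

    # Truncate or pad every article to exactly MAX_INPUT_LEN tokens.
    model_inputs = tokenizer(
        inputs,
        max_length=MAX_INPUT_LEN,
        truncation=True,
        padding="max_length",
    )

    # Truncate or pad every summary to exactly MAX_TARGET_LEN tokens.
    targets = tokenizer(
        batch["highlights"],
        max_length=MAX_TARGET_LEN,
        truncation=True,
        padding="max_length",
    )

    # Assumption: mask padding positions with -100 so the loss ignores them.
    pad_id = tokenizer.pad_token_id
    model_inputs["labels"] = [
        [tok if tok != pad_id else -100 for tok in seq]
        for seq in targets["input_ids"]
    ]
    return model_inputs


# Hypothetical usage (requires the `transformers` package; not run here):
#   from transformers import AutoTokenizer
#   tok = AutoTokenizer.from_pretrained("t5-small")
#   enc = preprocess({"article": ["..."], "highlights": ["..."]}, tok)
```

Padding to a fixed length keeps every example the same shape, at the cost of extra computation on short articles; dynamic padding per batch is a common alternative.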

Hyperparameters:
- Batch Size: 4
- Learning Rate: 2e-5
- Number of Epochs: 5
- Gradient Accumulation Steps: 4
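
To make the interaction between batch size and gradient accumulation concrete, the sketch below works out the effective batch size and a rough count of optimizer updates. The keys follow the Hugging Face TrainingArguments parameter names, on the assumption (not stated in the card) that the model was trained with that library. The single-device setup and the rounded example count of 287,000 are also assumptions.

```python
import math

# Values from the card, keyed by Hugging Face TrainingArguments names.
training_kwargs = {
    "per_device_train_batch_size": 4,
    "learning_rate": 2e-5,
    "num_train_epochs": 5,
    "gradient_accumulation_steps": 4,
}

NUM_TRAIN_EXAMPLES = 287_000  # assumption: rounded from "over 287,000"
NUM_DEVICES = 1               # assumption: hardware is not stated in the card

# Gradients from 4 mini-batches of 4 examples are summed before each update.
effective_batch_size = (
    training_kwargs["per_device_train_batch_size"]
    * training_kwargs["gradient_accumulation_steps"]
    * NUM_DEVICES
)

# Rough estimate; a real trainer may round the last partial batch differently.
updates_per_epoch = math.ceil(NUM_TRAIN_EXAMPLES / effective_batch_size)
total_updates = updates_per_epoch * training_kwargs["num_train_epochs"]

print(effective_batch_size)  # 16
print(updates_per_epoch)     # 17938
print(total_updates)         # 89690
```

Under these assumptions each optimizer update sees 16 examples, even though only 4 are held in memory at a time.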

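Usage:
from transformers import T5Tokenizer, T5ForConditionalGeneration

# Load the fine-tuned tokenizer and model; replace 'path_to_your_model' with your checkpoint path.
tokenizer = T5Tokenizer.from_pretrained('path_to_your_model')
model = T5ForConditionalGeneration.from_pretrained('path_to_your_model')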