skypro1111 committed
Commit eb01b22
1 Parent(s): 9df5160

Update README.md

Files changed (1): README.md (+3 −2)
README.md CHANGED
@@ -6,13 +6,14 @@ tags: []
 # Model Card for mbart-large-50-verbalization
 
 ## Model Description
-`mbart-large-50-verbalization` is a fine-tuned version of the `mbart-large-50` model, specifically designed for the task of verbalizing Ukrainian text to prepare it for Text-to-Speech (TTS) systems. This model aims to transform structured data like numbers and dates into their fully expanded textual representations in Ukrainian.
+`mbart-large-50-verbalization` is a fine-tuned version of the [facebook/mbart-large-50](https://huggingface.co/facebook/mbart-large-50) model, specifically designed for the task of verbalizing Ukrainian text to prepare it for Text-to-Speech (TTS) systems. This model aims to transform structured data like numbers and dates into their fully expanded textual representations in Ukrainian.
 
 ## Architecture
-This model is based on the `mbart-large-50` architecture, renowned for its effectiveness in translation and text generation tasks across numerous languages.
+This model is based on the [facebook/mbart-large-50](https://huggingface.co/facebook/mbart-large-50) architecture, renowned for its effectiveness in translation and text generation tasks across numerous languages.
 
 ## Training Data
 The model was fine-tuned on a subset of 96,780 sentences from the Ubertext dataset, focusing on news content. The verbalized equivalents were created using Google Gemini Pro, providing a rich basis for learning text transformation tasks.
+Dataset [skypro1111/ubertext-2-news-verbalized](https://huggingface.co/datasets/skypro1111/ubertext-2-news-verbalized)
 
 ## Training Procedure
 The model underwent nearly 70,000 training steps, amounting to almost 2 epochs, to ensure thorough learning from the training dataset.
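
For context, a minimal inference sketch for the verbalization task described in this model card, using 🤗 Transformers. The repository id `skypro1111/mbart-large-50-verbalization`, the `uk_UA` mBART-50 language code, and the generation settings are assumptions, not details stated in this commit:

```python
import torch
from transformers import AutoTokenizer, MBartForConditionalGeneration

# Assumed repository id; the commit only names the model "mbart-large-50-verbalization".
model_name = "skypro1111/mbart-large-50-verbalization"
device = "cuda" if torch.cuda.is_available() else "cpu"

tokenizer = AutoTokenizer.from_pretrained(model_name)  # standard mBART-50 tokenizer assumed
model = MBartForConditionalGeneration.from_pretrained(model_name).to(device).eval()

# Ukrainian in and out; "uk_UA" is mBART-50's Ukrainian code (assumed for this fine-tune).
tokenizer.src_lang = "uk_UA"

text = "Зустріч запланована на 15.03.2024 о 10:30."  # date and time the model should spell out
inputs = tokenizer(text, return_tensors="pt").to(device)

with torch.no_grad():
    generated = model.generate(
        **inputs,
        forced_bos_token_id=tokenizer.lang_code_to_id["uk_UA"],  # decode into Ukrainian
        max_length=1024,
        num_beams=5,
    )
print(tokenizer.decode(generated[0], skip_special_tokens=True))
```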
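Similarly, a quick way to inspect the training dataset linked in this commit with 🤗 Datasets; the `train` split name and column layout are assumptions:

```python
from datasets import load_dataset

# Dataset id comes from the link added in this commit;
# the split name and column layout are assumptions.
ds = load_dataset("skypro1111/ubertext-2-news-verbalized", split="train")
print(ds)     # features and row count (~96,780 sentences per the model card)
print(ds[0])  # one source/verbalized example pair
```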