gen-synth-data / DATASET_README_BASE.md
ignacioct's picture
recommiting all files
8773ff3

A newer version of the Streamlit SDK is available: 1.40.2

Upgrade

Domain Dataset Grower

This dataset was generated by distilabel as a domain specific dataset for the domain of farming. The dataset used this seed data to generate the samples. The seed data was define by a domain expert and the generated data can be reviewed in this Argilla space here: Argilla

If you want to define a domain specific seed dataset for your own domain, you can use the distilabel tool to generate the dataset, and seed your dataset here