eduardosoares99 committed
Commit: 4c21438
Parent(s): d20b340
Update README.md
README.md CHANGED
@@ -22,6 +22,8 @@ GitHub: [GitHub Link](https://github.com/IBM/materials/tree/main)
 
 # SMILES-based Transformer Encoder-Decoder (SMI-TED)
 
+![ted-smi](smi-ted.png)
+
 This repository provides PyTorch source code associated with our publication, "A Large Encoder-Decoder Family of Foundation Models for Chemical Language".
 
 Paper: [Arxiv Link](https://github.com/IBM/materials/blob/main/smi-ted/paper/smi_ted_preprint.pdf)
@@ -33,8 +35,6 @@ We provide the model weights in two formats:
 
 For more information contact: [email protected] or [email protected].
 
-![ted-smi](smi-ted.png)
-
 ## Introduction
 
 We present a large encoder-decoder chemical foundation model, SMILES-based Transformer Encoder-Decoder (SMI-TED), pre-trained on a curated dataset of 91 million SMILES samples sourced from PubChem, equivalent to 4 billion molecular tokens. SMI-TED supports various complex tasks, including quantum property prediction, with two main variants (289M and 8X289M). Our experiments across multiple benchmark datasets demonstrate state-of-the-art performance for various tasks. For more information contact: [email protected] or [email protected].