EYFP mutant fitness prediction
Collection
We trained fitness prediction models using 30K private EYFP mutant fitness data. The fine-tuned model achieves 0.95 spearman's ρ on the test set.
•
4 items
•
Updated
Base model: westlake-repl/SaProt_650M_AF2
Task type: protein-level regression
Dataset: This dataset contains single-site and double-site mutants derived from the wild type EYFP protein. The number of samples for training, validation and test is 26168, 3087 and 3088. All single-site mutants and 80% of double-site mutants for training, 10% of double-site mutants for validation and test respectively. This model was trained by Jia Zheng's lab at Westlake University. The dataset will be released later by this team.
Model input type: Amino acid sequence
Performance (on test set): 0.95 Spearman's ρ
LoRA config:
Training config: