File size: 1,298 Bytes
ed1b843
 
9394cd9
 
ed1b843
 
20cda11
 
 
 
 
 
003fc50
 
 
 
 
 
20cda11
ed1b843
 
 
20cda11
 
 
7511acc
568abf2
7511acc
568abf2
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
---
license: mit
tags:
- biology
---

Pre-trained single-cell genomics models based on:
- [BarlowTwins](https://proceedings.mlr.press/v139/zbontar21a/zbontar21a.pdf)
- [Bootstrap Your Own Latent](https://papers.nips.cc/paper/2020/file/f3ada80d5c4ee70142b17b8192b2958e-Paper.pdf)
- [Masked Autoencoder](https://openaccess.thecvf.com/content/CVPR2022/papers/He_Masked_Autoencoders_Are_Scalable_Vision_Learners_CVPR_2022_paper.pdf)
- Gene-Program Masked Autoencoder

Finetuned models for the downstream tasks of:
- Cell Type Prediction
- Gene Expression Reconstruction
- Cross-Modality Prediction (RNA->Proteomics)
- Data Integration

Training details and adaptations to single-cell data in our project can be found in our paper below. To use the model directly, the same genes must be used in the same order as in the `var.parquet` file. Otherwise, follow the instructions from the repositories below to train a model for custom datasets. 

If you find our work useful, please cite the following paper:

[**Delineating the Effective Use of Self-Supervised Learning in Single-Cell Genomics**](https://doi.org/10.1101/2024.02.16.580624)

See also:

[Repository of the full analysis](https://github.com/theislab/ssl_in_scg)

[Lean repository for minimal pre-training](https://github.com/theislab/sc_mae)