hmTEAMS

Historical Multilingual TEAMS Models. The following languages are currently covered:

English (British Library Corpus - Books)
German (Europeana Newspaper)
French (Europeana Newspaper)
Finnish (Europeana Newspaper, Digilib)
Swedish (Europeana Newspaper, Digilib)
Dutch (Delpher Corpus)
Norwegian (NCC Corpus)

More details can be found in our GitHub repository.

Leaderboard

We test our pretrained language models on various datasets from HIPE-2020, HIPE-2022 and Europeana. The following table gives an overview of the datasets used.

Language	Datasets
English	AjMC - TopRes19th
German	AjMC - NewsEye - HIPE-2020
French	AjMC - ICDAR-Europeana - LeTemps - NewsEye - HIPE-2020
Finnish	NewsEye
Swedish	NewsEye
Dutch	ICDAR-Europeana

All results can be found in the hmLeaderboard.

Acknowledgements

We thank Luisa März, Katharina Schmid and Erion Çano for their fruitful discussions about Historical Language Models.

Research supported with Cloud TPUs from Google's TPU Research Cloud (TRC). Many thanks for providing access to the TPUs ❤️

models 18

hmteams/teams-base-historic-multilingual-generator

hmteams/teams-base-historic-multilingual-discriminator

hmteams/flair-hipe-2022-newseye-de

hmteams/flair-hipe-2022-hipe2020-fr

hmteams/flair-hipe-2022-hipe2020-de

hmteams/flair-hipe-2022-newseye-sv

hmteams/flair-hipe-2022-newseye-fi

hmteams/flair-hipe-2022-newseye-fr

hmteams/flair-hipe-2022-topres19th-en

hmteams/flair-hipe-2022-letemps-fr
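The Flair model IDs listed above follow a dataset-plus-language naming pattern, so choosing one can be sketched as a small lookup. The snippet below is a minimal sketch, not the authors' documented usage: the helper names `model_id_for` and `tag_text` are hypothetical, and it assumes that these repositories load through Flair's `SequenceTagger.load` and expose their predictions under the `ner` label type.

```python
# Model IDs copied from the listing above, keyed by (dataset, language code).
FLAIR_NER_MODELS = {
    ("newseye", "de"): "hmteams/flair-hipe-2022-newseye-de",
    ("hipe2020", "fr"): "hmteams/flair-hipe-2022-hipe2020-fr",
    ("hipe2020", "de"): "hmteams/flair-hipe-2022-hipe2020-de",
    ("newseye", "sv"): "hmteams/flair-hipe-2022-newseye-sv",
    ("newseye", "fi"): "hmteams/flair-hipe-2022-newseye-fi",
    ("newseye", "fr"): "hmteams/flair-hipe-2022-newseye-fr",
    ("topres19th", "en"): "hmteams/flair-hipe-2022-topres19th-en",
    ("letemps", "fr"): "hmteams/flair-hipe-2022-letemps-fr",
}


def model_id_for(dataset: str, language: str) -> str:
    """Return the listed model ID for a dataset/language pair (hypothetical helper)."""
    key = (dataset.lower(), language.lower())
    if key not in FLAIR_NER_MODELS:
        raise KeyError(f"No listed model for dataset={dataset!r}, language={language!r}")
    return FLAIR_NER_MODELS[key]


def tag_text(text: str, dataset: str, language: str):
    """Tag a text with the matching model (hypothetical helper).

    Assumption: the repository is loadable with SequenceTagger.load and the
    label type is "ner". Requires `pip install flair` and network access.
    """
    # Imported lazily so the lookup above works without Flair installed.
    from flair.data import Sentence
    from flair.models import SequenceTagger

    tagger = SequenceTagger.load(model_id_for(dataset, language))
    sentence = Sentence(text)
    tagger.predict(sentence)
    # Collect (entity text, predicted label) pairs from the tagged sentence.
    return [(span.text, span.get_label("ner").value) for span in sentence.get_spans("ner")]
```

Calling `tag_text("Berlin ist die Hauptstadt von Deutschland.", "newseye", "de")` would download the model on first use. The lookup covers only the IDs visible in the listing above.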

datasets 1

hmteams/vocab-corpus
