PleIAs

company

AI & ML interests

Open Science LLMs

Organization Card


PleIAs is a private French AI lab training the next generation of language models for document processing.

PleIAs is committed to open science and has coordinated the release of some of the largest open corpora for pre-training.

For more information, visit our website: https://pleias.fr/

Contact us: [email protected]

Collections 5

spaces 5

Vintage OCR Corrector (GPU)

Vintage OCR Corrector (CPU)

Finance Commons Explorer

Reversed-Zotero

Pleias-Editor

models 13

PleIAs/Pleias-Rag

Updated 6 days ago • 110 • 2

PleIAs/journaux-lm-v1

Updated 16 days ago • 36 • 2

PleIAs/OCRonos-Vintage-CT2

Updated 17 days ago • 5

PleIAs/celadon

Text Classification • Updated 18 days ago • 217 • 15

PleIAs/Cassandre-RAG

Updated Oct 18 • 131 • 6

PleIAs/Segmentext

Token Classification • Updated Aug 30 • 127 • 12

PleIAs/Florence-PDF

Updated Aug 25 • 53 • 2

PleIAs/OCRonos-Vintage

Text Generation • Updated Aug 8 • 1.3k • 73

PleIAs/OCRonos

Text Generation • Updated Jul 18 • 632 • 56

PleIAs/OCRerrcr

Token Classification • Updated Jul 18 • 41 • 9

datasets 43

PleIAs/post-ocr

Viewer • Updated about 13 hours ago • 618k • 2.72k • 4

PleIAs/common_corpus

Viewer • Updated 6 days ago • 397M • 31.2k • 143

PleIAs/new-tokenized-annealing

Updated 11 days ago • 299

PleIAs/statistics_compiled

Viewer • Updated 17 days ago • 809M • 15

PleIAs/ToxicCommons

Viewer • Updated 18 days ago • 1.96M • 56 • 6

PleIAs/Openalex-Metadata

Viewer • Updated Aug 6 • 11.7M • 11

PleIAs/Persian-PD

Viewer • Updated Aug 3 • 1.38k • 22

PleIAs/Arabic-PD

Viewer • Updated Aug 3 • 1.82k • 14

PleIAs/Bengali-PD

Viewer • Updated Aug 3 • 3.23k • 17

PleIAs/Urdu-PD

Viewer • Updated Aug 3 • 2.28k • 19