Image-to-Text
HTRflow
Swedish

Example finetuning?

#1
by patrikwm - opened

Hi,

Could you provide an example on how to fine-tune this model?
Is it enough to have the correct text for fine-tuning, or do i need Image + text?

I am trying to transcribe 1700th century court records from northern Sweden/Finland.
The hardest part for the model seems to be all the different names.

Do you have any pre trained model that only has been trained on 1600-1800?

best regards,
Patrik

AI Riksarkivet / AIRA org

Hi!

Yes, here are two links you could follow on how to could train a such model:

You will need image+text paris:
image.png

Here is an example:

image.png

So if you are going to use the SATRN architecture from MMOCR (which is very good out of domain) you can use our model as our base.

You need to modify the config file to our or the plain one from mmocr:
image.png

Altough, you dont need to change any of the hyperparameters if you start with our config file. But perhaps the learning rate can be adjusted if you are only going to fine tune on a smaller dataset.

TrOCR is much more simplier to train just follow the link i sent :)

Thanks, great!

I will try this. But it will probably take some time to get a good amount of image+text pairs.

This comment has been hidden

Sign up or log in to comment