Best (most accurate) trained LSTM models.
Go to file
Stefan Weil e12c65a915 Rename frk -> deu_latf (ISO 639-3, ISO 15924)
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2024-03-09 11:04:42 +01:00
script Move trained data for scripts to new subdirectory 2018-03-10 21:12:04 +01:00
tessconfigs@3decf1c825 Update tessconfigs 2019-10-23 13:32:42 +02:00
.gitmodules Update URL for tessconfigs submodule (use HTTPS) 2019-10-11 13:08:43 +02:00
LICENSE Rename license file 2018-02-02 10:18:00 +01:00
README.md Update README 2020-03-08 17:07:13 -04:00
afr.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
amh.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
ara.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
asm.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
aze.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
aze_cyrl.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
bel.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
ben.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
bod.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
bos.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
bre.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
bul.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
cat.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
ceb.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
ces.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
chi_sim.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
chi_sim_vert.traineddata Fix extra intra-word spacing for Chinese (GitHub issue #991) 2019-05-21 17:48:52 +02:00
chi_tra.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
chi_tra_vert.traineddata Fix extra intra-word spacing for Chinese (GitHub issue #991) 2019-05-21 17:48:52 +02:00
chr.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
configs Add tessconfigs submodule and links for required tessdata files 2019-09-03 16:07:05 +02:00
cos.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
cym.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
dan.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
deu.traineddata deu: Remove unwanted dependency 2018-02-01 15:29:03 +01:00
deu_latf.traineddata Rename frk -> deu_latf (ISO 639-3, ISO 15924) 2024-03-09 11:04:42 +01:00
div.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
dzo.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
ell.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
eng.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
enm.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
epo.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
equ.traineddata Add equ.traineddata (copy from tessdata) 2023-07-24 10:03:38 +02:00
est.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
eus.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
fao.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
fas.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
fil.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
fin.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
fra.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
frm.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
fry.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
gla.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
gle.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
glg.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
grc.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
guj.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
hat.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
heb.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
hin.traineddata Fix config files from Use Tesseract/LSTM combiner to LSTM only 2017-09-15 18:37:50 +05:30
hrv.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
hun.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
hye.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
iku.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
ind.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
isl.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
ita.traineddata ita: Remove ita.config from ita.traineddata 2020-11-30 22:03:13 +01:00
ita_old.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
jav.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
jpn.traineddata Fix extra intra-word spacing for Japanese (GitHub issue #991) 2019-05-21 17:49:35 +02:00
jpn_vert.traineddata Fix extra intra-word spacing for Japanese (GitHub issue #991) 2019-05-21 17:49:35 +02:00
kan.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
kat.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
kat_old.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
kaz.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
khm.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
kir.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
kmr.traineddata correct name kur_ara to kmr - Kurmanji (Latin script) 2018-04-25 22:47:45 +05:30
kor.traineddata Fix config file for Korean, remove `tessedit_load_sublangs chi_tra` 2018-04-09 19:58:26 +05:30
kor_vert.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
lao.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
lat.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
lav.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
lit.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
ltz.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
mal.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
mar.traineddata Fix Config files to LSTM only for nep and mar 2017-09-15 21:22:28 +05:30
mkd.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
mlt.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
mon.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
mri.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
msa.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
mya.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
nep.traineddata Fix Config files to LSTM only for nep and mar 2017-09-15 21:22:28 +05:30
nld.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
nor.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
oci.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
ori.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
osd.traineddata Use legacy Orientation Script Detector (OSD) because that is the only thing that currently works. 2017-09-15 11:44:08 -07:00
pan.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
pdf.ttf Add tessconfigs submodule and links for required tessdata files 2019-09-03 16:07:05 +02:00
pol.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
por.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
pus.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
que.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
ron.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
rus.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
san.traineddata Fix config files from Use Tesseract/LSTM combiner to LSTM only 2017-09-15 18:37:50 +05:30
sin.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
slk.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
slv.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
snd.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
spa.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
spa_old.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
sqi.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
srp.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
srp_latn.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
sun.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
swa.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
swe.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
syr.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
tam.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
tat.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
tel.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
tgk.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
tha.traineddata Fix extra intra-word spacing for Thai (GitHub issue #991) 2019-05-21 17:50:06 +02:00
tir.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
ton.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
tur.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
uig.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
ukr.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
urd.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
uzb.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
uzb_cyrl.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
vie.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
yid.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
yor.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00

README.md

tessdata_best Best (most accurate) trained models

This repository contains the best trained models for the Tesseract Open Source OCR Engine.

These models only work with the LSTM OCR engine of Tesseract 4.

See the Tesseract docs for additional information.

All data in the repository are licensed under the Apache-2.0 License, see file LICENSE.