Best (most accurate) trained LSTM models.
Find a file
2021-09-01 14:48:52 +05:30
script Move trained data for scripts to new subdirectory 2018-03-10 21:12:04 +01:00
tessconfigs@3decf1c825 Update tessconfigs 2019-10-23 13:32:42 +02:00
.DS_Store Improved Tamil and Sinhala traineddata 2021-09-01 14:48:52 +05:30
.gitmodules Update URL for tessconfigs submodule (use HTTPS) 2019-10-11 13:08:43 +02:00
afr.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
amh.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
ara.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
asm.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
aze.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
aze_cyrl.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
bel.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
ben.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
bod.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
bos.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
bre.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
bul.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
cat.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
ceb.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
ces.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
chi_sim.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
chi_sim_vert.traineddata Fix extra intra-word spacing for Chinese (GitHub issue #991) 2019-05-21 17:48:52 +02:00
chi_tra.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
chi_tra_vert.traineddata Fix extra intra-word spacing for Chinese (GitHub issue #991) 2019-05-21 17:48:52 +02:00
chr.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
configs Add tessconfigs submodule and links for required tessdata files 2019-09-03 16:07:05 +02:00
cos.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
cym.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
dan.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
deu.traineddata deu: Remove unwanted dependency 2018-02-01 15:29:03 +01:00
div.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
dzo.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
ell.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
eng.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
enm.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
epo.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
est.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
eus.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
fao.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
fas.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
fil.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
fin.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
fra.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
frk.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
frm.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
fry.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
gla.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
gle.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
glg.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
grc.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
guj.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
hat.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
heb.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
hin.traineddata Fix config files from Use Tesseract/LSTM combiner to LSTM only 2017-09-15 18:37:50 +05:30
hrv.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
hun.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
hye.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
iku.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
ind.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
isl.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
ita.traineddata ita: Remove ita.config from ita.traineddata 2020-11-30 22:03:13 +01:00
ita_old.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
jav.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
jpn.traineddata Fix extra intra-word spacing for Japanese (GitHub issue #991) 2019-05-21 17:49:35 +02:00
jpn_vert.traineddata Fix extra intra-word spacing for Japanese (GitHub issue #991) 2019-05-21 17:49:35 +02:00
kan.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
kat.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
kat_old.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
kaz.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
khm.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
kir.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
kmr.traineddata correct name kur_ara to kmr - Kurmanji (Latin script) 2018-04-25 22:47:45 +05:30
kor.traineddata Fix config file for Korean, remove tessedit_load_sublangs chi_tra 2018-04-09 19:58:26 +05:30
kor_vert.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
lao.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
lat.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
lav.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
LICENSE Rename license file 2018-02-02 10:18:00 +01:00
lit.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
ltz.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
mal.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
mar.traineddata Fix Config files to LSTM only for nep and mar 2017-09-15 21:22:28 +05:30
mkd.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
mlt.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
mon.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
mri.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
msa.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
mya.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
nep.traineddata Fix Config files to LSTM only for nep and mar 2017-09-15 21:22:28 +05:30
nld.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
nor.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
oci.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
ori.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
osd.traineddata Use legacy Orientation Script Detector (OSD) because that is the only thing that currently works. 2017-09-15 11:44:08 -07:00
pan.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
pdf.ttf Add tessconfigs submodule and links for required tessdata files 2019-09-03 16:07:05 +02:00
pol.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
por.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
pus.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
que.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
README.md Update README 2020-03-08 17:07:13 -04:00
ron.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
rus.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
san.traineddata Fix config files from Use Tesseract/LSTM combiner to LSTM only 2017-09-15 18:37:50 +05:30
sin.traineddata Improved Tamil and Sinhala traineddata 2021-09-01 14:48:52 +05:30
slk.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
slv.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
snd.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
spa.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
spa_old.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
sqi.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
srp.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
srp_latn.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
sun.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
swa.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
swe.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
syr.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
tam.traineddata Improved Tamil and Sinhala traineddata 2021-09-01 14:48:52 +05:30
tat.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
tel.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
tgk.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
tha.traineddata Fix extra intra-word spacing for Thai (GitHub issue #991) 2019-05-21 17:50:06 +02:00
tir.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
ton.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
tur.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
uig.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
ukr.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
urd.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
uzb.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
uzb_cyrl.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
vie.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
yid.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
yor.traineddata Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00

tessdata_best Best (most accurate) trained models

This repository contains the best trained models for the Tesseract Open Source OCR Engine.

These models only work with the LSTM OCR engine of Tesseract 4.

See the Tesseract docs for additional information.

All data in the repository are licensed under the Apache-2.0 License, see file LICENSE.