Compare commits

...

12 commits
4.0.0 ... main

Author SHA1 Message Date
Stefan Weil e12c65a915 Rename frk -> deu_latf (ISO 639-3, ISO 15924)
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2024-03-09 11:04:42 +01:00
Stefan Weil fa8481f199 Add equ.traineddata (copy from tessdata)
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2023-07-24 10:03:38 +02:00
Stefan Weil e2aad9b983 ita: Remove ita.config from ita.traineddata
It added a user_words_suffix which should be reserved for
user configurations.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2020-11-30 22:03:13 +01:00
zdenop 9e8aeef07c
Merge pull request #47 from SherSpock/patch-2
Update README
2020-03-09 08:28:45 +01:00
Ryder Timberlake d288680f57
Update README
Replace unsupported wiki link with equivalent hosted doc link
2020-03-08 17:07:13 -04:00
Stefan Weil c5e0a7294a Update tessconfigs
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-10-23 13:32:42 +02:00
Stefan Weil e4173f4456 Update URL for tessconfigs submodule (use HTTPS)
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-10-11 13:08:43 +02:00
Stefan Weil 41e829655f Add tessconfigs submodule and links for required tessdata files
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-09-03 16:07:05 +02:00
zdenop e9f15884bc
Merge pull request #37 from stweil/master
Fix extra intra-word spacing for several Asian languages (GitHub issue #991)
2019-05-22 12:15:06 +02:00
Stefan Weil ea00692e71 Fix extra intra-word spacing for Thai (GitHub issue #991)
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-05-21 17:50:06 +02:00
Stefan Weil 80b4d76313 Fix extra intra-word spacing for Japanese (GitHub issue #991)
Fix also the encoding of tessedit_char_blacklist.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-05-21 17:49:35 +02:00
Stefan Weil 5075f27776 Fix extra intra-word spacing for Chinese (GitHub issue #991)
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-05-21 17:48:52 +02:00
13 changed files with 7 additions and 1 deletions

3
.gitmodules vendored Normal file
View file

@ -0,0 +1,3 @@
[submodule "tessconfigs"]
path = tessconfigs
url = https://github.com/tesseract-ocr/tessconfigs.git

View file

@ -5,7 +5,7 @@ This repository contains the best trained models for the
These models only work with the LSTM OCR engine of Tesseract 4.
See the [Tesseract wiki](https://github.com/tesseract-ocr/tesseract/wiki/Data-Files)
See the [Tesseract docs](https://tesseract-ocr.github.io/tessdoc/Data-Files.html)
for additional information.
All data in the repository are licensed under the

Binary file not shown.

Binary file not shown.

1
configs Symbolic link
View file

@ -0,0 +1 @@
tessconfigs/configs

BIN
equ.traineddata Normal file

Binary file not shown.

Binary file not shown.

Binary file not shown.

Binary file not shown.

1
pdf.ttf Symbolic link
View file

@ -0,0 +1 @@
tessconfigs/pdf.ttf

1
tessconfigs Submodule

@ -0,0 +1 @@
Subproject commit 3decf1c8252ba6dbeef0bf908f4b0aab7f18d113

Binary file not shown.