Commit graph

44 commits

Author SHA1 Message Date
Stefan Weil 8e30872d8d Add deprecation hint for frk to the documentation
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2024-03-09 11:23:20 +01:00
Stefan Weil 5d934452d7 Create symbolic link for frk.traineddata
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2024-03-09 11:23:20 +01:00
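The symbolic link keeps the old name `frk.traineddata` usable after the rename to `deu_latf`. A minimal sketch of that idea follows. It assumes the link points at `deu_latf.traineddata` in the same directory, which the log does not state. The helper name `add_legacy_alias` and the placeholder file content are hypothetical.

```python
import os
import tempfile

def add_legacy_alias(directory, new_name, old_name):
    """Create old_name as a relative symlink to new_name inside directory."""
    link_path = os.path.join(directory, old_name)
    # A relative target keeps the link valid if the directory is moved.
    os.symlink(new_name, link_path)
    return link_path

with tempfile.TemporaryDirectory() as tmp:
    # Stand-in for the renamed model; the real file holds binary model data.
    with open(os.path.join(tmp, "deu_latf.traineddata"), "wb") as f:
        f.write(b"placeholder")
    # Assumption: the deprecated name resolves to the renamed file.
    link = add_legacy_alias(tmp, "deu_latf.traineddata", "frk.traineddata")
    link_target = os.readlink(link)
    with open(link, "rb") as f:
        link_content = f.read()
```

Reading through the old name returns the content stored under the new name, so tools that still request `frk` would keep working under this assumption.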
Stefan Weil 2a9d6efdca Rename frk -> deu_latf (ISO 639-3, ISO 15924)
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2024-03-09 11:23:20 +01:00
zdenop dd24b9d887
Merge pull request #30 from tfmorris/patch-1
Update README to point to doc site instead of wiki
2023-10-24 06:52:44 +02:00
Tom Morris 3410f075e9
Fix obsolete pointer to wiki 2023-10-23 18:45:03 -04:00
Stefan Weil 65727574df Add equ.traineddata (copy from tessdata)
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-10-30 08:44:48 +01:00
Stefan Weil 355bdbe925 Update tessconfigs
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-10-23 13:31:56 +02:00
Stefan Weil e0100b34fa Update URL for tessconfigs submodule (use HTTPS)
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-10-11 13:08:13 +02:00
zdenop 2b4f5879f4
Merge pull request #24 from stweil/master
Add Apache license file
2019-06-13 20:40:18 +02:00
Stefan Weil 27cfc71a88 Add Apache license file
Use the same license as for the other projects of tesseract-ocr.
Remove the old COPYING file which included a shortened license text.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-06-13 20:30:10 +02:00
zdenop f03782aa24
Merge pull request #23 from stweil/master
Add tessconfigs submodule and links for required tessdata files
2019-05-23 07:45:48 +02:00
Stefan Weil 61e4ac5fa3 Add tessconfigs submodule and links for required tessdata files
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2019-05-22 22:47:58 +02:00
zdenop b893ed3917
Merge pull request #18 from stweil/master
These models don't work with old versions of Tesseract.
2018-10-23 18:19:46 +02:00
Stefan Weil a7cb5a8845 These models don't work with old versions of Tesseract.
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2018-10-23 17:16:45 +02:00
zdenop 7274cfad45
Merge pull request #16 from Shreeshrii/master
correct name kur_ara to kmr - Kurmanji (Latin script)
2018-04-25 19:31:20 +02:00
Shree Devi Kumar a98a17f9e5 correct name kur_ara to kmr - Kurmanji (Latin script) 2018-04-25 22:52:09 +05:30
zdenop f7a4c12372
Merge pull request #13 from Shreeshrii/patch-1
Updated based on Ray's comment
2018-03-20 06:57:36 +01:00
Shreeshrii b3ad50c3a7
Updated based on Ray's comment
https://github.com/tesseract-ocr/tessdata_best/pull/17#issuecomment-374455979
2018-03-20 10:29:25 +05:30
zdenop f102e00ba3
Merge pull request #12 from stweil/script
Move trained data for scripts to new subdirectory
2018-03-10 20:53:16 +01:00
Stefan Weil 9f875fb819 Move trained data for scripts to new subdirectory
This fixes a name conflict for Lao.traineddata and lao.traineddata
which could not be distinguished on case insensitive filesystems
(for example macOS, Windows).

It makes it also easier for users to see which data is for scripts.
Choosing a script works now like this: tesseract -l script/Latin ...

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2018-03-10 10:18:55 +01:00
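The name conflict described in this commit can be reproduced by comparing file names with case ignored. This is a rough sketch, not part of the repository. The function `find_case_collisions` and both file lists are hypothetical, chosen to mirror the `Lao.traineddata` / `lao.traineddata` example from the message.

```python
from collections import defaultdict

def find_case_collisions(filenames):
    """Group names that become identical once case is ignored."""
    groups = defaultdict(list)
    for name in filenames:
        groups[name.casefold()].append(name)
    # Keep only groups in which two or more distinct names collide.
    return [sorted(names) for names in groups.values() if len(set(names)) > 1]

# Hypothetical flat layout, before script models moved to a subdirectory.
flat = ["Lao.traineddata", "lao.traineddata", "Latin.traineddata", "eng.traineddata"]
collisions = find_case_collisions(flat)

# Same files with the script models placed under script/.
split = ["script/Lao.traineddata", "lao.traineddata",
         "script/Latin.traineddata", "eng.traineddata"]
collisions_after = find_case_collisions(split)
```

In the flat layout the two Lao files collide; once the script model lives under `script/`, the paths differ by more than case and the check reports nothing.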
zdenop 0e00fe67ae
Merge pull request #10 from Shreeshrii/master
Add config files to fix auto PSM issue 1273
2018-02-26 16:32:25 +01:00
Shreeshrii 4e7c9ce934
Add config files to fix auto PSM issue 1273
Fixes https://github.com/tesseract-ocr/tesseract/issues/1273 for
Devanagari
Hindi
Nepali
Marathi
2018-02-26 20:21:11 +05:30
zdenop b2832c5054
Merge pull request #9 from Shreeshrii/patch-2
Add sections, better formatting
2018-02-21 09:48:11 +01:00
Shreeshrii 86db1f450b
Update README.md 2018-02-21 14:16:38 +05:30
Shreeshrii c5aa3acbea
Add sections, better formatting 2018-02-21 14:08:40 +05:30
zdenop 8203e55ebb
Merge pull request #6 from Shreeshrii/patch-1
Update README.md for script level traineddata file info
2018-02-21 08:43:09 +01:00
zdenop c5261254cb
Merge pull request #8 from Shreeshrii/Shreeshrii-extraspaces_chi_tra
Fix extra spaces in chi tra
2018-02-21 08:41:02 +01:00
Shreeshrii 719cfd41d4
Fix extra spaces in words for chi_tra
#Fixes https://github.com/tesseract-ocr/tesseract/issues/991
preserve_interword_spaces 1
2018-02-20 22:50:06 +05:30
Shreeshrii 80d92b7db6
Fix extra intra-word spaces by adding config file
with preserve_interword_spaces 1
2018-02-20 20:18:27 +05:30
Shreeshrii 066ce2dc04
Formatting Changes
All added info is from Ray's comments on different issues in different repositories.
2018-01-17 15:49:45 +05:30
Shreeshrii cd93ef77f8
update with info re jpn and Japanese 2018-01-17 15:39:19 +05:30
Shreeshrii c9e5053d48
Update README.md 2018-01-17 15:24:23 +05:30
zdenop 45ed289c6b Merge pull request #3 from Shreeshrii/master
Fix config file for default oem mode, change to --oem 1
2017-09-16 09:03:21 +02:00
Jeff Breidenbach 139ff127aa Use legacy Orientation Script Detector (OSD) because that is the only thing that currently works. 2017-09-15 11:49:11 -07:00
Shreeshrii 999c0449c8 Fix config file for default oem mode, change to --oem 1
Config files had tessedit_ocr_engine_mode	2
causing processing with --oem 3 (default mode based on config file) to fail

Failed loading language 'san'
Tesseract couldn't load any languages!
Could not initialize tesseract.

Only changed the config file, not version.
2017-09-15 18:44:40 +05:30
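The fix in this commit amounts to changing one value in a config file. The sketch below assumes the config is plain text with one `key<whitespace>value` pair per line, as the quoted `tessedit_ocr_engine_mode	2` line suggests. The helper `set_config_value` is hypothetical and is not a Tesseract API.

```python
def set_config_value(config_text, key, value):
    """Return config_text with `key` set to `value`, appending it if absent."""
    out, found = [], False
    for line in config_text.splitlines():
        parts = line.split(None, 1)  # key, then whitespace, then value
        if parts and parts[0] == key:
            out.append(f"{key}\t{value}")
            found = True
        else:
            out.append(line)  # leave unrelated lines untouched
    if not found:
        out.append(f"{key}\t{value}")
    return "\n".join(out) + "\n"

# Config content as quoted in the commit message (mode 2).
before = "tessedit_ocr_engine_mode\t2\n"
# The commit changes the stored mode to 1.
after = set_config_value(before, "tessedit_ocr_engine_mode", 1)
```

Only the matching line is rewritten; any other settings in the file pass through unchanged.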
zdenop 7588b0386b Merge pull request #2 from stweil/master
README: Improve description and add link to Tesseract wiki
2017-09-15 14:35:16 +02:00
Stefan Weil 4888b72756 README: Improve description and add link to Tesseract wiki
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2017-09-15 11:43:44 +02:00
zdenop f7218f89f1 Merge pull request #1 from stweil/master
Improve GitHub integration
2017-09-15 08:16:23 +02:00
Stefan Weil 56fa30165e README: Add text from former COPYRIGHT and add links
Format also the text, so it looks nicer on GitHub.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2017-09-15 07:36:49 +02:00
Stefan Weil 7a0584012c Use the full Apache License text
Now GitHub is able to detect and show the project license.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2017-09-15 07:27:16 +02:00
Jeff Breidenbach 25cb87d4bb add license info 2017-09-14 15:05:21 -07:00
Jeff Breidenbach 923915d4ce Initial import to github (on behalf of Ray) 2017-09-14 14:35:44 -07:00
Jeff Breidenbach 0415860a32 Testing permissions 2017-09-14 14:28:32 -07:00
theraysmith f7ec066362 Initial commit 2017-09-11 18:11:20 +01:00