Commit graph

27 commits

Author SHA1 Message Date
Shreeshrii
b3ad50c3a7
Updated based on Ray's comment
https://github.com/tesseract-ocr/tessdata_best/pull/17#issuecomment-374455979
2018-03-20 10:29:25 +05:30
zdenop
f102e00ba3
Merge pull request #12 from stweil/script
Move trained data for scripts to new subdirectory
2018-03-10 20:53:16 +01:00
Stefan Weil
9f875fb819 Move trained data for scripts to new subdirectory
This fixes a name conflict for Lao.traineddata and lao.traineddata
which could not be distinguished on case insensitive filesystems
(for example macOS, Windows).

It makes it also easier for users to see which data is for scripts.
Choosing a script works now like this: tesseract -l script/Latin ...

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2018-03-10 10:18:55 +01:00
zdenop
0e00fe67ae
Merge pull request #10 from Shreeshrii/master
Add config files to fix auto PSM issue 1273
2018-02-26 16:32:25 +01:00
Shreeshrii
4e7c9ce934
Add config files to fix auto PSM issue 1273
Fixes https://github.com/tesseract-ocr/tesseract/issues/1273 for
Devanagari
Hindi
Nepali 
Marathi
2018-02-26 20:21:11 +05:30
zdenop
b2832c5054
Merge pull request #9 from Shreeshrii/patch-2
Add sections, better formatting
2018-02-21 09:48:11 +01:00
Shreeshrii
86db1f450b
Update README.md 2018-02-21 14:16:38 +05:30
Shreeshrii
c5aa3acbea
Add sections, better formatting 2018-02-21 14:08:40 +05:30
zdenop
8203e55ebb
Merge pull request #6 from Shreeshrii/patch-1
Update README.md for script level traineddata file info
2018-02-21 08:43:09 +01:00
zdenop
c5261254cb
Merge pull request #8 from Shreeshrii/Shreeshrii-extraspaces_chi_tra
Fix extra spaces in chi tra
2018-02-21 08:41:02 +01:00
Shreeshrii
719cfd41d4
Fix extra spaces in words for chi_tra
#Fixes https://github.com/tesseract-ocr/tesseract/issues/991
preserve_interword_spaces 1
2018-02-20 22:50:06 +05:30
Shreeshrii
80d92b7db6
Fix extra intra-word spaces by adding config file
with preserve_interword_spaces 1
2018-02-20 20:18:27 +05:30
Shreeshrii
066ce2dc04
Formatting Changes
All added info is from Ray's comments on different issues in different repositories.
2018-01-17 15:49:45 +05:30
Shreeshrii
cd93ef77f8
update with info re jpn and Japanese 2018-01-17 15:39:19 +05:30
Shreeshrii
c9e5053d48
Update README.md 2018-01-17 15:24:23 +05:30
zdenop
45ed289c6b Merge pull request #3 from Shreeshrii/master
Fix config file for default oem mode, change to --oem 1
2017-09-16 09:03:21 +02:00
Jeff Breidenbach
139ff127aa Use legacy Orientation Script Detector (OSD) because that is the only thing that currently works. 2017-09-15 11:49:11 -07:00
Shreeshrii
999c0449c8 Fix config file for default oem mode, change to --oem 1
Config files had tessedit_ocr_engine_mode	2
causing processing with --oem 3 (default mode based on config file) to fail

Failed loading language 'san'
Tesseract couldn't load any languages!
Could not initialize tesseract.

Only changed the config file, not version.
2017-09-15 18:44:40 +05:30
zdenop
7588b0386b Merge pull request #2 from stweil/master
README: Improve description and add link to Tesseract wiki
2017-09-15 14:35:16 +02:00
Stefan Weil
4888b72756 README: Improve description and add link to Tesseract wiki
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2017-09-15 11:43:44 +02:00
zdenop
f7218f89f1 Merge pull request #1 from stweil/master
Improve GitHub integration
2017-09-15 08:16:23 +02:00
Stefan Weil
56fa30165e README: Add text from former COPYRIGHT and add links
Format also the text, so it looks nicer on GitHub.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2017-09-15 07:36:49 +02:00
Stefan Weil
7a0584012c Use the full Apache License text
Now GitHub is able to detect and show the project license.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2017-09-15 07:27:16 +02:00
Jeff Breidenbach
25cb87d4bb add license info 2017-09-14 15:05:21 -07:00
Jeff Breidenbach
923915d4ce Initial import to github (on behalf of Ray) 2017-09-14 14:35:44 -07:00
Jeff Breidenbach
0415860a32 Testing permissions 2017-09-14 14:28:32 -07:00
theraysmith
f7ec066362 Initial commit 2017-09-11 18:11:20 +01:00