Commit graph

19 commits

Author SHA1 Message Date
zdenop 7a1c6b06d7
Merge pull request #21 from stweil/script
Move trained data for scripts to new subdirectory
2018-03-10 21:29:59 +01:00
Stefan Weil a2f7ced76b Move trained data for scripts to new subdirectory
This fixes a name conflict for Lao.traineddata and lao.traineddata
which could not be distinguished on case insensitive filesystems
(for example macOS, Windows).

It makes it also easier for users to see which data is for scripts.
Choosing a script works now like this: tesseract -l script/Latin ...

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2018-03-10 21:12:04 +01:00
zdenop 51ebb64c29
Merge pull request #19 from stweil/master
Add Devanagari config file to fix auto PSM issue #1273
2018-02-27 08:25:38 +01:00
Stefan Weil 84bd10ed89 Add Devanagari config file to fix auto PSM issue #1273
Devanagari.config was copied from tessdata_fast.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2018-02-27 07:33:28 +01:00
zdenop 208f104882
Merge pull request #1 from stweil/master
Improve GitHub integration
2018-02-02 10:37:00 +01:00
Stefan Weil e744fa9056 Rename license file
Tesseract uses the file LICENSE to show the Apache License,
so rename COPYING to LICENSE.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2018-02-02 10:18:00 +01:00
Stefan Weil 9963c18ace README: Improve description and add link to Tesseract wiki
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2018-02-02 10:09:47 +01:00
Stefan Weil fb9ae6ba2d README: Add text from former COPYRIGHT and add links
Format also the text, so it looks nicer on GitHub.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2018-02-02 10:09:47 +01:00
Stefan Weil 4928952a62 Use the full Apache License text
Now GitHub is able to detect and show the project license.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2018-02-02 10:09:47 +01:00
zdenop 3e6ec162ae
Merge pull request #17 from stweil/deu
deu: Remove unwanted dependency
2018-02-02 10:02:28 +01:00
Stefan Weil ed5410b928 deu: Remove unwanted dependency
The data included a configuration which required frk.traineddata
("tessedit_load_sublangs frk"). Remove that.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
2018-02-01 15:29:03 +01:00
Jeff Breidenbach f1d12682c0 Use legacy Orientation Script Detector (OSD) because that is the only thing that currently works. 2017-09-15 11:44:08 -07:00
zdenop 5cf1eaafa4 Merge pull request #3 from Shreeshrii/master
Fix Config files to LSTM only for nep and mar
2017-09-15 17:56:59 +02:00
Shreeshrii 9c5c2cb2e7 Fix Config files to LSTM only for nep and mar
Change default mode to
tessedit_ocr_engine_mode	1
2017-09-15 21:22:28 +05:30
zdenop 84ae67cd6f Merge pull request #2 from Shreeshrii/master
Fix config files - Tesseract/LSTM combiner to LSTM only
2017-09-15 17:04:17 +02:00
Shreeshrii 09e4326246 Fix config files from Use Tesseract/LSTM combiner to LSTM only
Config files had tessedit_ocr_engine_mode	2
causing processing with --oem 3 (default mode based on config file) to fail 

Failed loading language 'san' / 'hin'
Tesseract couldn't load any languages!
Could not initialize tesseract.
2017-09-15 18:37:50 +05:30
Jeff Breidenbach c222ed852e add license info 2017-09-14 15:04:55 -07:00
Jeff Breidenbach 9ddc24e750 Initial import (on behalf of Ray) 2017-09-14 14:45:10 -07:00
theraysmith 549354e9f1 Initial commit 2017-09-11 18:12:33 +01:00