diff options
Diffstat (limited to 'graphics/tesseract/README')
-rw-r--r-- | graphics/tesseract/README | 21 |
1 files changed, 21 insertions, 0 deletions
diff --git a/graphics/tesseract/README b/graphics/tesseract/README new file mode 100644 index 0000000000000..f9be7a1c4c17a --- /dev/null +++ b/graphics/tesseract/README @@ -0,0 +1,21 @@ +Tesseract is a commercial quality OCR engine originally developed at HP +between 1985 and 1995. In 1995, this engine was among the top 3 evaluated +by UNLV. It was open-sourced by HP and UNLV in 2005. + +You will need to get one of the language packs in order to do anything +useful with tesseract, and that language pack tarball should be present +in the same directory as the SlackBuild script when the package is created. +See http://code.google.com/p/tesseract-ocr/downloads/list for a list of +all available language packs. Note that you can install more than one +(or even all) of the language packs, as they do not conflict with each +other. The build script defaults to use English, but this is easily +changed by passing an alternate value on the command line. + +Here is the relevant code from the build script: + # Language pack(s) to use + # We'll install English by default, but you can pass another one (or all) + # of them on the command line (space delimited). If you pass more than one + # (again, space delimited), you must enclose the string in quotes. Examples: + # TESSLANG=fra ./tesseract.SlackBuild + # TESSLANG="deu deu-f eng fra ita nld por spa vie" ./tesseract.SlackBuild + TESSLANG=${TESSLANG:-eng} # Default to English |