../data/correction
This directory contains files that support OCR error correction. Each file is described in its own section below, including how users can override or customize it where that is supported.
File | Description
---|---
lexicon.dat | A dictionary of ordinary vocabulary words
errorstats.dat | The data table controlling character substitution during OCR error correction
auxiliary.bin | A dictionary of non-ordinary vocabulary words, such as names and abbreviations
*.aux | Any file ending in .aux is treated as an auxiliary dictionary
lexicon.dat
This file is a dictionary of ordinary words that Salience OCR error correction recognizes out of the box. Do not modify this file. To add more ordinary vocabulary, create your own "lexicon.aux" file (as described in OCR Correction Dictionaries) in the correction section of a user directory.
errorstats.dat
This file controls character substitution during OCR error correction. It should never be modified.
auxiliary.bin
This file is a dictionary of non-ordinary vocabulary words, such as names and abbreviations. Because it is a binary file, it cannot be viewed, much less modified. To add names or special vocabulary, create your own "*.aux" file (as described in OCR Correction Dictionaries) in the correction section of a user directory.
*.aux
Any file ending in .aux is treated as an auxiliary dictionary (as described in OCR Correction Dictionaries). It is usually best practice to maintain these files in the correction section of a user directory rather than in the correction section of your data directory.
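To check where auxiliary dictionaries currently live, a small script along the following lines may help. This is only a sketch: the `salience/data` and `salience/user` paths and the `find_aux_files` helper are hypothetical and not part of Salience, and it assumes each directory has a `correction` subdirectory as described above. It only lists files by extension and does not read or validate dictionary contents.

```python
from pathlib import Path


def find_aux_files(root: Path) -> list[Path]:
    """Return the *.aux files in the correction section of `root`, sorted by path."""
    correction = root / "correction"
    if not correction.is_dir():
        # No correction section here, so there is nothing to report.
        return []
    return sorted(p for p in correction.glob("*.aux") if p.is_file())


if __name__ == "__main__":
    # Hypothetical locations; substitute your own data and user directories.
    data_dir = Path("salience/data")
    user_dir = Path("salience/user")

    for aux in find_aux_files(user_dir):
        print(f"user dictionary: {aux}")

    # Auxiliary dictionaries found here are candidates for moving to a user directory.
    for aux in find_aux_files(data_dir):
        print(f"consider moving to a user directory: {aux}")
```

Any .aux file reported under the data directory can then be reviewed and, if appropriate, moved to the correction section of a user directory.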