<Lexalytics root>/data/spelling

<<Sentencetype | Back to Data Directory index | Stopwords>>

This directory contains files that support the identification of alternate forms for tokens and phrases. For each file, click on the filename for more detailed information below.

alternateforms.dat List of misspellings and other alternate forms for common phrases

These files may be customized within a spelling section of a user directory.

Customizing alternate form functionality in user/spelling

alternateforms.dat

This datafile contains common misspellings and other alternate forms for common phrases. These alternate forms do not directly and automatically rewrite provided content that contains the alternate forms, instead Salience considers both possible forms when performing text analytics functions such as chunking and POS tagging.

Users can override this file in a user/spelling directory to expand coverage of the alternate forms seen in their domain of content.