Tagging Threshold
Sets the threshold, as a percentage of alphanumeric characters, that text must meet for text processing.
|
Min/Max range
| 1-100
|
Default value
| 80
|
|
C API
|
SalienceOption oOption;
oOption.nOption = SALIENCEOPTION_TAGGINGTHRESHOLD;
oOption.nValue = <integer value>;
lxaSetSalienceOption(pSession,&oOption);
|
.NET
| SalienceEngine.TextThreshold = integer value
|
Java
| Salience.Options.Base.setTaggingThreshold(integer value)
|
Python
| salience.setOption_TaggingThreshold(session,integer value, configuration ID)
|
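The threshold can be pictured as a simple ratio check. The following is a minimal sketch, not Salience's implementation: the function names are hypothetical, and it assumes the percentage is taken over non-whitespace characters and that text at or above the threshold passes, neither of which is stated in this reference.

```python
def alphanumeric_percentage(text: str) -> float:
    """Return the percentage (0-100) of non-whitespace characters that are alphanumeric."""
    # Assumption: whitespace is excluded from the count.
    chars = [c for c in text if not c.isspace()]
    if not chars:
        return 0.0
    alnum = sum(1 for c in chars if c.isalnum())
    return 100.0 * alnum / len(chars)


def passes_tagging_threshold(text: str, threshold: int = 80) -> bool:
    """Hypothetical check: True when the text meets the threshold percentage."""
    if not 1 <= threshold <= 100:
        raise ValueError("threshold must be in the range 1-100")
    return alphanumeric_percentage(text) >= threshold
```

Under these assumptions, ordinary prose passes at the default of 80, while text dominated by symbols or markup residue does not unless the threshold is lowered.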
Max Execution Time
Maximum time, in milliseconds, that certain functions are allowed to run.
|
Min/Max range
| 0-MAX INT
|
Default value
| 120
|
|
C API
|
SalienceOption oOption;
oOption.nOption = SALIENCEOPTION_MAXEXECUTIONTIME;
oOption.nValue = <integer value>;
lxaSetSalienceOption(pSession,&oOption);
|
.NET
| SalienceEngine.MaxExecutionTime = integer value
|
Java
| Salience.Options.Base.setMaxExecutionTime(integer value)
|
Python
| salience.setOption_MaxExecutionTime(session,integer value, configuration ID)
|
Fail on Very Long Sentences
Controls whether processing continues when a document contains a very long sentence. With the default value of 1, a document containing a sentence of more than 1000 words throws an exception in the text preparation stage. If this option is set to 0, the overlong sentence is ignored and the remainder of the document content is processed.
|
Min/Max range
| 0 or 1
|
Default value
| 1
|
|
C API
|
SalienceOption oOption;
oOption.nOption = SALIENCEOPTION_FAILONLONGSENTENCE;
oOption.nValue = <integer value>;
lxaSetSalienceOption(pSession,&oOption);
|
.NET
| SalienceEngine.FailOnLongSentences = boolean value
|
Java
| Salience.Options.Base.setFailLongSentence(boolean value)
|
Python
| salience.setOption_FailOnLongSentences(session,integer value, configuration ID)
|
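The two behaviors described for this option can be sketched as follows. This is an illustration only: `prepare_sentences` and `LongSentenceError` are hypothetical names, words are counted by whitespace splitting (an assumption), and the real text preparation stage is not shown.

```python
MAX_SENTENCE_WORDS = 1000  # limit stated in the option description


class LongSentenceError(Exception):
    """Hypothetical stand-in for the exception thrown during text preparation."""


def prepare_sentences(sentences, fail_on_long_sentences=1):
    """Return the sentences kept for processing under the given option value."""
    kept = []
    for sentence in sentences:
        # Assumption: a "word" is a whitespace-separated token.
        if len(sentence.split()) > MAX_SENTENCE_WORDS:
            if fail_on_long_sentences:
                raise LongSentenceError("sentence exceeds 1000 words")
            continue  # option set to 0: skip this sentence, keep the rest
        kept.append(sentence)
    return kept
```

With the option at 0, a document with one overlong sentence still yields its remaining sentences; with the default of 1, the whole document fails.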
User Directory
Sets the path to the user directory containing customizations.
|
Min/Max range
| N/A
|
Default value
| path to data directory/user
|
|
C API
|
SalienceOption oOption;
oOption.nOption = SALIENCEOPTION_USERDIRECTORY;
oOption.acValue = <string value>;
lxaSetSalienceOption(pSession,&oOption);
|
.NET
| SalienceEngine.UserDirectory = string value
|
Java
| Salience.Options.Base.setUserDirectory(string value)
|
Python
| salience.setOption_UserDirectory(session,string value, configuration ID)
|
Concept Slop
Threshold indicating overlap for concepts, used when merging themes together. Lower values generate far more merges.
|
Min/Max range
| 0-1
|
Default value
| 0.8
|
|
C API
|
SalienceOption oOption;
oOption.nOption = SALIENCEOPTION_CONCEPTSLOP;
oOption.fValue = <float value>;
lxaSetSalienceOption(pSession,&oOption);
|
.NET
| SalienceEngine.ConceptSlop = float value
|
Java
| Salience.Options.Base.setConceptSlop(float value)
|
Python
| salience.setOption_ConceptSlop(session,float value, configuration ID)
|
Document Semantics
Uses the concept matrix for themes on a per-document basis. If set to 1, themes are rolled together.
|
Min/Max range
| 0 or 1
|
Default value
| 0
|
|
C API
|
SalienceOption oOption;
oOption.nOption = SALIENCEOPTION_USEDOCUMENTSEMANTICS;
oOption.nValue = <integer value>;
lxaSetSalienceOption(pSession,&oOption);
|
.NET
| SalienceEngine.UseSemantics = boolean value
|
Java
| Salience.Options.Base.useSemantics(integer value)
|
Python
| salience.setOption_DocumentSemantics(session,integer value, configuration ID)
|
Lists and Tables
Determines whether the document contains lists and tables.
|
Min/Max range
| 0 or 1
|
Default value
| 1
|
|
C API
|
SalienceOption oOption;
oOption.nOption = SALIENCEOPTION_LISTSANDTABLES;
oOption.nValue = <integer value>;
lxaSetSalienceOption(pSession,&oOption);
|
.NET
| SalienceEngine.CalculateListsAndTables = boolean value
|
Java
| Salience.Options.Base.setCalculateListsAndTables(boolean value)
|
Python
| salience.setOption_CalculateListsAndTables(session,integer value, configuration ID)
|
One Sentence
Processes the entire document as a single sentence. This is useful for Twitter and other microblog content, which can consist of multiple sentence fragments. Defaults to false (0).
|
Min/Max range
| 0 or 1
|
Default value
| 0
|
|
C API
|
SalienceOption oOption;
oOption.nOption = SALIENCEOPTION_ONESENTENCE;
oOption.nValue = <integer value>;
lxaSetSalienceOption(pSession,&oOption);
|
.NET
| SalienceEngine.ProcessAsOneSentence = boolean value
|
Java
| Salience.Options.Base.setProcessAsOneSentence(boolean value)
|
Python
| salience.setOption_ProcessAsOneSentence(session,integer value, configuration ID)
|
Shared Memory
Puts large data files into interprocess memory space for a smaller overall footprint. This option defaults to false (0) on both Windows and Linux. NOTE: Use on Linux may require running the following command (as root or with sudo): /sbin/sysctl -w kernel.shmmax=4294967296
|
Min/Max range
| 0 or 1
|
Default value
| 0 (Linux), 0 (Windows)
|
|
C API
|
SalienceOption oOption;
oOption.nOption = SALIENCEOPTION_SHAREDMEMORY;
oOption.nValue = <integer value>;
lxaSetSalienceOption(pSession,&oOption);
|
.NET
| SalienceEngine.UseSharedMemory = boolean value
|
Java
| Salience.Options.Base.setUseSharedMemory(boolean value)
|
Python
| salience.setOption_UseSharedMemory(session,integer value, configuration ID)
|
Complex Stems
Enables the processing of words containing repeated characters (e.g. "I loooovvvve Salience" into "I love Salience") at a cost of processing time. You can also specify a sentiment multiplier for the word in the complexstems.dat data file.
|
Min/Max range
| 0 or 1
|
Default value
| 0
|
|
C API
|
SalienceOption oOption;
oOption.nOption = SALIENCEOPTION_COMPLEXSTEMS;
oOption.nValue = <integer value>;
lxaSetSalienceOption(pSession,&oOption);
|
.NET
| SalienceEngine.ProcessComplexStems = boolean value
|
Java
| Salience.Options.Base.setComplexStems(boolean value)
|
Python
| salience.setOption_ProcessComplexStems(session,integer value, configuration ID)
|
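The kind of normalization described above can be approximated with a regular expression. This is a rough sketch, not how Salience handles complex stems: `collapse_repeats` is a hypothetical helper, and it assumes that any run of three or more identical word characters collapses to a single character, which would mishandle words such as "coooool".

```python
import re

# A word character followed by two or more copies of itself (a run of 3+).
_REPEATS = re.compile(r"(\w)\1{2,}")


def collapse_repeats(text: str) -> str:
    """Collapse runs of three or more identical word characters to one."""
    return _REPEATS.sub(r"\1", text)
```

Limiting the pattern to runs of three or more leaves legitimate double letters, as in "too" or "all", untouched.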
Flatten All Upper Case
When set to true, a sentence consisting entirely of upper case tokens is converted to lower case for better POS tagging. This is recommended when documents contain many all-upper-case sentences (common in Twitter content), since shifting to lower case gives more consistent POS tagging across upper case and lower case text.
|
Min/Max range
| 0 or 1
|
Default value
| 0
|
|
C API
|
SalienceOption oOption;
oOption.nOption = SALIENCEOPTION_FLATTENALLUPPERCASE;
oOption.nValue = <integer value>;
lxaSetSalienceOption(pSession,&oOption);
|
.NET
| SalienceEngine.FlattenAllUpperCase = boolean value
|
Java
| Salience.Options.Base.setFlattenAllUpperCase(boolean value)
|
Python
| salience.setOption_FlattenAllUpperCase(session,integer value, configuration ID)
|
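The flattening rule can be sketched in a few lines. This is an illustration under assumptions, not Salience's tokenizer: `flatten_if_all_upper` is a hypothetical helper, tokens are split on whitespace, and tokens without letters (numbers, punctuation) are ignored when deciding whether a sentence is all upper case.

```python
def flatten_if_all_upper(sentence: str) -> str:
    """Lower-case the sentence only when every alphabetic token is upper case."""
    # Assumption: tokens are whitespace-separated; letterless tokens are ignored.
    tokens = [t for t in sentence.split() if any(c.isalpha() for c in t)]
    if tokens and all(t.isupper() for t in tokens):
        return sentence.lower()
    return sentence
```

A mixed-case sentence such as "NASA launched a rocket" is left unchanged, so acronyms keep their casing outside all-caps sentences.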
Content HTML
By default, the content provided to Salience for processing must be plain text. Enabling this option allows HTML content to be provided to text preparation methods, either as a raw well-formed HTML string or as a path to an HTML file.
|
Min/Max range
| 0 or 1
|
Default value
| 0
|
|
C API
|
SalienceOption oOption;
oOption.nOption = SALIENCEOPTION_CONTENTHTML;
oOption.nValue = <integer value>;
lxaSetSalienceOption(pSession,&oOption);
|
.NET
| SalienceEngine.ContentHTML = boolean value
|
Java
| Salience.Options.Base.setContentHTML(boolean value)
|
Python
| salience.setOption_ContentHTML(session,integer value, configuration ID)
|
Alternate Forms
Turns on internal alternate forms, allowing Salience to predict intended word choices, e.g. "to" instead of "too". On by default for short-form mode, off by default in default mode.
|
Min/Max range
| 0 or 1
|
Default value
| 0
|
|
C API
|
SalienceOption oOption;
oOption.nOption = SALIENCEOPTION_ALTERNATEFORMS;
oOption.nValue = <integer value>;
lxaSetSalienceOption(pSession,&oOption);
|
.NET
| SalienceEngine.AlternateForms = boolean value
|
Java
| Salience.Options.Base.setAlternateForms(boolean value)
|
Python
| salience.setOption_AlternateForms(session,integer value, configuration ID)
|
Disable Chainer
Salience uses a chaining algorithm to understand the connections between regions of a document, but especially in non-English content this can be a little slow. The chainer can be disabled to create a small performance boost at the cost of some degree of accuracy in things like summarization, sentiment assignment and theme detection.
|
Min/Max range
| 0 or 1
|
Default value
| 0
|
|
C API
|
SalienceOption oOption;
oOption.nOption = SALIENCEOPTION_USECHAINER;
oOption.nValue = <integer value>;
lxaSetSalienceOption(pSession,&oOption);
|
.NET
| SalienceEngine.UseChainer = boolean value
|
Java
| Salience.Options.Base.setUseChainer(boolean value)
|
Python
| salience.setOption_UseChainer(session,integer value, configuration ID)
|