Salience

The Salience Developer Hub

Welcome to the Salience developer hub. You'll find comprehensive guides and documentation to help you start working with Salience as quickly as possible, as well as support if you get stuck. Let's jump right in!

Get Started

Language support in Salience

The Salience text analytics engine is designed to support multiple languages with a single flexible codebase. During the development of support for each new language, Lexalytics creates components that are specific to the needs of the individual language. This page links to additional information about the specifics of each language that we support in Salience.

Starting in Salience 6.1.1, we offer "Second Tier" support for an expanded list of languages. Second Tier languages only feature document level model based sentiment and queries.

Starting in Salience 6.1.1.1479 we offer "Basic Languages" support for additional languages. Basic Languages feature support for document level sentiment, entities and queries.

English (EN)
German (DE)
French (FR)
Spanish (ES)
Portuguese (PT)
Italian (IT)
Dutch (NL)
Romanian (RO)

Chinese (ZH)
Japanese (JP)
Korean (KO)

Russian (RU)
Arabic (AR)
Norwegian (NO)
Swedish (SV)
Danish (DA)

Russian (RU)
Arabic (AR)
Turkish (TR)
Hebrew (HE)
Polish (PL)

Functionality Support

All Salience functionality is developed for analysis of English content first, with subsequent updates to deploy these techniques to the other languages we support. The table below describes the functionality currently available across the currently available language packs.

Document-level functionalityENFRESPTITDENLZHKOJP
Core NLP 1YYYYYYYYYY
SummariesYYYYYYYYYY
ThemesYYYYYYYYYY
Sentiment 2YYYYYYYYYY
Query TopicsYYYYYYYYYY
Concept TopicsYYYYYYYYYY
Categories 3YNNNNNNNNN
Intentions 3YNNNNNNNNN
Entity-level functionalityENFRESPTITDENLZHKOJP
Named entitiesYYYYYYYY 4Y 4Y 4
Relationships 5YNNNNNNNNN
Opinions 6YNNNNNNNNN
Entity sentimentYYYYYYYYYY
Entity themesYYYYYYYYYY
User defined entitiesYYYYYYYYYY
Collection-level functionalityENFRESPTITDENLZHKOJP
Collection entitiesYYYYYYYYYY
Collection themesYYYYYYYYYY
Collection facetsYYYYYYYYYY
Collection Query TopicsYYYYYYYYYY
Collection Concept TopicsYYYYYYYYYY

Notes:

1

Core NLP consists of document tokenization, POS tagging, and chunking. Document details enables access to core NLP results such as bigrams and trigrams, POS tags, term frequencies, etc.

2

All languages support phrase-based sentiment analysis, which is the recommended approach. Model-based sentiment is also supported with a default sentiment model in most languages, and a tool provided to enable customers to generate sentiment models from their own content.

3

Categorization functionality based on Wikipedia was released in Salience 5.1.1, support for this feature is currently only available for English. Intention extraction was released in Salience 6, support for this feature is currently only available in English.

4

The default threshold for entity extraction is 55. For improved recall in entity extraction from Chinese and Korean content, we recommend decreasing the default threshold to 35.

5

Entity relationship extraction is a pattern-based feature that is functionally supported in each language, but the patterns have not been translated into non-English languages.

6

Entity opinion extraction is a pattern-based feature that is functionally supported in each language, but the patterns have not been translated into non-English languages.

Updated 2 months ago

Language support in Salience


Suggested Edits are limited on API Reference Pages

You can only suggest edits to Markdown body content, but not to the API spec.