| Under this agreement, Convera will embed TEMIS' XeLDA linguistic engine in four languages (French, Spanish, Russian, and German) into its Excalibur WebSearch Platform. XeLDA is a multilingual linguistic engine. It models and standardizes unstructured documents in order to automatically exploit their content. XeLDA provides advanced Text Mining features enabling textual information processing. XeLDA offers services based on natural language processing components that can be integrated in business applications including: Automatic identification of the language within each document; Segmentation of text into sentences; Split of text into basic lexical units (tokenization); Morphological text analysis to return the normalized form (the lemma) and the potential grammatical categories for all the words identified during the tokenization stage; Morpho-syntactic disambiguation to determine the exact grammatical category of a word according to its context; Extraction of sequences of words that form noun phrases; Identification of the context of a word to find the corresponding dictionary entry (Dictionary lookup); and Recognition of idiomatic expressions. (www.temis.com; www.convera.com) |