Invited talk by SAPO Labs: theoretical class, next tuesday, 8/11, 9H30

6 novembro 2011, 19:26 Helena Galhardas

Next tuesday, 8/11, an invited speaker from SAPO Labs will give the following talk:

Semantic APIS's

Luís Sarmento, SAPO Labs

In this talk, we will present a set of APIs that have been developed by SAPO Labs, whose goal is to help the processing of contents in Natural Language. These APIs are open source and they can be used in the construction of Information Extraction and Visualization applications.

These APIs allow the access to three types of   resources: (i) lexical-semantic databases; (ii) basic operations for text processing: and (iii) distilled data coming from on-line journals.

In what concerns the lexical-semantic databases, there are two APIs. SAPO Semantic Lists that publishes lists of words semantically categorized (e.g., lists of occupations). Verbetes supplies information about how important people are mentioned and an historic view of their activities. For example, the Verbetes API allows to know that "Paulo Bento" is the "current national football coacher", but he was the "Sporting coacher", or that "Villas Boas" is an alternative valid name for "Villas-Boas" and that, in the football context, it probably refers to "André Villas-Boas", who is the "Chelsea FC coacher".

Concerning the API of basic operations for text processing, we will speak about the API for processing user-generated contents (e.g., text containing on-line comments, or Twitter messages). It supports the execution of low-level tasks such as the delimitation of words, the identification of "smileys", and the normalization of vocabulary. We will also present the API for identifying entity names that, currently, allows to anotate person names in text, as well as other elements (e.g., occupation).

Finally, we will present the APIs that produce distilled data coming from on-line journals. We will talk about the API SAPO News Trends that supplies information about which topics and important people are the "hottest" in current on-line journals, as well as their history in the last years, We will also present the API Sapo Voxx that allows the access to citations to different important people that were published in on-line journals and also permits to search the corresponding historic.

All these APIS are available as Web Services or through SW modules that can be installed locally. We will finish our talk by giving examples of their use.