Sumários

Data Cleaning

28 novembro 2014, 14:00 Helena Galhardas

  • Motivation
  • Application Contexts
  • Methodology of the data cleaning process
  • Main tasks 
  • Dimensions
  • Taxonomy of the data quality problems
  • Tools.


Creating Schema Mappings (cont.)

27 novembro 2014, 14:30 Helena Galhardas

 

Creating schema mappings:

  • Components of a schema matching system: matchers, combiner, constraint-enforcer, match-selector
  • Applying machine learning to enable the schema matcher to learn.
 

 


String matching

27 novembro 2014, 11:00 Diogo Ribeiro Ferreira

  • Exercícios sobre string matching
  • Métricas de similaridade: Jaro, Jaro-Winkler, e Jaccard
  • Edit distance


Creating Schema Mappings (cont.) and Data Cleaning

27 novembro 2014, 09:30 Helena Galhardas

 

Creating schema mappings:

  • From matches to mappings: CLIO query discovery algorithm.

Data Cleaning: introduction. 

 

 


String matching

27 novembro 2014, 08:00 Diogo Ribeiro Ferreira

  • Exercícios sobre string matching
  • Métricas de similaridade: Jaro, Jaro-Winkler, e Jaccard
  • Edit distance