Sumários

Data profiling and Introduction to Data Warehousing

24 outubro 2019, 14:00 Helena Galhardas

Data Profiling:

  • Motivation and Definition
  • Typical data profiling procedure
  • Data profiling tasks
  • Data profiling tools
  • Visualization

Introduction to Data Warehousing:
  • Motivation for data warehousing
  • Definition of data warehouse
  • Normalized versus non-normalized schema


Duplicate detection and elimination with PDI and Lab Guide 5

24 outubro 2019, 11:00 João Pedro Lebre Magalhães Pereira

  • How to specify a duplicate detection and elimination process with PDI transformations
  • Resolution of Lab Guide 5


Lab Guide 5 + Duplicate detection and elimination with PDI

22 outubro 2019, 15:30 Anna Couto

  • How to specify a duplicate detection and elimination process with PDI transformations
  • Resolution of Lab Guide 5


Lab Guide 5 + Duplicate detection and elimination with PDI

22 outubro 2019, 14:00 Anna Couto

  • How to specify a duplicate detection and elimination process with PDI transformations
  • Resolution of Lab Guide 5


Data Matching and Fusion (cont.); The data cleaning tool Cleenex

21 outubro 2019, 15:30 Helena Galhardas

Data Fusion (elimination of approximate duplicates):

  • Types of data conflicts
  • Data conflict resolution strategies and functions
  • Relational operators and extensions 
Cleenex: a data cleaning tool incorporating the user feedback 
  • User involvement through Quality Constraints and Manual Data Repairs
  • Data debugging through a data derivation mechanism
  • Demonstration of the system