Summaries
Data Profiling and Introduction to Data Warehousing
24 October 2019, 14:00 • Helena Galhardas
Data Profiling:
- Motivation and Definition
- Typical data profiling procedure
- Data profiling tasks
- Data profiling tools
- Visualization
Introduction to Data Warehousing:
- Motivation for data warehousing
- Definition of data warehouse
- Normalized versus non-normalized schema
Duplicate detection and elimination with PDI and Lab Guide 5
24 October 2019, 11:00 • João Pedro Lebre Magalhães Pereira
- How to specify a duplicate detection and elimination process with PDI transformations
- Resolution of Lab Guide 5
Lab Guide 5 + Duplicate detection and elimination with PDI
22 October 2019, 15:30 • Anna Couto
- How to specify a duplicate detection and elimination process with PDI transformations
- Resolution of Lab Guide 5
Lab Guide 5 + Duplicate detection and elimination with PDI
22 October 2019, 14:00 • Anna Couto
- How to specify a duplicate detection and elimination process with PDI transformations
- Resolution of Lab Guide 5
Data Matching and Fusion (cont.); The data cleaning tool Cleenex
21 October 2019, 15:30 • Helena Galhardas
Data Fusion (elimination of approximate duplicates):
- Types of data conflicts
- Data conflict resolution strategies and functions
- Relational operators and extensions
The data cleaning tool Cleenex:
- User involvement through Quality Constraints and Manual Data Repairs
- Data debugging through a data derivation mechanism
- Demonstration of the system
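
As an illustration of the data conflict resolution functions listed under Data Fusion above, here is a minimal Python sketch. It assumes a cluster of records already identified as approximate duplicates and fuses them into one record by applying one resolution function per attribute. The function names (`vote`, `longest`, `maximum`, `fuse`) and the sample records are hypothetical and chosen for illustration only; they are not taken from the lecture material, Cleenex, or PDI.

```python
from collections import Counter


def _present(values):
    """Drop missing (None) values before resolving a conflict."""
    return [v for v in values if v is not None]


def vote(values):
    """Pick the most frequent non-null value (ties: first one seen)."""
    present = _present(values)
    return Counter(present).most_common(1)[0][0] if present else None


def longest(values):
    """Pick the longest non-null string, assuming it is the most complete."""
    present = _present(values)
    return max(present, key=len) if present else None


def maximum(values):
    """Pick the largest non-null value (e.g. the most recent year)."""
    present = _present(values)
    return max(present) if present else None


def fuse(records, resolvers):
    """Fuse a cluster of duplicate records into a single record.

    `resolvers` maps each attribute name to the function used to
    resolve conflicts among that attribute's values.
    """
    return {
        attr: resolve([r.get(attr) for r in records])
        for attr, resolve in resolvers.items()
    }


# Hypothetical cluster of approximate duplicates.
cluster = [
    {"name": "J. Smith", "city": "Lisbon", "year": 2017},
    {"name": "John Smith", "city": "Lisboa", "year": 2019},
    {"name": "John Smith", "city": "Lisbon", "year": None},
]

fused = fuse(cluster, {"name": longest, "city": vote, "year": maximum})
print(fused)
```

Choosing a resolution function per attribute keeps the strategy explicit: here a missing value is simply ignored, while a real contradiction (two different cities) is decided by majority vote.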