Summaries
Introduction to Information Extraction
12 October 2017, 09:30 • Bruno Emanuel Da Graça Martins
- Introduction to Information Extraction (IE)
- IE Problems, Applications and Tasks (e.g., named entity recognition and classification, named entity resolution, relation extraction, etc.)
- Techniques for Information Extraction
- Rule-based approaches and regular expressions
- Introduction to machine learning approaches for IE (e.g., entity recognition as a B-I-O sequence labelling task)
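As a rough illustration of the two ideas above (a rule-based extractor and the B-I-O encoding), the sketch below marks maximal runs of capitalised tokens as entities and converts the resulting spans into B-I-O tags. The regular expression, the function names, the generic `ENT` label and the example sentence are all assumptions made for illustration; they are not taken from the lecture material, and a real rule-based system would use much richer rules.

```python
import re

# Toy rule (an assumption for illustration): a token made of one capital
# letter followed by lowercase letters is treated as part of an entity.
CAPITALISED = re.compile(r"^[A-Z][a-z]+$")


def find_capitalised_spans(tokens, label="ENT"):
    """Return (start, end_exclusive, label) for each maximal run of capitalised tokens."""
    spans, start = [], None
    for i, tok in enumerate(tokens):
        if CAPITALISED.match(tok):
            if start is None:
                start = i  # a new entity begins here
        elif start is not None:
            spans.append((start, i, label))  # the run ended at the previous token
            start = None
    if start is not None:
        spans.append((start, len(tokens), label))  # run reaches the end of the sentence
    return spans


def bio_tags(tokens, entity_spans):
    """Encode entity spans as one B-I-O tag per token."""
    tags = ["O"] * len(tokens)
    for start, end, label in entity_spans:
        tags[start] = f"B-{label}"  # first token of the entity
        for i in range(start + 1, end):
            tags[i] = f"I-{label}"  # remaining tokens of the same entity
    return tags


tokens = "the lecture by Bruno Martins took place in Lisbon".split()
spans = find_capitalised_spans(tokens)
tags = bio_tags(tokens, spans)
for tok, tag in zip(tokens, tags):
    print(f"{tok}\t{tag}")
```

In a machine learning setting the tags would be predicted by a sequence labelling model rather than produced by a hand-written rule; the encoding of the output stays the same.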
Document classification and clustering
11 October 2017, 11:30 • Pável Pereira Calado
- Document classification/clustering with the Python scikit-learn library
- Implementing naive Bayes and K-nearest neighbour classifiers
- Pen-and-paper exercises about document classification with the naive Bayes algorithm
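A minimal sketch of a multinomial naive Bayes classifier, written in plain Python so that each step can be checked by hand in the same way as a pen-and-paper exercise. The use of add-one (Laplace) smoothing, the function names and the four toy training documents are assumptions for illustration; the class exercises may have used a different variant or the scikit-learn implementation.

```python
import math
from collections import Counter, defaultdict


def train_nb(docs):
    """docs is a list of (tokens, label) pairs; returns the counts needed for prediction."""
    class_docs = Counter(label for _, label in docs)  # documents per class
    word_counts = defaultdict(Counter)                # term counts per class
    vocab = set()
    for tokens, label in docs:
        word_counts[label].update(tokens)
        vocab.update(tokens)
    return class_docs, word_counts, vocab


def predict_nb(model, tokens):
    """Return the class with the highest log-probability score."""
    class_docs, word_counts, vocab = model
    total_docs = sum(class_docs.values())
    scores = {}
    for label, n_docs in class_docs.items():
        total_words = sum(word_counts[label].values())
        score = math.log(n_docs / total_docs)  # log prior
        for tok in tokens:
            if tok in vocab:  # words never seen in training are ignored
                # add-one smoothing: (count + 1) / (class length + vocabulary size)
                score += math.log((word_counts[label][tok] + 1) / (total_words + len(vocab)))
        scores[label] = score
    return max(scores, key=scores.get)


# Hypothetical toy collection, invented for this example.
training = [
    ("cheap pills offer".split(), "spam"),
    ("cheap offer now".split(), "spam"),
    ("meeting agenda today".split(), "ham"),
    ("project meeting notes".split(), "ham"),
]
model = train_nb(training)
print(predict_nb(model, "cheap offer".split()))
print(predict_nb(model, "meeting notes".split()))
```

Working in log space avoids numerical underflow when many small probabilities are multiplied, and it does not change which class scores highest.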
Vector space model for information retrieval
6 October 2017, 15:30 • João Miguel Cordeiro Monteiro
- Vector space model for information retrieval
- Cosine Similarity
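The following sketch shows how cosine similarity can be used to rank documents against a query in the vector space model. To keep it short it uses raw term frequencies as weights, which is a simplifying assumption (TF-IDF weighting is the more common choice); the documents, the query and the function names are likewise invented for illustration.

```python
import math
from collections import Counter


def tf_vector(text):
    """Represent a text as a sparse term-frequency vector."""
    return Counter(text.lower().split())


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse vectors stored as dict-like objects."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty vector is treated as similar to nothing
    return dot / (norm_a * norm_b)


# Hypothetical toy collection and query.
docs = {
    "d1": "information retrieval with vector space model",
    "d2": "vector space model for clustering",
    "d3": "naive bayes classification",
}
vectors = {name: tf_vector(text) for name, text in docs.items()}
query = tf_vector("vector space retrieval")

# Rank documents by decreasing similarity to the query.
ranking = sorted(docs, key=lambda d: cosine_similarity(query, vectors[d]), reverse=True)
for name in ranking:
    print(name, round(cosine_similarity(query, vectors[name]), 3))
```

Because the measure normalises by vector length, a long document is not favoured merely for containing more terms.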
Exercises about the vector space model for information retrieval
6 October 2017, 15:30 • Bruno Emanuel Da Graça Martins
- Exercises about the vector space model for information retrieval
- Support for the course project
Clustering and Dimensionality Reduction
6 October 2017, 12:30 • Pável Pereira Calado
- The Clustering Hypothesis in Information Retrieval
- Applications of Clustering and Dimensionality Reduction in Information Retrieval
- Clustering Techniques
- Hierarchical Agglomerative Clustering
- K-Means and Soft K-Means
- Dimensionality Reduction Techniques
- Self-Organising Maps
- Multidimensional Scaling
- Latent Semantic Indexing
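Of the clustering techniques listed above, hard K-Means is the simplest to sketch. The version below alternates between assigning each point to its nearest centroid and recomputing each centroid as the mean of its members. Initialising the centroids with the first k points is an assumption made to keep the example deterministic (random or smarter initialisation is more usual), and the two-dimensional points stand in for document vectors; the soft variant and the other listed techniques are not covered here.

```python
def squared_distance(p, q):
    """Squared Euclidean distance between two points of equal dimension."""
    return sum((x - y) ** 2 for x, y in zip(p, q))


def nearest(point, centroids):
    """Index of the centroid closest to the given point."""
    return min(range(len(centroids)), key=lambda c: squared_distance(point, centroids[c]))


def kmeans(points, k, iterations=10):
    """Hard K-Means; returns (centroids, cluster index for each point)."""
    centroids = [points[i] for i in range(k)]  # deterministic initialisation (assumption)
    for _ in range(iterations):
        # Assignment step: group points by nearest centroid.
        clusters = [[] for _ in range(k)]
        for p in points:
            clusters[nearest(p, centroids)].append(p)
        # Update step: move each centroid to the mean of its members.
        new_centroids = []
        for c, members in enumerate(clusters):
            if members:
                new_centroids.append(tuple(sum(dim) / len(members) for dim in zip(*members)))
            else:
                new_centroids.append(centroids[c])  # keep an empty cluster's centroid
        if new_centroids == centroids:
            break  # converged: no centroid moved
        centroids = new_centroids
    assignments = [nearest(p, centroids) for p in points]
    return centroids, assignments


# Hypothetical toy data: two visibly separated groups.
points = [(1.0, 1.0), (1.5, 2.0), (8.0, 8.0), (9.0, 9.5), (1.0, 0.5), (8.5, 9.0)]
centroids, assignments = kmeans(points, k=2)
print(assignments)
```

The result depends on the initial centroids, so in practice the algorithm is usually run several times from different starting points.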