Below is a brief description of each project developed in the context of the 2012 edition of the DEIC course on Information Retrieval.
Pedro Pascoal - 3D Model Retrieval
This project explored information retrieval methods for finding examples in collections of 3D objects, leveraging content-based descriptors.
- First presentation : An overview on 3D model retrieval (16/11/2012)
- Second presentation : The bag-of-words approach to multimedia and 3D model retrieval (07/12/2012)
- Project report, presentation and evaluation
- Bibliography:
- Johan W. Tangelder and Remco C. Veltkamp (2008) "A survey of content based 3D shape retrieval methods". Multimedia Tools and Applications, 39(3), 441-471.
Pedro Mota - Retrieval of Similar Answers in Email Archives
This project explored the use of similarity search techniques in a query-by-example scenario: searching a large email archive for similar emails that contain answers to questions posed by real users in a help-desk context.
- First presentation : Efficient document similarity computation (16/11/2012)
- Second presentation : An overview on FAQ retrieval (07/12/2012)
- Project report, presentation and evaluation
- Bibliography:
- Andrei Z. Broder (1997) "On the resemblance and containment of documents", Proceedings of the IEEE Conference on Compression and Complexity of Sequences, Positano, Amalfitan Coast, Salerno, Italy, June 11-13, 1997
Silvio Moreira - Microblog Retrieval
This project explored information retrieval methods for microblog posts, using the datasets and methodology of the microblog retrieval task proposed at the TREC conference.
- First presentation : An overview on the challenges in microblog retrieval (23/11/2012)
- Second presentation : Effective methods from the TREC microblog retrieval task (14/12/2012)
- Bibliography:
- Ian Soboroff, Dean McCullough, Jimmy Lin, Craig Macdonald, Iadh Ounis, and Richard McCreadie (2012) "Evaluating Real-Time Search over Tweets". In Proceedings of the 6th International AAAI Conference on Weblogs and Social Media
- Rodrygo L. T. Santos, Craig Macdonald, Richard McCreadie, Iadh Ounis, and Ian Soboroff (2012) "Information Retrieval on the Blogosphere". Foundations and Trends in Information Retrieval, 6(1), 1-125.
- I. Ounis, C. Macdonald, J. Lin, and I. Soboroff (2011) "Overview of the TREC-2011 microblog track". In Proceedings of the Text REtrieval Conference, 2011.
- Miles Efron (2011) "Information search and retrieval in microblogs". Journal of the American Society for Information Science and Technology. 62, 6, 996-1008.
Miguel Costa - Text-Driven Forecasting
This project explored the use of supervised learning methods for making predictions based on data from textual documents, specifically addressing the task of predicting the number of clicks that a news story will receive based on its text.
- First presentation : Text-Driven Forecasting (23/11/2012)
- Second presentation : Document Classification with Tree-Based Methods (14/12/2012)
- Project report, presentation and evaluation
- Bibliography:
- Noah A. Smith (2010) "Text-Driven Forecasting". Technical Report.
- Wei-Yin Loh (2011) "Classification and regression trees". Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 1, 14–23.
Hugo Rodrigues - Answer Selection in Question Answering Systems
This project explored the use of learning-to-rank methods for selecting the best candidate answers in question answering systems.
- First presentation : The Answer Selection Problem (30/11/2012)
- Second presentation : Learning to Rank for Information Retrieval (21/12/2012)
- Project report, presentation and evaluation
- Bibliography:
- Ana Cristina Mendes and Luísa Coheur (2011) "An approach to answer selection in question-answering based on semantic relations". In Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence, 1852-1857.
- Jun Suzuki, Yutaka Sasaki, and Eisaku Maeda (2002) "SVM answer selection for open-domain question answering". In Proceedings of the 19th International Conference on Computational Linguistics, Stroudsburg, PA, USA, 1-7.
- Tie-Yan Liu (2009) "Learning to Rank for Information Retrieval". Foundations and Trends in Information Retrieval. 3(3), 225-331.
Wesley Mathew - Retrieving Trajectories with Textual Annotations
This project explored indexing structures that support efficient access to trajectory datasets in which each visited location carries a textual annotation.
- First presentation : Joint indexing of textual and spatial information (30/11/2012)
- Second presentation : Spatio-Textual Search over Road Networks (21/12/2012)
- Project report, presentation and evaluation
- Bibliography:
- Gao Cong, Christian S. Jensen, and Dingming Wu (2009) "Efficient retrieval of the top-k most relevant spatial web objects". Proceedings of the VLDB Endowment. 2(1), 337-348.
- Gao Cong, Hua Lu, Beng Chin Ooi, Dongxiang Zhang, and Meihui Zhang (2012) "Efficient Spatial Keyword Search in Trajectory Databases". Technical Report
Attachments
- 3D model retrieval
- Answer Selection in Question Answering
- Presentation (Hugo Rodrigues)
- Presentation (Pedro Mota)
- Presentation (Pedro Pascoal)
- Presentation (Wesley Mathew)
- Evaluation (Hugo Rodrigues)
- Evaluation (Miguel Costa)
- Evaluation (Pedro Mota)
- Evaluation (Pedro Pascoal)
- Evaluation (Wesley Mathew)
- Classification and Regression Trees
- Efficient document similarity computation
- FAQ Retrieval
- IR Tree Data Structure
- Learning to Rank
- Report (Hugo Rodrigues)
- Report (Miguel Costa)
- Report (Pedro Mota)
- Report (Pedro Pascoal)
- Report (Wesley Mathew)
- Spatio-Textual Search on Road Networks
- TREC Microblog Retrieval
- Text Driven Forecasting
- The Bag-of-Words Method for Multimedia and 3D Model Retrieval