Summaries

Web Crawling (cont.)

15 November 2019, 12:30 Bruno Emanuel Da Graça Martins

  • Universal crawlers
    • Concurrent crawlers
    • Freshness by prioritising according to predictions for changed contents
    • Coverage by prioritising according to PageRank
  • Preferential crawlers
    • Bias of breadth-first crawlers
    • Focused crawlers leveraging supervised learning
    • Topical crawlers leveraging similarity towards topic
    • Adaptive crawlers
    • Figures of merit for preferential crawlers
  • Crawler etiquette
    • Crawler ethics and conflicts
    • The robots exclusion protocol
  • Introduction to min-hash and high-dimensional similarity search
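
As an illustration of the robots exclusion protocol item above, the following is a minimal sketch of how a crawler might check permissions before fetching a page, using Python's standard urllib.robotparser module. The robots.txt content, the "ExampleBot" user agent, and the example.org URLs are assumptions made up for this illustration; they do not come from the lecture.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content, parsed offline so no network access is needed.
ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
"""


def build_parser(robots_txt: str) -> RobotFileParser:
    """Build a parser from the text of a robots.txt file."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# "ExampleBot" is a made-up user agent; the wildcard rule group applies to it.
allowed = parser.can_fetch("ExampleBot", "https://example.org/index.html")
blocked = parser.can_fetch("ExampleBot", "https://example.org/private/data.html")

print("index.html allowed:", allowed)
print("private/data.html allowed:", blocked)
```

A polite crawler would typically run such a check for every URL taken from its frontier and skip the ones that are disallowed.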


Lab 05 - Part 2: Information Extraction

15 November 2019, 11:00 Danielle Caled Vieira

Hidden Markov Models

  • Pen and paper exercises using the structured perceptron


Lab 05 - Part 2: Information Extraction

15 November 2019, 09:30 Danielle Caled Vieira

Hidden Markov Models

  • Pen and paper exercises using the structured perceptron


Learning to Rank

14 November 2019, 17:00 João Miguel Cordeiro Monteiro

  • Python exercises on Learning to Rank
  • Pointwise L2R approaches using Logistic Regression
  • Pen and paper exercise using the perceptron ranking algorithm