Crawling the Web (online class)

27 novembro 2020, 11:00 Flávio Martins

Programming exercises:

  • Implementation of Web Crawler
  • Implementation of Focused Wikipedia Crawler


Pen and paper exercise:

  • Jaccard similarity
  • Min-hash signatures using permutations
  • Min-hash signatures using hashing

Questions about the Project.