String matching

14 outubro 2019, 15:30 Helena Galhardas

String matching challenges: accuracy and scalability

String matching algorithms:

  • sequence-based: edit distance, Needleman-Wunch measure, Jaro measure, Jaro-winkler measure.
  • token-based: overlap measure, Jaccard measure, TF/IDF measure
  • phonetic-based: soundex
  • hybrid methods: overview