String matching
14 outubro 2019, 15:30 • Helena Galhardas
String matching challenges: accuracy and scalability
String matching algorithms:
- sequence-based: edit distance, Needleman-Wunch measure, Jaro measure, Jaro-winkler measure.
- token-based: overlap measure, Jaccard measure, TF/IDF measure
- phonetic-based: soundex
- hybrid methods: overview