I have documents with the fields name_en, name_de, name_fr, etc., containing the word cutter in English and mutter in German. If I fuzzy-search with name_en:cuter~1 (with only one t) it works fine, but if I search for name_de:muter~1 it does not return any result. Only name_de:muter~2 works and returns mutter. The languages have different analyzers in schema.xml, so that should be where the difference comes from, but it is still not clear why distance 1 does not work for German. As far as I can see, the distance between mutter and muter is 1, not 2. Could someone explain why the required distance is 2 rather than 1?

This happens because mutter is truncated by the German stemmer and gets indexed as mutt, whereas cutter appears to be left untouched by most English stemmers (tested with the Porter and Snowball/Porter2 algorithms, known to be the most aggressive). The fuzzy query is compared against the indexed term, not the original word, so the edit distance for cuter to match cutter is 1, while the edit distance for muter to match mutt is 2.
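The distances quoted here can be checked by hand with a small script. This is only a sketch of plain Levenshtein distance (insertions, deletions, substitutions); the function name `levenshtein` is made up for illustration, and Solr's own fuzzy matching may additionally count a transposition as a single edit, which does not change the result for these particular words.

```python
def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance between two strings (illustrative helper)."""
    # prev[j] holds the distance between the current prefix of a and b[:j].
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(
                prev[j] + 1,         # delete ca
                curr[j - 1] + 1,     # insert cb
                prev[j - 1] + cost,  # substitute ca with cb (or keep it)
            ))
        prev = curr
    return prev[-1]


# Query term vs. the term as it is assumed to be stored in the index.
print(levenshtein("cuter", "cutter"))  # English: cutter is indexed unchanged
print(levenshtein("muter", "mutt"))    # German: mutter is indexed as mutt
print(levenshtein("muter", "mutter"))  # what the distance would be without stemming
```

Comparing muter with mutt needs one substitution (e to t) and one deletion (the trailing r), which is why ~1 finds nothing in name_de while ~2 does.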