mirror of https://github.com/osm-search/Nominatim.git
synced 2026-02-16 15:47:58 +00:00
avoid special characters in word tokens
Transliterated tokens should consist only of ASCII letters and digits. Any other character is dropped.
@@ -21,8 +21,8 @@ transliteration:
     - !include icu-rules/extended-unicode-to-asccii.yaml
     - ":: Ascii ()"
     - ":: NFD ()"
-    - "[^[:Ascii:]] >"
     - ":: lower ()"
+    - "[^a-z0-9[:Space:]] >"
     - ":: NFC ()"
 sanitizers:
     - step: split-name-list
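To illustrate the difference between the two filter rules in the hunk above, here is a rough sketch in plain Python. It is an approximation using only the standard library, not the ICU transliterator that Nominatim runs: the `!include` rule file and the `:: Ascii ()` step are not emulated, and the function names `approx_old_chain` and `approx_new_chain` are hypothetical.

```python
import re
import unicodedata


def approx_old_chain(text: str) -> str:
    """Approximates: NFD, "[^[:Ascii:]] >", lower, NFC."""
    decomposed = unicodedata.normalize("NFD", text)
    # Drop everything outside the ASCII range; ASCII punctuation survives.
    ascii_only = re.sub(r"[^\x00-\x7f]", "", decomposed)
    return unicodedata.normalize("NFC", ascii_only.lower())


def approx_new_chain(text: str) -> str:
    """Approximates: NFD, lower, "[^a-z0-9[:Space:]] >", NFC."""
    decomposed = unicodedata.normalize("NFD", text)
    lowered = decomposed.lower()
    # Keep only a-z, 0-9 and whitespace; punctuation and combining marks go.
    # Assumption: Python's \s stands in for ICU's [:Space:] set.
    kept = re.sub(r"[^a-z0-9\s]", "", lowered)
    return unicodedata.normalize("NFC", kept)


if __name__ == "__main__":
    sample = "St. Mary's Café"
    print(approx_old_chain(sample))  # punctuation is kept
    print(approx_new_chain(sample))  # letters, digits and spaces only
```

Under these assumptions, the ASCII-only filter leaves characters such as `.` and `'` in the token, while the stricter filter leaves only lowercase letters, digits and whitespace.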