Mirror of https://github.com/osm-search/Nominatim.git (synced 2026-03-09 03:24:06 +00:00)
avoid special characters in word tokens
The transliterated form should consist only of lowercase ASCII letters, digits and whitespace. All other characters are dropped.
@@ -21,8 +21,8 @@ transliteration:
     - !include icu-rules/extended-unicode-to-asccii.yaml
     - ":: Ascii ()"
     - ":: NFD ()"
-    - "[^[:Ascii:]] >"
     - ":: lower ()"
+    - "[^a-z0-9[:Space:]] >"
     - ":: NFC ()"
 sanitizers:
     - step: split-name-list
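The effect of the rule change can be approximated in plain Python. This is only a sketch using the standard library, not ICU itself: it emulates the last four rules (NFD, the character filter, lower, NFC) and skips the include file and the ":: Ascii ()" step. The function names old_rules and new_rules are hypothetical, and Python's \s is assumed to be a close enough stand-in for ICU's [:Space:].

```python
import re
import unicodedata


def old_rules(text: str) -> str:
    """Approximate the old tail: NFD, drop non-ASCII, lower, NFC."""
    text = unicodedata.normalize("NFD", text)
    # "[^[:Ascii:]] >" removed everything outside the ASCII range,
    # which still let ASCII punctuation through.
    text = re.sub(r"[^\x00-\x7f]", "", text)
    text = text.lower()
    return unicodedata.normalize("NFC", text)


def new_rules(text: str) -> str:
    """Approximate the new tail: NFD, lower, keep only a-z, 0-9, space, NFC."""
    text = unicodedata.normalize("NFD", text)
    text = text.lower()
    # "[^a-z0-9[:Space:]] >" runs after lowercasing, so uppercase letters
    # are already folded and are not deleted by the a-z filter.
    text = re.sub(r"[^a-z0-9\s]", "", text)
    return unicodedata.normalize("NFC", text)


if __name__ == "__main__":
    for sample in ["St. John's-Wood 12", "Café"]:
        print(repr(sample), repr(old_rules(sample)), repr(new_rules(sample)))
```

With the old filter, ASCII punctuation such as ".", "'" and "-" stays in the result. With the new filter it is removed. The filter also moves after ":: lower ()", which matters because a filter on a-z applied before lowercasing would delete uppercase letters.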