add Python part for new ICU-based tokenizer

Sarah Hoffmann
2021-05-02 17:52:45 +02:00
parent 3c67bae868
commit f44af49df9
6 changed files with 1626 additions and 1 deletions

@@ -46,6 +46,12 @@ NOMINATIM_LANGUAGES=
# Changing this value requires a reimport.
NOMINATIM_TERM_NORMALIZATION=":: NFD (); [[:Nonspacing Mark:] [:Cf:]] >; :: lower (); [[:Punctuation:][:Space:]]+ > ' '; :: NFC ();"
# Configuration file for the tokenizer.
# The content depends on the tokenizer used. If left empty the default settings
# for the chosen tokenizer will be used. The configuration can only be set
# on import and not be changed afterwards.
NOMINATIM_TOKENIZER_CONFIG=
# Search in the Tiger house number data for the US.
# Note: The tables must already exist or queries will throw errors.
# Changing this value requires running ./utils/setup --create-functions --setup-website.
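
The comment on NOMINATIM_TOKENIZER_CONFIG describes a fallback: an empty value means the tokenizer's default settings apply. A minimal sketch of how the Python side might resolve this is shown below. The function name resolve_tokenizer_config and the default path are hypothetical and are not taken from this commit; only the variable name NOMINATIM_TOKENIZER_CONFIG comes from the settings file.

```python
import os
from pathlib import Path

# Hypothetical default location; the real default is not given in this commit.
DEFAULT_TOKENIZER_CONFIG = Path("settings/tokenizer_default.cfg")


def resolve_tokenizer_config(env=None, default=DEFAULT_TOKENIZER_CONFIG):
    """Return the path of the tokenizer configuration file to use.

    An unset or empty NOMINATIM_TOKENIZER_CONFIG selects the default.
    """
    env = os.environ if env is None else env
    value = env.get("NOMINATIM_TOKENIZER_CONFIG", "").strip()
    return Path(value) if value else Path(default)
```

Because the setting can only be chosen at import time, an importer built along these lines would presumably call such a function once and store the result with the database rather than re-reading the environment on later runs.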