Sarah Hoffmann
|
c8873d34af
|
harmonize interface of token analysis module
The configure() function now receives a Transliterator object instead
of the ICU rules. This harmonizes the parameters with the create
function.
|
2022-07-29 10:43:07 +02:00 |
|
Sarah Hoffmann
|
d35e3c25b6
|
add type annotations for token analysis
No annotations for ICU types yet.
|
2022-07-18 09:47:57 +02:00 |
|
Sarah Hoffmann
|
13ed184efd
|
housenumber analyzer: avoid creating too many variants
Housenumber fields with lots of text are likely bad data, as are
fields with many changes between letters and digits. Do not create
optional-space variants for them.
|
2022-03-01 09:34:32 +01:00 |
|
Sarah Hoffmann
|
f03a05f6bb
|
add new analyser for housenumbers
This analyser makes spaces optional.
|
2022-03-01 09:34:32 +01:00 |
|