Sarah Hoffmann
186f562dd7
remove automatic setup of tokenizer directory
...
ICU tokenizer doesn't need any extra data anymore, so it doesn't
make sense to create a directory which then remains empty. If a
tokenizer needs such a directory in the future, it needs to create
it on its own and make sure to handle the situation correctly where
no project directory is used at all.
2025-04-02 20:20:04 +02:00
Sarah Hoffmann
3cb183ffb0
add lookup word to variants in word table
2025-03-31 14:52:50 +02:00
Sarah Hoffmann
1705bb5f57
do not save word counts of 1
...
This is the default setting, which will be assumed when the count is
missing.
2025-03-31 14:52:50 +02:00
Sarah Hoffmann
a574b98e4a
remove postcode computation for word table during import
2025-03-04 08:57:59 +01:00
Sarah Hoffmann
13db4c9731
replace datrie library with a more simple pure-Python class
2025-02-24 10:24:21 +01:00
Sarah Hoffmann
86ad9efa8a
keep break indicators [:-] during normalisation
...
All punctuation will be converted to '-'. Soft breaks : may be
added by preprocessors. The break signs are only used during
query analysis and are ignored during import token analysis.
2025-01-09 09:21:55 +01:00
Sarah Hoffmann
98c1b923fc
remove code only needed for older PostgreSQL/PostGIS versions
2024-11-18 10:11:09 +01:00
Sarah Hoffmann
1f07967787
fix style issue found by flake8
2024-11-10 22:47:14 +01:00
Sarah Hoffmann
b87d6226fb
remove legacy tokenizer and direct tests
2024-09-21 11:38:08 +02:00
Sarah Hoffmann
7717bbf59d
remove remaining references to php code
2024-09-15 15:33:59 +02:00
Sarah Hoffmann
fd33ef92dc
add deprecation warnings in the code
2024-08-09 19:25:15 +02:00
Sarah Hoffmann
9659afbade
port code to psycopg3
2024-07-29 08:50:19 +02:00
Sarah Hoffmann
3742fa2929
make DB helper functions free functions
...
Also changes the drop function so that it can drop multiple tables
at once.
2024-07-29 08:49:30 +02:00
Sarah Hoffmann
4da4cbfe27
reduce from 3 to 2 packages
2024-06-28 09:13:22 +02:00
Sarah Hoffmann
dc7c11a9d1
adapt plugin imports
2024-06-26 11:52:47 +02:00
Sarah Hoffmann
6e89310a92
split code into submodules
2024-06-26 11:52:47 +02:00