Sarah Hoffmann
4cc788f69e
enable flake for Python tests
2025-03-09 15:33:24 +01:00
Sarah Hoffmann
6b0d58d9fd
restrict postcode parsing in typed phrases
...
Postcodes can only appear in postcode-type phrases and must then
cover the full phrase
2025-03-05 10:09:33 +01:00
Sarah Hoffmann
afb89f9c7a
add unit tests for postcode parser
2025-03-04 16:25:00 +01:00
Sarah Hoffmann
a574b98e4a
remove postcode computation for word table during import
2025-03-04 08:57:59 +01:00
Sarah Hoffmann
6759edfb5d
make word generation from query a class method
2025-03-04 08:57:37 +01:00
Sarah Hoffmann
e362a965e1
search: merge QueryPart array with QueryNodes
...
The basic information on terms is pretty much always used together
with the node information. Merging them together saves some
allocation while making lookup easier at the same time.
2025-03-04 08:57:37 +01:00
Sarah Hoffmann
13db4c9731
replace datrie library with a simpler pure-Python class
2025-02-24 10:24:21 +01:00
Sarah Hoffmann
49bd18b048
replace PhraseType enum with simple int constants
2025-02-21 16:44:12 +01:00
Sarah Hoffmann
31412e0674
replace TokenType enum with simple char constants
2025-02-21 10:23:41 +01:00
Sarah Hoffmann
4577669213
replace BreakType enum with simple char constants
2025-02-21 09:57:48 +01:00
Sarah Hoffmann
95e2d8c846
adapt tests to changed wikimedia importance test table
2025-01-14 14:19:17 +01:00
Sarah Hoffmann
efc09a5cfc
add Japanese phrase preprocessing
...
Code adapted from GSOC code by @miku.
2025-01-09 09:24:10 +01:00
Sarah Hoffmann
4760e8341b
move Lua scripts into a separate directory
2024-12-16 10:26:55 +01:00
Sarah Hoffmann
2b87c016db
generalize normalization step for search query
...
It is now possible to configure functions for changing the query
input before it is analysed by the tokenizer.
Code is a cleaned-up version of the implementation by @miku.
2024-12-13 14:31:08 +01:00
Sarah Hoffmann
5160a1d577
get bbox of postcode areas into results
2024-09-30 08:58:40 +02:00
Sarah Hoffmann
90e207a497
drop automatic migration from versions <4.3
2024-09-27 12:07:48 +02:00
Sarah Hoffmann
a690605a96
remove support for unindexed tokens
...
This was a special feature of the legacy tokenizer, which would not
index very frequent tokens.
2024-09-22 10:39:10 +02:00
Sarah Hoffmann
b54ff7d766
remove all references to a module path
...
No longer used now that legacy tokenizer is gone.
2024-09-21 17:39:01 +02:00
Sarah Hoffmann
b87d6226fb
remove legacy tokenizer and direct tests
2024-09-21 11:38:08 +02:00
Sarah Hoffmann
7717bbf59d
remove remaining references to PHP code
2024-09-15 15:33:59 +02:00
Sarah Hoffmann
6bc044d9c7
remove website setup
...
The website directory was for PHP scripts only and is no longer
needed.
2024-09-15 11:58:55 +02:00
Sarah Hoffmann
882fb16881
restrict use of os.environ in Configuration
...
Only use the OS environment when the environ parameter is set
to None. Currently the OS env is also used when an empty dict is passed.
2024-09-01 16:17:30 +02:00
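The distinction described in the commit above (None versus an empty dict) can be illustrated with a minimal sketch. The function names below are hypothetical and are not taken from the Configuration class; the "buggy" variant is only an assumption about how such a fallback typically ends up treating an empty dict like None.

```python
import os
from typing import Mapping, Optional


def resolve_environ_buggy(environ: Optional[Mapping[str, str]] = None) -> Mapping[str, str]:
    # Hypothetical illustration: a truthiness check treats an empty dict
    # the same as None and silently falls back to the OS environment.
    return environ or os.environ


def resolve_environ(environ: Optional[Mapping[str, str]] = None) -> Mapping[str, str]:
    # Hypothetical illustration: only fall back to the OS environment
    # when the caller explicitly passed None.
    return os.environ if environ is None else environ
```

With the explicit `is None` check, a caller can pass an empty dict to get a configuration that is isolated from the OS environment, which is useful in tests.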
Sarah Hoffmann
7f11de0db9
allow None and str for project_dir in NominatimAPI init
2024-08-22 22:49:12 +02:00
Sarah Hoffmann
c2594aca40
make NominatimAPI[Async] a context manager
...
If close() isn't properly called, it can lead to odd error messages
about uncaught exceptions.
2024-08-19 11:31:38 +02:00
Sarah Hoffmann
8e8f7a641b
use custom result formatters in CLI commands
2024-08-16 19:30:57 +02:00
Sarah Hoffmann
5a61d3d5f6
configurable error formatting and content type in result formatter
2024-08-14 12:00:08 +02:00
Sarah Hoffmann
0c25e80be0
make formatting module non-static
2024-08-13 22:39:43 +02:00
Sarah Hoffmann
d22ca186e4
remove v1-specific functions from ASGIAdaptor
2024-08-13 19:38:14 +02:00
Sarah Hoffmann
6527b7cdcd
fail if osm2pgsql is not recent enough
2024-08-09 19:25:15 +02:00
marc tobias
f0390cfe85
add-data: warn and exit if database is frozen
2024-08-05 16:14:19 +02:00
Sarah Hoffmann
9659afbade
port code to psycopg3
2024-07-29 08:50:19 +02:00
Sarah Hoffmann
3742fa2929
make DB helper functions free functions
...
Also changes the drop function so that it can drop multiple tables
at once.
2024-07-29 08:49:30 +02:00
Sarah Hoffmann
4da4cbfe27
reduce from 3 to 2 packages
2024-06-28 09:13:22 +02:00
Sarah Hoffmann
44d5148e5f
fix merge issues
2024-06-26 11:52:47 +02:00
Sarah Hoffmann
2bab0ca060
port unit tests to new python package layout
2024-06-26 11:52:47 +02:00
Sarah Hoffmann
5b02cd22b9
add tests for new importance CSV import
2024-05-16 15:23:54 +02:00
Sarah Hoffmann
60b03d506f
add CSV format for importance import
2024-05-16 15:23:54 +02:00
Sarah Hoffmann
77631f90fd
reindex postcodes that lose their parents
...
When the parent place of a postcode is deleted, it needs to
be reindexed to get a new parent. Otherwise the display of
results is broken.
2024-05-04 12:33:26 +02:00
Sarah Hoffmann
8f3845660f
add full tokens to addresses
...
This is now needed to weigh results.
2024-05-02 11:47:35 +02:00
Sarah Hoffmann
f923304eea
add slight preference for locating point POIs over POI areas
2024-04-11 10:21:31 +02:00
Sarah Hoffmann
38798bba13
increase search area when filtering by postcode
2024-04-02 19:36:16 +02:00
Sarah Hoffmann
fe873ad0e2
adapt tests for windowing SQL
...
Results with high penalty are now thrown out earlier.
2024-04-02 16:32:49 +02:00
Sarah Hoffmann
07b7fd1dbb
add address counts to tokens
2024-03-18 11:25:48 +01:00
Sarah Hoffmann
ae7c584e28
use import date from osm2pgsql property table if available
2024-03-05 11:33:32 +01:00
Sarah Hoffmann
247065ff6f
Merge pull request #3342 from mtmail/tyops
...
Correct some typos
2024-02-28 14:25:16 +01:00
Sarah Hoffmann
c6d40d4bf4
reduce importance when computed from search rank
2024-02-27 10:15:54 +01:00
marc tobias
7205491b84
Correct some typos
2024-02-26 18:13:30 +01:00
Sarah Hoffmann
1e0025b095
also switch unit tests for CLI
2024-02-19 18:39:01 +01:00
Sarah Hoffmann
81eed0680c
recreate word table when refreshing counts
...
The counting touches a large part of the word table, leaving
bloated tables and indexes. Thus recreate the table instead and
swap it in.
2024-02-04 21:35:10 +01:00
Sarah Hoffmann
fed46240d5
disallow category tokens in the middle of a query string
...
This already worked for left-to-right readings and is now also
implemented for right-to-left readings. A qualifier must always be
before or after the name.
2024-01-28 19:03:11 +01:00