anqixxx
9c2fdf5eae
Improve README formatting and add install steps, adding a general cloning step before the virtual environment. This would have been helpful for me during Nominatim setup
2025-05-21 10:14:36 -07:00
Sarah Hoffmann
800c56642b
tweak full count cut-off (as per deployment on osm.org)
2025-05-11 11:48:07 +02:00
Sarah Hoffmann
b51fed025c
Merge pull request #3732 from lonvia/exclude-country-from-direction-penalty
...
Exclude address searches with country from direction penalty
2025-04-30 10:45:37 +02:00
Sarah Hoffmann
34b72591cc
exclude address searches with country from direction penalty
...
Countries are not adequately represented by partial term counts.
2025-04-29 17:37:31 +02:00
Sarah Hoffmann
bc450d110c
Merge pull request #3722 from emmanuel-ferdman/master
...
resolve datetime deprecation warnings
2025-04-22 14:21:05 +02:00
Sarah Hoffmann
388acf4727
Merge pull request #3726 from lonvia/revert-json-format-change
...
Revert accidental change in json output format
2025-04-18 14:43:51 +02:00
Sarah Hoffmann
3999977941
revert accidental change in json output format
2025-04-18 12:05:25 +02:00
Emmanuel Ferdman
df58870e3f
resolve datetime deprecation warnings
...
Signed-off-by: Emmanuel Ferdman <emmanuelferdman@gmail.com >
2025-04-17 11:15:16 -07:00
Sarah Hoffmann
478a8741db
Merge pull request #3719 from lonvia/query-direction
...
Estimate query direction
2025-04-17 15:17:56 +02:00
Sarah Hoffmann
7f710d2394
add a comment about the precomputed denominator
2025-04-15 09:38:05 +02:00
Sarah Hoffmann
06e39e42d8
add direction penalties
...
Direction penalties are estimated by getting the name to address
ratio usage for each partial term in the query and computing the
linear regression of that ratio over the entire phrase. Or to put
it in ither words: we try to determine if the terms at the beginning
or the end of the query are more likely to constitute a name.
Direction penalties are currently used only in classic name queries.
2025-04-11 20:41:06 +02:00
Sarah Hoffmann
2ef0e20a3f
reorganise token reranking
...
As the reranking is about changing penalties in presence of other
tokens, change the datastructure to have the other tokens readily
avilable.
2025-04-11 13:38:34 +02:00
Sarah Hoffmann
b680d81f0a
ensure that bailout-check is done after each iteration
2025-04-11 11:02:11 +02:00
Sarah Hoffmann
e0e067b1d6
replace use of range when computing word list
2025-04-11 09:59:04 +02:00
Sarah Hoffmann
3980791cfd
use iterator instead of list to go over partials
2025-04-11 09:38:24 +02:00
Sarah Hoffmann
497e27bb9a
move partial token into a separate field in the query struct
...
There is exactly one token to be expected and the token is usually
present.
2025-04-11 08:57:34 +02:00
Sarah Hoffmann
1db717b886
Merge pull request #3716 from lonvia/github-cache-osm2pgsql-binary
...
Github actions: cache compiled osm2pgsql binary
For the tests on Ubunutu 22-04 we need to compile osm2pgsql because the version they ship is too old. This adds caching of the compiled binary, so that we don't need to recompile for each CI run. Together with the new BDD tests that shaves around 10 min off a CI run.
2025-04-10 17:20:32 +02:00
Sarah Hoffmann
b47c8ccfb1
actions: cache compiled osm2pgsql binary
2025-04-10 16:06:27 +02:00
Sarah Hoffmann
63b055283d
Merge pull request #3714 from lonvia/postcode-update-without-project-dir
...
Change postcode update function to work without a project directory
2025-04-10 08:51:22 +02:00
Sarah Hoffmann
b80e6914e7
Merge pull request #3715 from lonvia/demote-tags-to-fallbacks
...
Demote historic and tourism=attraction to fallback tags
2025-04-10 08:51:06 +02:00
Sarah Hoffmann
9d00a137fe
demote historic and tourism=attraction to fallback tags
2025-04-09 20:15:18 +02:00
Sarah Hoffmann
97d9e3c548
allow updating postcodes without a project directory
...
Postcodes will then be updated without looking for external postcodes.
2025-04-09 20:04:01 +02:00
Sarah Hoffmann
e4180936c1
Merge pull request #3713 from lonvia/bdd-pytest-db-test
...
Move BDD tests to pytest-bdd
2025-04-09 19:37:30 +02:00
Sarah Hoffmann
34e0ecb44f
update documentation for BDD tests
2025-04-09 15:21:50 +02:00
Sarah Hoffmann
d95e9737da
remove usage of behave
2025-04-09 14:57:39 +02:00
Sarah Hoffmann
b34991d85f
add BDD tests for DB
2025-04-09 14:52:34 +02:00
Sarah Hoffmann
5f44aa2873
improve table comparison
2025-04-04 11:02:51 +02:00
Sarah Hoffmann
dae643c040
move database setup to generic conftest.py
2025-04-04 11:02:51 +02:00
Sarah Hoffmann
ee62d5e1cf
remove old behave osm2pgsql BDD tests
2025-04-04 11:02:51 +02:00
Sarah Hoffmann
fb440f29a2
implement BDD osm2pgsql tests with pytest-bdd
2025-04-04 11:02:51 +02:00
Sarah Hoffmann
0f725b1880
enable python-bdd for github actions
2025-04-04 11:02:51 +02:00
Sarah Hoffmann
39f56ba4b8
restrict coordinate output to 7 digits
2025-04-04 11:02:51 +02:00
Sarah Hoffmann
6959577aa4
replace behave BDD API tests with pytest-bdd tests
2025-04-04 11:02:51 +02:00
Sarah Hoffmann
50d4b0a386
Merge pull request #3687 from asharmalik19/test-linked-places-language
...
test: linked places expand default language names
2025-04-04 10:58:53 +02:00
Ashar
9ff93bdb3d
Update linked places name test
...
Clean up test scenario by removing extra language variations and
improving table readability.
2025-04-03 14:30:18 -04:00
Ashar
e0bf553aa5
test: linked places expand default language names
...
Add failing test for issue #2714 to verify default language expansion
2025-04-03 14:30:18 -04:00
Sarah Hoffmann
2ce2d031fa
Merge pull request #3702 from lonvia/remove-tokenizer-dir
...
Remove automatic setup of tokenizer directory
So far the tokenizer factory would create a directory for private data for the tokenizer and then hand in the directory location to the tokenizer.
ICU tokenizer doesn't need any extra data anymore, so it doesn't make sense to create a directory which then remains empty. If a tokenizer needs such a directory in the future, it needs to create it on its own and make sure to handle the situation correctly where no project directory is used at all.
2025-04-03 09:04:48 +02:00
Sarah Hoffmann
186f562dd7
remove automatic setup of tokenizer directory
...
ICU tokenizer doesn't need any extra data anymore, so it doesn't
make sense to create a directory which then remains empty. If a
tokenizer needs such a directory in the future, it needs to create
it on its own and make sure to handle the situation correctly where
no project directory is used at all.
2025-04-02 20:20:04 +02:00
Sarah Hoffmann
c5bbeb626f
Merge pull request #3700 from lonvia/ignore-inherited-addresses
...
Ignore POIs with inherited addresses for the address layer
2025-04-02 12:00:45 +02:00
Sarah Hoffmann
3bc77629c8
ignore POIs with inherited addresses for the address layer
...
We know that there is a building which describes the address as a
polygon and is therefore more suitable.
2025-04-02 10:30:45 +02:00
Sarah Hoffmann
6cf1287c4e
Merge pull request #3686 from astridx/output_names
...
Output names as setting
2025-04-01 20:16:15 +02:00
Sarah Hoffmann
a49e8b9cf7
Merge pull request #3675 from TuringVerified/generic-preprocessors
...
Add generic preprocessors
2025-04-01 20:14:43 +02:00
TuringVerified
2eeec46040
Remove unnecessary assert statement, Fix regex_replace docstring and simplify regex_replace
2025-04-01 18:54:30 +05:30
TuringVerified
6d5a4a20c5
Update documentation, optimise regex_replace, add tests
2025-04-01 18:54:30 +05:30
TuringVerified
4665ea3e77
Add generic preprocessor
2025-04-01 18:54:30 +05:30
Sarah Hoffmann
9cf5eee5d4
add instructions for pip package upload
2025-04-01 11:59:03 +02:00
Sarah Hoffmann
fce279226f
prepare release 5.1.0
v5.1.0
2025-04-01 10:16:35 +02:00
Sarah Hoffmann
54d895c4ce
Merge pull request #3695 from TuringVerified/doc-dependencies
...
[Small fix] Add documentation to install extras for mkdocstrings
2025-04-01 09:34:08 +02:00
TuringVerified
896a1c9d12
Add mkdocstrings extra
2025-04-01 11:06:46 +05:30
Sarah Hoffmann
32728d6c89
Merge pull request #3693 from lonvia/remove-unused-sql
...
Remove SQL function for address lookup
2025-03-31 17:11:39 +02:00