Nominatim

mirror of https://github.com/osm-search/Nominatim.git synced 2026-02-14 18:37:58 +00:00

Author	SHA1	Message	Date
Sarah Hoffmann	9447c90b09	adapt tests to new country token format	2025-12-01 13:10:18 +01:00
Sarah Hoffmann	81c6cb72e6	add normalised country name to word table Country tokens now follow the usual convention of having the normalized version in the word column and the extra info about the country code in the info column.	2025-12-01 13:10:18 +01:00
Sarah Hoffmann	186f562dd7	remove automatic setup of tokenizer directory ICU tokenizer doesn't need any extra data anymore, so it doesn't make sense to create a directory which then remains empty. If a tokenizer needs such a directory in the future, it needs to create it on its own and make sure to handle the situation correctly where no project directory is used at all.	2025-04-02 20:20:04 +02:00
Sarah Hoffmann	be4ba370ef	adapt tests to extended results	2025-03-31 14:52:50 +02:00
Sarah Hoffmann	4cc788f69e	enable flake for Python tests	2025-03-09 15:33:24 +01:00
Sarah Hoffmann	a574b98e4a	remove postcode computation for word table during import	2025-03-04 08:57:59 +01:00
Sarah Hoffmann	3742fa2929	make DB helper functions free functions Also changes the drop function so that it can drop multiple tables at once.	2024-07-29 08:49:30 +02:00
Sarah Hoffmann	4da4cbfe27	reduce from 3 to 2 packages	2024-06-28 09:13:22 +02:00
Sarah Hoffmann	2bab0ca060	port unit tests to new python package layout	2024-06-26 11:52:47 +02:00
Sarah Hoffmann	8f3845660f	add full tokens to addresses This is now needed to weigh results.	2024-05-02 11:47:35 +02:00
Sarah Hoffmann	07b7fd1dbb	add address counts to tokens	2024-03-18 11:25:48 +01:00
Sarah Hoffmann	81eed0680c	recreate word table when refreshing counts The counting touches a large part of the word table, leaving bloated tables and indexes. Thus recreate the table instead and swap it in.	2024-02-04 21:35:10 +01:00
Sarah Hoffmann	645ea5a057	use information from tokenizer to determine street vs. place address So far the SQL logic used the information from the address field to determine if an address is attached to a street or place. This changes the logic to use the information provided in the token_info. This allows sanitizers to enforce a certain parenting without changing the visible address information.	2023-06-30 11:08:25 +02:00
Sarah Hoffmann	bce93d60bd	move PlaceInfo into data submodule This data structure is shared between indexer and tokenizer.	2022-07-06 10:54:47 +02:00
Sarah Hoffmann	612d34930b	handle postcodes properly on word table updates update_postcodes_from_db() needs to do the full postcode treatment in order to derive the correct word table entries.	2022-06-23 23:42:31 +02:00
Sarah Hoffmann	80ea13437d	move postcode matcher in a separate file	2022-06-23 23:42:31 +02:00
Sarah Hoffmann	0a9f971e44	add tests for new analyzed housenumbers	2022-03-01 09:34:32 +01:00
Sarah Hoffmann	c170d323d9	add tests for cleaning housenumbers	2022-01-20 23:47:20 +01:00
Sarah Hoffmann	d09db09849	adapt ICU tests to new housenumber sanitizer Restrict tests to making sure that handing in multiple housenumbers works.	2022-01-20 16:05:49 +01:00
Sarah Hoffmann	c3788d765e	add consistent SPDX copyright headers	2022-01-03 16:23:58 +01:00
Sarah Hoffmann	7f7d2fd5b3	skip most addr: tags with suffixes Only one addr: tag can be processed currently, so make sure it is the one without suffixes to not get odd data. addr:street is the exception because it uses a different matching mechanism.	2021-12-06 14:55:10 +01:00
Sarah Hoffmann	44cfce1ca4	revert to using full names for street name matching Using partial names turned out to not work well because there are often similarly named streets next to each other. It also prevents us from being able to take into account all addr:street:* tags. This change gets all the full term tokens for the addr:street tags from the DB. As they are used for matching only, we can assume that the term must already be there or there will be no match. This avoids creating unused full name tags.	2021-12-06 11:38:38 +01:00
Sarah Hoffmann	5a9fb6eaf7	specify text type in test SQL Older versions of postgres fail otherwise.	2021-12-03 13:56:23 +01:00
Sarah Hoffmann	14a78f55cd	more unit tests for tokenizers	2021-12-02 15:46:36 +01:00
Sarah Hoffmann	c8958a22d2	tests: add fixture for making test project directory	2021-11-30 18:01:46 +01:00
Sarah Hoffmann	b90e719da5	organise python tests in subdirectories The directories follow the same structure as the modules in nominatim/.	2021-11-30 11:22:26 +01:00

26 Commits