Commit Graph

4958 Commits

Author SHA1 Message Date
Sarah Hoffmann
4e79533f1f adapt docs for release 2025-02-05 12:08:33 +01:00
Sarah Hoffmann
ee8915f2b6 prepare 5.0.0 release v5.0.0 2025-02-05 10:54:38 +01:00
Sarah Hoffmann
5475bf7b9c Merge pull request #3635 from lonvia/replace-wikimedia-importance-test-data
Update wikimedia importance file for test database
2025-01-14 16:49:52 +01:00
Sarah Hoffmann
95e2d8c846 adapt tests to changed wikimedia importance test table 2025-01-14 14:19:17 +01:00
Sarah Hoffmann
7552818866 replace wikimedia importance file for test data with CSV version 2025-01-14 09:16:25 +01:00
Sarah Hoffmann
db3991af74 Merge pull request #3626 from lonvia/import-performance
Import performance
2025-01-10 16:44:33 +01:00
Sarah Hoffmann
4523b9aaed Merge pull request #3631 from lonvia/avoid-transactions
Creating tables and indexes in autocommit mode
2025-01-10 16:44:18 +01:00
Sarah Hoffmann
8b1cabebd6 Merge pull request #3633 from lonvia/restrict-long-ways
Ignore overly long ways during import
2025-01-10 16:06:37 +01:00
Sarah Hoffmann
0cf636a80c ignore overly long ways during import 2025-01-10 13:55:43 +01:00
Sarah Hoffmann
c2cb6722fe use autocommit when creating tables and indexes
Might avoid some deadlock situations with autovacuum.
2025-01-09 17:14:37 +01:00
Sarah Hoffmann
f8337bedb2 Merge pull request #3629 from lonvia/additional-breaks
Introduce new break types and phrase splitting for Japanese addresses
2025-01-09 13:55:29 +01:00
Sarah Hoffmann
efc09a5cfc add japanese phrase preprocessing
Code adapted from GSOC code by @miku.
2025-01-09 09:24:10 +01:00
Sarah Hoffmann
86ad9efa8a keep break indicators [:-] during normalisation
All punctuation will be converted to '-'. Soft breaks : may be
added by preprocessors. The break signs are only used during
query analysis and are ignored during import token analysis.
2025-01-09 09:21:55 +01:00
Sarah Hoffmann
d984100e23 add inner word break penalty 2025-01-07 21:42:25 +01:00
Sarah Hoffmann
499110f549 add SOFT_PHRASE break and enable parsing
Also enables parsing of PART breaks.
2025-01-06 17:10:24 +01:00
Sarah Hoffmann
267e5dac0d split up MultiPolygons before adding them to large_areas table 2024-12-22 09:15:16 +01:00
Sarah Hoffmann
32d3eb46d5 move geometry split into insertLocationAreaLarge()
thus insert only needs to be called once.
2024-12-22 09:15:16 +01:00
Sarah Hoffmann
c8a0dc8af1 more efficient belongs-to-address determination 2024-12-22 09:15:16 +01:00
Sarah Hoffmann
14ecfc7834 Merge pull request #3619 from lonvia/demote-farms
Remove farms and isolated dwellings from computed addresses
2024-12-22 09:13:42 +01:00
Sarah Hoffmann
cad44eb00c remove farms and isolated dwellings from computed addresses
Farms and isolated dwellings are usually confined to a very small
area. It does not make sense if they are automatically used in
addressing surrounding features. Still works to use them for
parenting when used with addr:place.
2024-12-20 22:59:02 +01:00
Sarah Hoffmann
f76dbb0a16 docs: update Update docs for virtualenv use 2024-12-20 11:27:45 +01:00
Sarah Hoffmann
8dd218a1d0 Merge pull request #3618 from osm-search/settings-md-table-space-osm-index
Settings.md - one setting was repeated
2024-12-19 08:40:31 +01:00
mtmail
501e13483e Settings.md - one setting was repeated 2024-12-18 21:58:51 +01:00
Sarah Hoffmann
b1d25e404f Merge pull request #3617 from mtmail/pr-3615-wording
Slight wording changes for Import-Styles.md
2024-12-18 11:04:21 +01:00
marc tobias
71fceb6854 Slight wording changes for Import-Styles.md 2024-12-18 01:02:46 +01:00
Sarah Hoffmann
a06e123d70 Merge pull request #3616 from osm-search/tokenizers-md-typo
fix typo in Tokenizers.md
2024-12-17 08:43:16 +01:00
mtmail
df6f70d223 fix typo in Tokenizers.md 2024-12-16 23:38:18 +01:00
Sarah Hoffmann
9058dabf1a Merge pull request #3615 from lonvia/overhaul-osm2pgsql-style
Overhaul osm2pgsql style
2024-12-16 19:13:46 +01:00
Sarah Hoffmann
2535780282 exclude more tourism=information types 2024-12-16 10:44:37 +01:00
Sarah Hoffmann
48333bfbd4 reintroduce brand and remove etymology 2024-12-16 10:26:55 +01:00
Sarah Hoffmann
99cf552c17 exclude unnamed swimming pools
Publicly accessible ones are usually mapped as the bigger area
with the water park.
2024-12-16 10:26:55 +01:00
Sarah Hoffmann
ad214753fc include lock names mapped with "lock_name"
Fixes #3365.
2024-12-16 10:26:55 +01:00
Sarah Hoffmann
0d500d4bd1 do not save names when falling back to addresses
If an object doesn't have a useable main tag, then the names should
always be ignored, independently of the presence of housenumbers.
We have to assume that the name belongs to a feature that was
intentionally filtered out.
2024-12-16 10:26:55 +01:00
Sarah Hoffmann
b1e5265d33 switch to subtags for tourism=information and natural=water 2024-12-16 10:26:55 +01:00
Sarah Hoffmann
e2a9b5fdf7 exclude sidewalks and similar footways
These footways are part of a street that is usually already named.
2024-12-16 10:26:55 +01:00
Sarah Hoffmann
eeb3d5dd0a make nominatim callable with themepark style 2024-12-16 10:26:55 +01:00
Sarah Hoffmann
a75dd32f75 adapt documentation for style import 2024-12-16 10:26:55 +01:00
Sarah Hoffmann
e1e8182c72 adapt taginfo script to new configuration structure 2024-12-16 10:26:55 +01:00
Sarah Hoffmann
59bce26afe convert import styles to themepark
Introduces presets which avoid much of the previous configuration
duplication. The original import files are now thin wrappers around
the themepark themes.
2024-12-16 10:26:55 +01:00
Sarah Hoffmann
d1b7c14f79 osm2pgsql style: add modification for name and address, with tests 2024-12-16 10:26:55 +01:00
Sarah Hoffmann
59416178bd osm2pgsql style: simplify computation of extra tags
Now implemented as a simple filter function which can also be
customized by the user.
2024-12-16 10:26:55 +01:00
Sarah Hoffmann
70e351c528 osm2pgsql style: merge main tag and pre-filter handling
Defining a tag as deleteable/extratag and main tag is mutually exclusive
and deleting certain key/value combinations to exclude them from being
used as a main tag is confusing. By merging the handling, such
excludes can now be made explicit in the main list.

By using the same lookup table, it is now also possible to have a
short-cut for uninteresting objects.
2024-12-16 10:26:55 +01:00
Sarah Hoffmann
1eed2fa395 do not touch original tags of osm2pgsql OSM object 2024-12-16 10:26:55 +01:00
Sarah Hoffmann
438b8fed35 convert flex-base.lua into a themepark theme
This already allows to run Nominatim under themepark, currently
as a topic-less theme.
2024-12-16 10:26:55 +01:00
Sarah Hoffmann
4760e8341b move lua scripts into a separate directory 2024-12-16 10:26:55 +01:00
Sarah Hoffmann
639630d5fe Merge pull request #3610 from lonvia/search-preprocessing
Add configurable query preprocessing
2024-12-16 10:26:09 +01:00
Sarah Hoffmann
5b40aa579b add developers documentation for query-side of tokenizer 2024-12-13 17:09:42 +01:00
Sarah Hoffmann
fbb6edfdaf add documentation for new query preprocessing 2024-12-13 16:53:08 +01:00
Sarah Hoffmann
2b87c016db generalize normalization step for search query
It is now possible to configure functions for changing the query
input before it is analysed by the tokenizer.

Code is a cleaned-up version of the implementation by @miku.
2024-12-13 14:31:08 +01:00
Sarah Hoffmann
a894e0f3a4 docs: external osm2pgsql is not longer optional 2024-12-13 10:36:45 +01:00