[postgis-users] Fuzzy Address Matching - PostgreSql equivalent to FuzzyStringComparer using Python difflib module
Imre Samu
pella.samu at gmail.com
Mon May 11 10:15:58 PDT 2020
> ... Fuzzy Address Matching? ...
> For instance, South Great Avenue, A City, Planet Earth may be put down as
the following:
sometimes .. the same country/city/building - has a different names ..
"A City" example:
https://en.wikipedia.org/wiki/Vienna
Vienna:
"Beč (Croatian, Serbian, older Bulgarian), Beç (older Turkish)*, Bech or
Vidnya (Romani), Bécs (Hungarian)*, Bin / Pin - 빈 (Korean), Dunaj
(Slovene)*, Fienna (Welsh), Vedunia (Celtic), Vena - Вена (Russian), Vídeň
(Czech)*, Viden' / Videň (Ukrainian)*, Viedeň (Slovak), Viên (Vietnamese),
Viena / Vijena/ Виена (Belarusian, Bulgarian, Macedonian), Viena (Catalan*,
Lithuanian, Portuguese*, Romanian*, Spanish*, Tagalog*), Vienna (Italian)*,
Vienne (French)*, Viénni - Βιέννη (Greek), Vieno (Esperanto), Viin
(Estonian), Vin - װין (Yiddish), Vín (Irish, Icelandic), Vina - וינה
(Hebrew), Vínarborg (Icelandic variant), Vindobona (Latin), Vīne
(Latvian)*, Viyana (Turkish)*, Vjenë (Albanian), Vjenna (Maltese), Vyana
(Azeri), Wean (local Viennese, Austrian and Bavarian dialects)*, Weiyena -
維也納 (Chinese)*, Wene (Afrikaans), Wenen (Dutch)*, Wiedeń (Polish)*, Wien
(Danish*, Finnish*, German*, Norwegian*, Swedish*), Wīn - ウィーン (Japanese)*,
Wina (Indonesian), فيينا (Arabic), وين (Persian)"
"
see:
https://en.wikipedia.org/wiki/Names_of_European_cities_in_different_languages_(U%E2%80%93Z)#V
https://en.wikipedia.org/wiki/List_of_names_of_European_cities_in_different_languages
or see the Nominatim names for Vienna:
https://nominatim.openstreetmap.org/details.php?osmtype=R&osmid=109166&class=boundary
And sometimes - the buildings has a different names ...
https://nominatim.openstreetmap.org/details.php?osmtype=R&osmid=11101&class=amenity
- Wien Rathaus (name)
- Ратуша (name:be)
- Vídeňská radnice (name:cs)
- Rathaus (name:de)
- Vienna City Hall (name:en)
- 市庁舎 (name:ja)
- Ратуша (name:ru)
- Віденська ратуша (name:uk)
Best,
Imre
Shaozhong SHI <shishaozhong at gmail.com> ezt írta (időpont: 2020. máj. 11.,
H, 18:24):
> Hello,
>
> I got a few questions as follows:
>
> 1. Which one is the best way for Fuzzy Address Matching?
>
> 2. FME FuzzyStringComparer uses Python difflib module. Which one in
> Postgres is equivalent or similar to it?
>
> 3. Often, addresses collected by different people may well be correct.
> But, there may be typing errors, or addresses are composed not in a
> consistent manner.
>
> For instance, South Great Avenue, A City, Planet Earth may be put down as
> the following:
>
> S. Great Aveue, City A, Earth Planet
> Great Avene South, A City, Earth Planet
> Great Avenue S, A City, Planet Earth
>
> Surely, there would be solutions to deal with this problem.
>
> Can anyone enlighten me?
>
> Regards,
>
> Shao
> _______________________________________________
> postgis-users mailing list
> postgis-users at lists.osgeo.org
> https://lists.osgeo.org/mailman/listinfo/postgis-users
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.osgeo.org/pipermail/postgis-users/attachments/20200511/7763c9f6/attachment.html>
More information about the postgis-users
mailing list