[postgis-users] Various ways to handle addresses in postgresql

Shaozhong SHI shishaozhong at gmail.com
Sat Jan 9 01:40:05 PST 2021


Hi, Steve W,

it is easy to parse addresses as tokens.  But it is difficult to put tokens
in right columns, due to that the same address could be expressed with
partial address or full address.

The same address can be written like,  Flat 1 122 Great Avenue London UK,
or Flat 1 122 Greet Avenue Central London London United Kingdom.

When this happens, each address has different number of tokens, so
different numbers of tokens.  Is there a way to deal with this issue so
that each token can get into right column?

Please enlighten me.

Regards,

David

On Sat, 25 Apr 2020 at 05:09, Stephen Woodbridge <
stephenwoodbridge37 at gmail.com> wrote:

> And I have create an address-standardizer project here
> https://github.com/woodbri/address-standardizer which is user
> configurable. I might be over kill is you just want to strip off the
> number, in which case you might just use a SQL regexp replace to remove it.
>
> -Steve W
>
> On 4/25/2020 12:04 AM, Stephen Woodbridge wrote:
> > PostGIS has address_standardizer extension that includes
> > parse_address() and standardize_address() functions.
> >
> > -Steve W
> >
> > On 4/24/2020 9:54 PM, Imre Samu wrote:
> >> > handle addresses in postgresql
> >>
> >> maybe you can use the https://github.com/openvenues/libpostal library
> >> with your favorite language bindings ( Python / Ruby / Go / PHP /
> >> Node / R / Java  ...)
> >>
> >> or as a Postgres database extension:
> >>
> https://info.crunchydata.com/blog/quick-and-dirty-address-matching-with-libpostal
> >>
> >> https://github.com/pramsey/pgsql-postal
> >>
> >> Regards,
> >>  Imre
> >>
> >>
> >>
> >>
> >> Shaozhong SHI <shishaozhong at gmail.com
> >> <mailto:shishaozhong at gmail.com>> ezt írta (időpont: 2020. ápr. 25.,
> >> Szo, 2:49):
> >>
> >>     I find this is a simple, but important question.
> >>
> >>     How best to split numbers and the rest of address?
> >>
> >>     For instance, one tricky one is as follows:
> >>
> >>     21-1 Great Avenue, a city, a country, this planet
> >>
> >>     How to turn this into the following:
> >>
> >>     column 1,       column 2
> >>
> >>       21-1              Great Avenue, a city, a country, this planet
> >>
> >>     Note:  there is a hyphen in  21-1
> >>
> >>     Any clue?
> >>
> >>     Regards,
> >>
> >>     Shao
> >>     _______________________________________________
> >>     postgis-users mailing list
> >>     postgis-users at lists.osgeo.org <mailto:postgis-users at lists.osgeo.org
> >
> >>     https://lists.osgeo.org/mailman/listinfo/postgis-users
> >>
> >>
> >> _______________________________________________
> >> postgis-users mailing list
> >> postgis-users at lists.osgeo.org
> >> https://lists.osgeo.org/mailman/listinfo/postgis-users
> >
>
> _______________________________________________
> postgis-users mailing list
> postgis-users at lists.osgeo.org
> https://lists.osgeo.org/mailman/listinfo/postgis-users
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.osgeo.org/pipermail/postgis-users/attachments/20210109/94d4e067/attachment.html>


More information about the postgis-users mailing list