[PostGIS] #5695: standardize_address suspected of returning incorrect results

PostGIS trac at osgeo.org
Fri Mar 15 13:04:24 PDT 2024


#5695: standardize_address suspected of returning incorrect results
----------------------------------+---------------------------
  Reporter:  tmcsys               |      Owner:  robe
      Type:  defect               |     Status:  new
  Priority:  medium               |  Milestone:  PostGIS 3.4.3
 Component:  pagc_address_parser  |    Version:  3.4.x
Resolution:                       |   Keywords:
----------------------------------+---------------------------
Comment (by robe):

 Something about the data set must have changed. I'm guessing it is the
 weight of one of the items, which I'll have to investigate. Possibly my
 change in #5299 inadvertently introduced something when I was re-sorting
 the values.

 I note that the one shipped with tiger_geocoder does the right thing with
 the direction.


 {{{
 CREATE EXTENSION postgis_tiger_geocoder CASCADE;
 SELECT * FROM standardize_address('tiger.pagc_lex', 'tiger.pagc_gaz',
 'tiger.pagc_rules', 'ONE E PIMA ST STE 999, TUCSON, AZ');
 }}}

 outputs:


 {{{
  building | house_num | predir | qual | pretype | name | suftype | sufdir | ruralroute | extra |  city  | state | country | postcode | box |   unit
 ----------+-----------+--------+------+---------+------+---------+--------+------------+-------+--------+-------+---------+----------+-----+-----------
           | 1         | E      |      |         | PIMA | ST      |        |            |       | TUCSON | AZ    | USA     |          |     | SUITE 999
 (1 row)
 }}}



 {{{
 SELECT * FROM standardize_address('tiger.pagc_lex', 'tiger.pagc_gaz',
 'tiger.pagc_rules', '8907 W TROPICAIRE BLVD,NORTH PORT,FL, 34291');


  building | house_num | predir | qual | pretype |    name    | suftype | sufdir | ruralroute | extra |    city    | state | country | postcode | box | unit
 ----------+-----------+--------+------+---------+------------+---------+--------+------------+-------+------------+-------+---------+----------+-----+------
           | 8907      | W      |      |         | TROPICAIRE | BLVD    |        |            |       | NORTH PORT | FL    | USA     | 34291    |     |
 (1 row)
 }}}


 Which is why I'm thinking it's isolated to the address_standardizer_data_us
 extension dataset.
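
 A side-by-side check along these lines might help confirm that. This is an
 untested sketch: it assumes the address_standardizer and
 address_standardizer_data_us extensions are available, that the latter is
 installed in a schema on the search_path, and that its tables are named
 us_lex, us_gaz and us_rules. The source_tables label is only there to tell
 the two rows apart.


 {{{
 -- Assumption: both extensions are available on this server.
 CREATE EXTENSION IF NOT EXISTS address_standardizer;
 CREATE EXTENSION IF NOT EXISTS address_standardizer_data_us;

 -- Same address run through both table sets; compare predir and name.
 SELECT 'tiger' AS source_tables, s.house_num, s.predir, s.name, s.suftype, s.unit
 FROM standardize_address('tiger.pagc_lex', 'tiger.pagc_gaz',
   'tiger.pagc_rules', 'ONE E PIMA ST STE 999, TUCSON, AZ') AS s
 UNION ALL
 SELECT 'data_us', s.house_num, s.predir, s.name, s.suftype, s.unit
 FROM standardize_address('us_lex', 'us_gaz',
   'us_rules', 'ONE E PIMA ST STE 999, TUCSON, AZ') AS s;
 }}}

 If only the data_us row differs from the tiger row, that points at the
 lexicon, gazetteer or rules data rather than the parser code.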


 I was thinking it might be the switch of the regex library to the newer
 pcre2, but I get the same failures you describe on systems running either
 the old pcre or the new pcre2 library.
-- 
Ticket URL: <https://trac.osgeo.org/postgis/ticket/5695#comment:2>
PostGIS <http://trac.osgeo.org/postgis/>
The PostGIS Trac is used for bug, enhancement & task tracking, a user and developer wiki, and a view into the subversion code repository of PostGIS project.

