[postgis-devel] [PostGIS] #1669: Missing street type w/ Internal component makes geocode() so SLOOOW
PostGIS
trac at osgeo.org
Thu Mar 8 08:12:32 PST 2012
#1669: Missing street type w/ Internal component makes geocode() so SLOOOW
-----------------------+----------------------------------------------------
Reporter: mikepease | Owner: pramsey
Type: defect | Status: new
Priority: medium | Milestone: PostGIS 2.0.1
Component: postgis | Version: 1.5.X
Keywords: |
-----------------------+----------------------------------------------------
It appears there are certain types of address strings that make the
geocode() function run 100 - 1000X slower than "normal addresses".
Consider these variations on[[BR]]
51 Nicollet Ave FL 4, Minneapolis, MN 55402
select (addy).*, rating from geocode('651 Nicollet Ave FL 4, Minneapolis,
MN 55402') -- Fast![[BR]]
select (addy).*, rating from geocode('651 Nicollet FL 4, Minneapolis, MN
55402') -->SLOW! 60+sec[[BR]]
select (addy).*, rating from geocode('651 Nicollet, FL 4, Minneapolis, MN
55402') -->SLOW! 60+sec[[BR]]
select (addy).*, rating from geocode('651 Nicollet, Minneapolis, MN
55402') --Fast![[BR]]
From what I can surmise, it seems that if you have an address that DOESN'T
specify the street type but DOES specify an internal component, then the
big slow-down happens.
When I run a list of several thousand addresses of medium quality, I run
into dozens or more of these slow addresses and it ends up taking a
majority of the time to run through these addresses.
Say I have a list of 10,000 medium-quality addresses.
Maybe 9,950 of them will run fine, but just 50 of them go slow.
This tiny minority ends up taking the vast majority of the batch time.
9,950 x 0.1 sec = ~15 min.
50 x 90 sec = 75 min.
Total ~90 min.
Given the effect this has on running through a list, I think it's
important to find a fix for this.
--
Ticket URL: <http://trac.osgeo.org/postgis/ticket/1669>
PostGIS <http://trac.osgeo.org/postgis/>
The PostGIS Trac is used for bug, enhancement & task tracking, a user and developer wiki, and a view into the subversion code repository of PostGIS project.
More information about the postgis-devel
mailing list