[postgis-devel] [PostGIS] #1109: Lake Drive is good; Lake DR - not so good

PostGIS trac at osgeo.org
Wed Jul 6 12:29:02 PDT 2011


#1109: Lake Drive is good; Lake DR - not so good
----------------------------+-----------------------------------------------
 Reporter:  mikepease       |       Owner:  robe         
     Type:  defect          |      Status:  new          
 Priority:  high            |   Milestone:  PostGIS 2.0.0
Component:  tiger geocoder  |     Version:  1.5.X        
 Keywords:                  |  
----------------------------+-----------------------------------------------

Comment(by chodgson):

 This seems like a bigger problem: many street types could also be
 street names. For example, consider:

 west crescent
 crescent rd

 I think the real problem is the expectation that there is only a single
 way to normalize an input string; there are guaranteed to be ambiguous
 cases. Note that having multiple normalizations doesn't necessarily mean
 there are multiple results: there may not be a 100 1st St in 'Paul', but
 there may be a 100 1 St in St. Paul. It could also be the other way
 around; I don't know of a city named 'Paul', but there are a lot of
 Saints, so sometimes it might be true. Sometimes the existence of a given
 street address on a given street name in a given city or zip code will
 reduce the number of possible results. Other times the multiple
 parsings/normalizations will produce many additional possible results.
 Really, the geocoding part of the code should know about the different
 normalizations, and the normalizations might need a score/rating as well,
 which would carry through into the geocoding.
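
 As a rough sketch of what scored, multiple normalizations might look
 like (Python for illustration only, not the tiger geocoder's actual
 code; the lookup tables, the Candidate type, the function names, and the
 score weights are all assumptions invented for this example):

```python
from dataclasses import dataclass
from typing import List, Optional, Set, Tuple

# Hypothetical lookup tables; a real geocoder would load these from its own data.
STREET_TYPES = {"dr": "Dr", "drive": "Dr", "rd": "Rd", "road": "Rd",
                "st": "St", "street": "St", "crescent": "Cres", "cres": "Cres"}
DIRECTIONS = {"west": "W", "w": "W", "east": "E", "e": "E",
              "north": "N", "n": "N", "south": "S", "s": "S"}


@dataclass(frozen=True)
class Candidate:
    predir: Optional[str]       # leading direction, if the first token was read as one
    name: str                   # whatever is left is treated as the street name
    street_type: Optional[str]  # trailing street type, if the last token was read as one
    score: float                # higher = more plausible; the weights are arbitrary


def candidate_normalizations(street: str) -> List[Candidate]:
    """Return every parse of the street string instead of a single 'best' one."""
    tokens = street.lower().split()
    if not tokens:
        return []
    candidates = []
    # Try reading the first token as a direction (or not) and the last
    # token as a street type (or not), keeping every parse that leaves a name.
    for use_dir in (True, False):
        for use_type in (True, False):
            rest = list(tokens)
            predir = stype = None
            if use_dir:
                if rest[0] not in DIRECTIONS:
                    continue
                predir = DIRECTIONS[rest.pop(0)]
            if use_type:
                if not rest or rest[-1] not in STREET_TYPES:
                    continue
                stype = STREET_TYPES[rest.pop()]
            if not rest:
                continue  # a street still needs a name
            # Penalize parses that leave recognizable keywords inside the name.
            leftovers = sum(1 for t in rest if t in DIRECTIONS or t in STREET_TYPES)
            candidates.append(
                Candidate(predir, " ".join(rest), stype, 1.0 - 0.25 * leftovers))
    return sorted(candidates, key=lambda c: c.score, reverse=True)


def match_known(candidates: List[Candidate],
                known: Set[Tuple[Optional[str], str, Optional[str]]]) -> List[Candidate]:
    """Keep only the parses that correspond to a street that actually exists."""
    return [c for c in candidates if (c.predir, c.name, c.street_type) in known]
```

 With these tables, "west crescent" yields three parses (direction W +
 name "crescent", name "west" + type Cres, and the whole string as a
 name), and the lookup against known streets, not the normalizer, decides
 which of them survive. The scores would then be carried into the rating
 of the geocoding results.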

 From my experience with geocoding, assuming there is a single "normalized"
 interpretation for any given input string is just not going to work in
 many cases. And I'm not even talking about really random stuff; I'm
 talking about inputs that would actually make sense to a human (and that
 a local resident would be able to identify easily).

-- 
Ticket URL: <http://trac.osgeo.org/postgis/ticket/1109#comment:5>
PostGIS <http://trac.osgeo.org/postgis/>
The PostGIS Trac is used for bug, enhancement & task tracking, a user and developer wiki, and a view into the subversion code repository of PostGIS project.

