[postgis-users] Can geocode() deal with NYC Hyphenated address numbers and other special types?

Stephen Woodbridge woodbri at swoodbridge.com
Mon Nov 30 13:20:39 PST 2015


I do not believe it is able to deal with  milepost, grid-style, or 
hyphenated house numbers other than the fact that it can pull off the 
first number and might use that to interpolate.

First understand that it only do linear interpolation between two number 
in a range. These complex numbers do not lead themselves to simple 
linear interpolation. I did an analysis of these address number in the 
past in Tiger data. Specifically, I compared the address ranges, to look 
at which components of the number were constant between the start and 
end of the range and which parts varied over the range.
It was not obvious in many cases which numbers to use for the 
interpolation and in some cases it was totally impossible to interpolate 
because the numbers collected did not make sense (block number at start 
and single number at end) either through a typographical error in the 
data or based on who collected the numbers at the end of the segments or 
how numbers were reassigned when a segment got divided into multiple 
segments.


Also remember you are working with Tiger data which fuzzes the address 
ranges because of Title 13 concerns.

I'll let others respond to the specifics of the postgis geocoder.

-Steve

On 11/30/2015 3:56 PM, draco doc wrote:
> Hi,
>
> I found there are many special format of addresses that valid in their
> local area, but could be quite difficult for geocode() in Tiger Geocoder.
>
> Here is a US street address standard document
> <https://www.fgdc.gov/standards/projects/FGDC-standards-projects/street-address/index_html>.
> Started from page 36 of the final draft, several special cases were listed:
>
> 4. Milepost Complete Address Numbers (Example: "Milepost
> 240").
>
> 5. Grid-style Complete Address Numbers (Example:
> "N89W16758"). In certain communities in and around southern
> Wisconsin, Complete Address Numbers include a map grid cell
> reference preceding the Address Number.
>
> 6. Hyphenated Complete Address Numbers (Example: "5-
> 5415"). In some areas (notably certain parts of New York City,
> southern California, and Hawaii), Complete Address Numbers
> often include hyphens.
>
> I have 18 million address to geocode, and all these special cases can be
> found in my data. From my test, geocode() seemed not be able to
> recognize these cases which is totally understandable for me.
>
> My questions is, is it possible to modify input a little bit to get a
> better than nothing result?
>
> For example, if the NYC hyphenated address cannot be recognized, can we
> at least remove some part and get a rough location? I have address like
> this "40-12 28 AVE , New York City, NY 11103", where 40 is block, 12 is
> house number in block. I tried to just remove the 12 and keep the 40
> which is block number, but all I can get is still just the 28 ave.
>
> Thanks!
>
>
> _______________________________________________
> postgis-users mailing list
> postgis-users at lists.osgeo.org
> http://lists.osgeo.org/mailman/listinfo/postgis-users
>



More information about the postgis-users mailing list