[postgis-tickets] [PostGIS] #2288: Pre Standardize tiger input data
PostGIS
trac at osgeo.org
Thu Apr 25 20:40:57 PDT 2013
#2288: Pre Standardize tiger input data
----------------------------+-----------------------------------------------
Reporter: robe | Owner: robe
Type: enhancement | Status: new
Priority: medium | Milestone: PostGIS Future
Component: tiger geocoder | Version: trunk
Keywords: |
----------------------------+-----------------------------------------------
As noted in ticket #1118, some foreign abbreviations need different
handling depending on the word itself and on which county of the US you
are in, e.g. Camino. Such a word may appear in the full name, pre-type,
post-type, etc.
Pre-normalizing the tiger data would solve this and would be the best solution.
I don't think I'll do this in 2.1, since it requires far too much forward
thinking and testing, which I simply don't have time for right now.
My proposed plan, though:
1) Add an hstore column (call it norm_tags) to the featnames table,
which would essentially hold
hstore(norm_addy)
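A minimal sketch of step 1, assuming the hstore extension and the tiger geocoder are both installed and that featnames lives in the tiger schema. The sample address is made up for illustration only; hstore(record) turns each norm_addy field into a key/value pair.

```sql
-- Assumption: hstore and the tiger geocoder are available in this database.
CREATE EXTENSION IF NOT EXISTS hstore;

-- Add the proposed column; it stays NULL until rows are populated.
ALTER TABLE tiger.featnames ADD COLUMN norm_tags hstore;

-- Inspect what hstore(norm_addy) looks like for one (made-up) address.
-- normalize_address() returns a norm_addy; hstore() flattens it to key => value.
SELECT hstore(normalize_address('Camino Del Rio, San Diego, CA 92108'));
```

Because the column stores key/value text rather than the norm_addy composite type itself, later changes to the norm_addy definition would not require altering the table.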
Why hstore? Because:
1) I can index the pieces by key and, I believe, with a custom index by
value if needed.
2) I expect my norm_addy structure will be in flux for a while, so I don't
want user tables affected by this. hstore seems like a stable enough
structure to give me what I need and shield me from this issue.
3) I'm deliberating on whether to do this only for problem cases or for
all records. I want to at least start off with just the problem ones so
that the scan list is shorter, and then, if benchmarking shows it is more
efficient to do it for all records, move to that extreme.
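The indexing and "problem cases only" ideas above might look roughly like the following. This is a sketch under assumptions, not a tested design: it presumes the norm_tags column already exists, the ILIKE filter is a hypothetical stand-in for however problem rows would actually be identified, the index names are invented, and whether normalize_address() gives useful output on a bare street name would need testing.

```sql
-- Assumption: tiger.featnames already has a norm_tags hstore column.
-- Populate only suspected problem rows (the filter is a hypothetical example).
UPDATE tiger.featnames
   SET norm_tags = hstore(normalize_address(fullname))
 WHERE fullname ILIKE 'Camino %';

-- GIN index: supports key-existence (?) and containment (@>) lookups.
CREATE INDEX featnames_norm_tags_gin
    ON tiger.featnames USING gin (norm_tags);

-- Expression index on one value, limited to the rows that were populated.
CREATE INDEX featnames_norm_streetname_idx
    ON tiger.featnames ((norm_tags -> 'streetname'))
 WHERE norm_tags IS NOT NULL;
```

Note that the loaded tiger data sits in per-state child tables that inherit from featnames, so indexes created on the parent alone would not cover them; the same statements would have to be applied per child table.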
--
Ticket URL: <http://trac.osgeo.org/postgis/ticket/2288>
PostGIS <http://trac.osgeo.org/postgis/>
The PostGIS Trac is used for bug, enhancement & task tracking, a user and developer wiki, and a view into the subversion code repository of PostGIS project.