[postgis-tickets] [PostGIS] #2289: Redesign Geocoder
PostGIS
trac at osgeo.org
Fri Apr 26 08:11:59 PDT 2013
#2289: Redesign Geocoder
----------------------------+-----------------------------------------------
Reporter: woodbri | Owner: robe
Type: enhancement | Status: new
Priority: medium | Milestone: PostGIS Future
Component: tiger geocoder | Version: 2.0.x
Keywords: |
----------------------------+-----------------------------------------------
Comment(by robe):
Which tiger year is that? I have in featnames (I think I have 2012
loaded) ( 564,976 records with non-empty fullnames for MA) granted some
might not be things usually addressable. so I would expect your table to
be a little bigger. Maybe not. Anyway I like the idea of a sideline table
rather than adding a column to a tiger table so I might start with that
approach.
The main reason I haven't denormalized it is I'm not convinced it makes
speed faster for PostgreSQL (for MySQL or SQLite Of course :) ). Although
there is more joining - PostgreSQL is pretty good at that. What you get
with multiple tables is possibility of using multiple disk seeks (since
you can have the different sets on different disks) and possibility of
loading more records in shared memory since you won't need to duplicate.
So before going that route, I'd have to benchmark how the loss in the
aforementioned is improved with a single table set. Granted it will make
query easier to understand to others, but that's the least of my problems
:)
Aside from that, disk space is a big concern for me mostly because
1) I have to worry about cloud disks which are often limited in size and
scaled based on processor (except for Amazon disks which you pay for by
the super suckiness).
2) It takes longer to copy my tables from one install to another the
larger the dataset.
--
Ticket URL: <http://trac.osgeo.org/postgis/ticket/2289#comment:5>
PostGIS <http://trac.osgeo.org/postgis/>
The PostGIS Trac is used for bug, enhancement & task tracking, a user and developer wiki, and a view into the subversion code repository of PostGIS project.
More information about the postgis-tickets
mailing list