[postgis-tickets] [PostGIS] #2260: Benchmarking speed between built-in tiger normalizer and pagc_address_parser

PostGIS trac at osgeo.org
Wed Apr 3 06:01:44 PDT 2013


#2260: Benchmarking speed between built-in tiger normalizer and
pagc_address_parser
---------------------------------+------------------------------------------
 Reporter:  robe                 |       Owner:  robe         
     Type:  task                 |      Status:  new          
 Priority:  medium               |   Milestone:  PostGIS 2.1.0
Component:  pagc_address_parser  |     Version:  trunk        
 Keywords:                       |  
---------------------------------+------------------------------------------

Comment(by robe):

 here is one of my quick benchmark tests:

 {{{
 DROP TABLE IF EXISTS test_parse;
 CREATE TABLE test_parse(addid serial PRIMARY KEY, address text);

 INSERT INTO test_parse(address)
 VALUES ('529 Main Street, Boston MA, 02129'),
  ('77 Massachusetts Avenue, Cambridge, MA 02139'),
  ('25 Wizard of Oz, Walaford, KS 99912323'),
  ('26 Capen Street, Medford, MA'),
  ('124 Mount Auburn St, Cambridge, Massachusetts 02138'),
  ('950 Main Street, Worcester, MA 01610')
 ,  ('949 N 3rd St, New Hyde Park, NY, 11040')
 , ('8401 W 35W Service Dr NE, Blaine, MN 55449');
 }}}



 {{{
 -- built in normalizer takes betweeen 57ms and 60 ms

 SELECT address, normalize_address(address) As na
 FROM test_parse;

 529 Main Street, Boston MA, 02129;(529,,Main,St,,,Boston,MA,02129,t)
 77 Massachusetts Avenue, Cambridge, MA
 02139;(77,,Massachusetts,Ave,,,Cambridge,MA,02139,t)
 25 Wizard of Oz, Walaford, KS 99912323;(25,,"Wizard of
 Oz",,,,Walaford,KS,99912323,t)
 26 Capen Street, Medford, MA;(26,,Capen,St,,,Medford,MA,,t)
 124 Mount Auburn St, Cambridge, Massachusetts 02138;(124,,"Mount
 Auburn",St,,,Cambridge,MA,02138,t)
 950 Main Street, Worcester, MA 01610;(950,,Main,St,,,Worcester,MA,01610,t)
 949 N 3rd St, New Hyde Park, NY, 11040;(949,N,3rd,St,,,"New Hyde
 Park",NY,11040,t)
 8401 W 35W Service Dr NE, Blaine, MN 55449;(8401,W,35W,"Svc
 Dr",NE,,Blaine,MN,55449,t)
 }}}

 -- pagc one

 {{{
 -- pagc normalizer --
 -- 210 ms, 197 ms --
 SELECT address, pagc_normalize_address(address)
 FROM test_parse;

 529 Main Street, Boston MA, 02129;(529,,MAIN,St,,,BOSTON,MA,02129,t)
 77 Massachusetts Avenue, Cambridge, MA
 02139;(77,,MASSACHUSETTS,Ave,,,CAMBRIDGE,MA,02139,t)
 25 Wizard of Oz, Walaford, KS 99912323;(25,,"WIZARD OF",,,"# OZ
 WALAFORD",KANSAS,,99912323,t)
 26 Capen Street, Medford, MA;(26,,CAPEN,St,,,"MEDFORD TEXT MICRO",MA,AS,t)
 124 Mount Auburn St, Cambridge, Massachusetts 02138;(124,,"MOUNT
 AUBURN",St,,,CAMBRIDGE,MA,02138,t)
 950 Main Street, Worcester, MA 01610;(950,,MAIN,St,,,WORCESTER,MA,01610,t)
 949 N 3rd St, New Hyde Park, NY, 11040;(949,N,3,St,,,"NEW HYDE
 PARK",NY,11040,t)
 8401 W 35W Service Dr NE, Blaine, MN 55449;(35,,SERVICE,Dr,NE,"# 8401
 W",BLAINE,MN,55449,t)

 }}}

 But pagc one is in debug mode so not a fair test.

-- 
Ticket URL: <http://trac.osgeo.org/postgis/ticket/2260#comment:1>
PostGIS <http://trac.osgeo.org/postgis/>
The PostGIS Trac is used for bug, enhancement & task tracking, a user and developer wiki, and a view into the subversion code repository of PostGIS project.


More information about the postgis-tickets mailing list