[postgis-users] caching computations ?
Paragon Corporation
lr at pcorp.us
Fri Sep 12 21:18:05 PDT 2008
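This is a bit of an annoying fact that was brought up way way way back.
The issue is that by wrapping your call in a function, you are forcing the
planner to materialize the function's result. If it's not wrapped in a
function, then the planner has the luxury of deciding whether to materialize
or not. In your first plan it chose not to: st_simplify() shows up inside the
Join Filter and the Filter on fbgeom g, so it is re-evaluated during the join
rather than once per shape.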
Try this query and see if it comes closer to your function example.
SELECT t.typeid, count(*)
FROM fbtype t, fbgeom g,
(SELECT ST_Simplify(shape, 1.0) AS shape
FROM fbgeom
WHERE topicid='9202a8c04000641f800000000000d0f6'
ORDER BY ST_Simplify(shape, 1.0)) AS location
WHERE t.topicid=g.topicid
AND t.typeid='9202a8c04000641f8000000000000669'
AND ST_Within(g.shape, location.shape)
GROUP BY 1
ORDER BY 2 DESC;
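The ORDER BY in the subquery above is only there to stop the planner from
flattening the subquery into the outer join. Another commonly used way to get
the same effect in PostgreSQL is OFFSET 0. This is just a sketch against the
same tables; I have not run it on your data, so check the plan with
EXPLAIN ANALYZE before relying on it.

```sql
-- Sketch: OFFSET 0 keeps the subquery from being pulled up into the outer
-- query, so ST_Simplify() is evaluated in the subquery scan.
SELECT t.typeid, count(*)
FROM fbtype t, fbgeom g,
     (SELECT ST_Simplify(shape, 1.0) AS shape
      FROM fbgeom
      WHERE topicid='9202a8c04000641f800000000000d0f6'
      OFFSET 0) AS location
WHERE t.topicid=g.topicid
  AND t.typeid='9202a8c04000641f8000000000000669'
  AND ST_Within(g.shape, location.shape)
GROUP BY 1
ORDER BY 2 DESC;
```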
Also, by the way, your SQL function is marked as VOLATILE (well, that's the
default if not explicitly stated). If you plan to move this into the SELECT
to run against all topicids, it may help to mark it as STABLE.
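As a rough sketch, this is your same function with only the volatility
changed. STABLE fits here because the function only reads from fbgeom and does
not modify anything. One caveat, which is an assumption to verify on your
server version: newer PostgreSQL releases may inline a non-volatile SQL
function that returns a set, which could bring back the slower plan, so
compare the EXPLAIN ANALYZE output before and after.

```sql
-- Same body as the original gix_simplify; only the STABLE marker is added.
CREATE OR REPLACE FUNCTION gix_simplify(character(32), double precision)
RETURNS SETOF geometry AS $$
    SELECT ST_Simplify(shape, $2) FROM fbgeom WHERE topicid=$1;
$$ LANGUAGE SQL STABLE;
```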
Hope that helps,
Regina
-----Original Message-----
From: postgis-users-bounces at postgis.refractions.net
[mailto:postgis-users-bounces at postgis.refractions.net] On Behalf Of Andi
Vajda
Sent: Friday, September 12, 2008 9:15 PM
To: PostGIS Users
Subject: [postgis-users] caching computations ?
The following two queries are functionally equivalent in that they return
the same result. They're also equivalent in structure, except that the first
one uses a subquery in the FROM clause and the second one uses a function
executing the same query as that subquery. Both use the geo index,
fbgeom_shape_idx.
The first one runs in 246536ms, the second one in 1745ms.
Is it because in the latter ST_Simplify() is computed only once per shape
considered, whereas in the former it is computed at every test in the join?
Thanks !
Andi..
EXPLAIN ANALYZE
SELECT t.typeid, count(*)
FROM fbtype t, fbgeom g,
(SELECT ST_Simplify(shape, 1.0) AS shape
FROM fbgeom
WHERE topicid='9202a8c04000641f800000000000d0f6') AS location
WHERE t.topicid=g.topicid
AND t.typeid='9202a8c04000641f8000000000000669'
AND ST_Within(g.shape, location.shape)
GROUP BY 1
ORDER BY 2 DESC;
Sort (cost=32.41..32.41 rows=1 width=33) (actual time=246536.348..246536.349 rows=1 loops=1)
  Sort Key: (count(*))
  Sort Method: quicksort Memory: 25kB
  -> GroupAggregate (cost=0.00..32.40 rows=1 width=33) (actual time=246536.331..246536.332 rows=1 loops=1)
    -> Nested Loop (cost=0.00..32.38 rows=1 width=33) (actual time=72.844..246531.640 rows=5596 loops=1)
      -> Nested Loop (cost=0.00..21.07 rows=1 width=32) (actual time=0.426..245923.937 rows=40387 loops=1)
           Join Filter: _st_within(g.shape, st_simplify(fbgeom.shape, 1::double precision))
        -> Index Scan using fbgeom_topicid_idx on fbgeom (cost=0.00..8.53 rows=1 width=123) (actual time=0.074..0.112 rows=8 loops=1)
             Index Cond: (topicid = '9202a8c04000641f800000000000d0f6'::bpchar)
        -> Index Scan using fbgeom_shape_idx on fbgeom g (cost=0.00..12.50 rows=2 width=155) (actual time=44.315..15368.010 rows=6108 loops=8)
             Index Cond: (g.shape && st_simplify(fbgeom.shape, 1::double precision))
             Filter: (g.shape && st_simplify(fbgeom.shape, 1::double precision))
      -> Index Scan using fbtype_pkey on fbtype t (cost=0.00..11.27 rows=3 width=66) (actual time=0.014..0.014 rows=0 loops=40387)
           Index Cond: ((t.topicid = g.topicid) AND (t.typeid = '9202a8c04000641f8000000000000669'::bpchar))
Total runtime: 246536.530 ms
versus:
CREATE OR REPLACE FUNCTION gix_simplify(character(32), double precision)
RETURNS SETOF geometry AS $$
SELECT ST_Simplify(shape, $2) FROM fbgeom WHERE topicid=$1;
$$ LANGUAGE SQL;
EXPLAIN ANALYZE
SELECT t.typeid, count(*)
FROM fbtype t, fbgeom g,
gix_simplify('9202a8c04000641f800000000000d0f6', 1.0) AS location
WHERE t.topicid=g.topicid
AND t.typeid='9202a8c04000641f8000000000000669'
AND ST_Within(g.shape, location)
GROUP BY 1
ORDER BY 2 DESC;
Sort (cost=20304.25..20304.27 rows=6 width=33) (actual time=1757.072..1757.072 rows=1 loops=1)
  Sort Key: (count(*))
  Sort Method: quicksort Memory: 25kB
  -> GroupAggregate (cost=0.00..20304.18 rows=6 width=33) (actual time=1757.053..1757.053 rows=1 loops=1)
    -> Nested Loop (cost=0.00..20302.23 rows=374 width=33) (actual time=69.671..1753.826 rows=5596 loops=1)
      -> Nested Loop (cost=0.00..11726.96 rows=809 width=32) (actual time=3.663..1304.404 rows=40387 loops=1)
           Join Filter: _st_within(g.shape, location.location)
        -> Function Scan on gix_simplify location (cost=0.00..260.00 rows=1000 width=32) (actual time=3.311..3.325 rows=8 loops=1)
        -> Index Scan using fbgeom_shape_idx on fbgeom g (cost=0.00..11.44 rows=2 width=155) (actual time=43.784..71.728 rows=6108 loops=8)
             Index Cond: (g.shape && location.location)
             Filter: (g.shape && location.location)
      -> Index Scan using fbtype_pkey on fbtype t (cost=0.00..10.56 rows=3 width=66) (actual time=0.010..0.011 rows=0 loops=40387)
           Index Cond: ((t.topicid = g.topicid) AND (t.typeid = '9202a8c04000641f8000000000000669'::bpchar))
Total runtime: 1757.231 ms