[postgis-tickets] [PostGIS] #3965: KMeans provides less than K clusters

PostGIS trac at osgeo.org
Fri Dec 29 05:32:28 PST 2017


#3965: KMeans provides less than K clusters
---------------------+---------------------------
 Reporter:  komzpa   |      Owner:  pramsey
     Type:  defect   |     Status:  new
 Priority:  high     |  Milestone:  PostGIS 2.4.3
Component:  postgis  |    Version:  trunk
 Keywords:           |
---------------------+---------------------------
 Clustering 100 distinct points into 100 clusters gets 96 clusters:

 {{{
 select count(distinct cid) from
 (WITH
 points AS (
     SELECT ST_MakePoint(x,y) geom from generate_series(1,5) x,
 generate_series(1,5) y
 )
 SELECT ST_ClusterKMeans(geom, 25) over () AS cid, geom
 FROM points) z;
 }}}

 The larger K is, the bigger losses are.

--
Ticket URL: <https://trac.osgeo.org/postgis/ticket/3965>
PostGIS <http://trac.osgeo.org/postgis/>
The PostGIS Trac is used for bug, enhancement & task tracking, a user and developer wiki, and a view into the subversion code repository of PostGIS project.


More information about the postgis-tickets mailing list