<div dir="ltr"><br><div class="gmail_extra"><br><div class="gmail_quote">On Mon, Jan 5, 2015 at 12:17 PM, Benjamin Ducke <span dir="ltr"><<a href="mailto:benducke@fastmail.fm" target="_blank">benducke@fastmail.fm</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><span>On 04/01/15 23:00, Paulo van Breugel wrote:<br>
> Tried it out, and it works like a charm, great!<br>
><br>
> One question/request, would it be terribly difficult to add an option to<br>
> compute an user-defined number of clusters of equal size?<br>
><br>
<br>
</span>I think you are looking for a completely different algorithm<br>
there. There are many clustering methods that will produce<br>
exactly k clusters. But what do you mean by size: number of<br>
points per cluster or spatial extent/area?<br></blockquote><div><br></div><div>Yes, I was looking at something similar to what the function stratify in the R spcosa package does: "Methods for partitioning a spatial object into compact strata by means of k-means. The objective function to minimize is the mean squared shortest distance (MSSD). Optionally, the strata may be forced to be of equal size. This facilitates field work in case of stratified simple random sampling for composites". The key here is to create as compact strata as possible, optionally of equal size (number of points per cluster). <br></div><div><br> </div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
In both cases, I am not sure how you would reconcile the<br>
two opposing goals: (A) find the actual cluster shapes,<br>
(B) make the cluster shapes so that all clusters are of<br>
equal size.<br>
<br>
(please see also my comment to Markus below).<br>
<span><br>
><br>
><br>
> On Sun, Jan 4, 2015 at 9:02 PM, Markus Metz<br>
</span>> <<a href="mailto:markus.metz.giswork@gmail.com" target="_blank">markus.metz.giswork@gmail.com</a> <mailto:<a href="mailto:markus.metz.giswork@gmail.com" target="_blank">markus.metz.giswork@gmail.com</a>>><br>
<span>> wrote:<br>
><br>
><br>
> Done in trunk r63952 as v.cluster. It is not a GRASS7 addon because it<br>
> does not work with G7.0. At the moment it only identifies and labels<br>
> clusters. Connectivity graphs and cluster shapes are not implemented.<br>
> In future, I would like to add OPTICS because OPTICS can better handle<br>
> different clusters with different densities.<br>
><br>
<br>
</span>Thanks Markus, this is excellent progress.<br>
It seems to me that the approximation of cluster shapes from grouped<br>
points is a generic problem that would best be solved with a separate<br>
module. As long as the shapes are roughly convex, the existing v.hull<br>
should work fine. For concave shapes, AFAIK things become messy because<br>
common methods such as alpha shapes require the user to provide<br>
threshold values.<br>
<br>
Cheers,<br>
<br>
Ben<br>
<div><div><br>
> Markus M<br>
><br>
> ><br>
> > There is a reference implementation (albeit in Java) that you can<br>
> > use to compare it with other clustering algorithms:<br>
> ><br>
> > <a href="http://elki.dbs.ifi.lmu.de/" target="_blank">http://elki.dbs.ifi.lmu.de/</a><br>
> ><br>
> > An implementation in C (that I have not tried) is here:<br>
> ><br>
> > <a href="https://github.com/siddharth950/DBSCAN" target="_blank">https://github.com/siddharth950/DBSCAN</a><br>
> ><br>
> > The one drawback of DBSCAN is that it needs an efficient spatial<br>
> > index to perform well -- and you have just taken care of that!<br>
> ><br>
> >> - point cloud thinning: a sample can be generated from a large point<br>
> >> cloud by specifying a minimum distance between sample points. To be<br>
> >> implemented.<br>
> >><br>
> >> The new k-d tree is now used by v.clean tool=snap<br>
> (Vect_snap_lines()),<br>
> >> reducing both memory consumption and processing time.<br>
> >><br>
> ><br>
> > I would also like to point out that SAGA GIS is moving into<br>
> > the same direction, i.e. more efficient processing of very<br>
> > large point clouds. The latest release includes a number<br>
> > of new point cloud tools. Perhaps it's worth a look.<br>
> ><br>
> > Most importantly, SAGA GIS has introduced a new file format,<br>
> > the SAGA Pointcloud (.spc) format. It is compact and yet<br>
> > flexible enough for a variety of purposes. I recommend to<br>
> > consider implementing import/export of this format in GRASS 7,<br>
> > preferably not via v.in.ogr, to avoid the OGR model conversion<br>
> > overhead.<br>
> ><br>
> > If you think this would be an interesting option, then you can<br>
> > find a summary on our tracker:<br>
> ><br>
> > <a href="http://gvsigce.sourceforge.net/mantis/view.php?id=595" target="_blank">http://gvsigce.sourceforge.net/mantis/view.php?id=595</a><br>
> ><br>
> > (we are going to implement ".spc" in gvSIG CE, as well).<br>
> ><br>
> > Best wishes,<br>
> ><br>
> > Ben<br>
> ><br>
> >><br>
> >> More technical:<br>
> >> the new k-d tree finds the exact nearest neighbor(s), not some<br>
> >> approximation. It supports up to 255 dimensions. It is dynamic, i.e.<br>
> >> points can be inserted and removed at any time. It is balanced to<br>
> >> improve search performance. It provides k nearest neighbor search<br>
> >> (find k neighbors to a given coordinate) as well as radius or<br>
> distance<br>
> >> search (find all neighbors within radius, i.e. not farther away than<br>
> >> radius to a given coordinate).<br>
> >><br>
> >> Markus M<br>
> >> _______________________________________________<br>
> >> grass-dev mailing list<br>
</div></div>> >> <a href="mailto:grass-dev@lists.osgeo.org" target="_blank">grass-dev@lists.osgeo.org</a> <mailto:<a href="mailto:grass-dev@lists.osgeo.org" target="_blank">grass-dev@lists.osgeo.org</a>><br>
<span>> >> <a href="http://lists.osgeo.org/mailman/listinfo/grass-dev" target="_blank">http://lists.osgeo.org/mailman/listinfo/grass-dev</a><br>
> >><br>
> ><br>
> ><br>
> ><br>
> > --<br>
> > Dr. Benjamin Ducke<br>
> > {*} Geospatial Consultant<br>
> > {*} GIS Developer<br>
> ><br>
> > Spatial technology for the masses, not the classes:<br>
> > experience free and open source GIS at <a href="http://gvsigce.org" target="_blank">http://gvsigce.org</a><br>
> > _______________________________________________<br>
> > grass-dev mailing list<br>
</span>> > <a href="mailto:grass-dev@lists.osgeo.org" target="_blank">grass-dev@lists.osgeo.org</a> <mailto:<a href="mailto:grass-dev@lists.osgeo.org" target="_blank">grass-dev@lists.osgeo.org</a>><br>
<span>> > <a href="http://lists.osgeo.org/mailman/listinfo/grass-dev" target="_blank">http://lists.osgeo.org/mailman/listinfo/grass-dev</a><br>
> _______________________________________________<br>
> grass-dev mailing list<br>
</span>> <a href="mailto:grass-dev@lists.osgeo.org" target="_blank">grass-dev@lists.osgeo.org</a> <mailto:<a href="mailto:grass-dev@lists.osgeo.org" target="_blank">grass-dev@lists.osgeo.org</a>><br>
<div><div>> <a href="http://lists.osgeo.org/mailman/listinfo/grass-dev" target="_blank">http://lists.osgeo.org/mailman/listinfo/grass-dev</a><br>
><br>
><br>
<br>
<br>
<br>
--<br>
Dr. Benjamin Ducke<br>
{*} Geospatial Consultant<br>
{*} GIS Developer<br>
<br>
Spatial technology for the masses, not the classes:<br>
experience free and open source GIS at <a href="http://gvsigce.org" target="_blank">http://gvsigce.org</a><br>
_______________________________________________<br>
grass-dev mailing list<br>
<a href="mailto:grass-dev@lists.osgeo.org" target="_blank">grass-dev@lists.osgeo.org</a><br>
<a href="http://lists.osgeo.org/mailman/listinfo/grass-dev" target="_blank">http://lists.osgeo.org/mailman/listinfo/grass-dev</a><br>
</div></div></blockquote></div><br></div></div>