[pdal] Presentation of pdal-parallelizer (new tool !)

ALBA Clément calba at lillemetropole.fr
Tue Jul 12 06:51:37 PDT 2022


Hi,

pdal-parallelizer is a command-line application that lets you parallelize your PDAL processing with Dask. It can replace tools like rush or GNU Parallel.

It provides two functions:
- process-pipelines: runs your processing on a collection of .las files
- process-copc: runs your processing on a large COPC file by splitting it into tiles
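
To give an idea of what splitting a large file for process-copc can look like, here is a minimal sketch that cuts a 2D bounding box into square tiles. It is an illustration under my own assumptions, not the code used in pdal-parallelizer: the function name split_bounds, the tile_size parameter and the (minx, miny, maxx, maxy) tuple layout are all hypothetical.

```python
import math
from typing import Iterator, Tuple

# Hypothetical layout for this sketch: (minx, miny, maxx, maxy)
Bounds = Tuple[float, float, float, float]


def split_bounds(bounds: Bounds, tile_size: float) -> Iterator[Bounds]:
    """Yield square tiles covering `bounds`, row by row.

    Tiles on the right and top edges are clipped to the bounding box,
    so they may be smaller than `tile_size`.
    """
    if tile_size <= 0:
        raise ValueError("tile_size must be positive")
    minx, miny, maxx, maxy = bounds
    # Count tiles with integers to avoid accumulating float error
    nx = math.ceil((maxx - minx) / tile_size)
    ny = math.ceil((maxy - miny) / tile_size)
    for j in range(ny):
        for i in range(nx):
            x0 = minx + i * tile_size
            y0 = miny + j * tile_size
            yield (x0, y0, min(x0 + tile_size, maxx), min(y0 + tile_size, maxy))


tiles = list(split_bounds((0, 0, 250, 100), 100))
```

Each tile can then be handled as an independent task, which is what makes the work easy to distribute.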

Both functions use a serialization system to guard against problems during execution. For example, if you have processed 100 tiles of your COPC file and your computer shuts down mid-run, you do not have to start again from the beginning: processing picks up where it left off.
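
The idea of resuming after a failure can be sketched as follows. This is only an illustration of the general technique (write each pending task to disk, delete it once it has succeeded), not the actual implementation in pdal-parallelizer. The names save_pending, run_pending and state_dir, and the JSON task format, are assumptions made for the example.

```python
import json
from pathlib import Path
from typing import Callable, Dict


def save_pending(tasks: Dict[str, dict], state_dir: Path) -> None:
    """Write one JSON file per task that still has to be processed."""
    state_dir.mkdir(parents=True, exist_ok=True)
    for name, task in tasks.items():
        (state_dir / f"{name}.json").write_text(json.dumps(task))


def run_pending(state_dir: Path, work: Callable[[dict], None]) -> int:
    """Run every task still on disk and return how many completed.

    A task file is deleted only after `work` returns, so a task that
    was interrupted stays on disk and is replayed on the next call.
    """
    done = 0
    for path in sorted(state_dir.glob("*.json")):
        work(json.loads(path.read_text()))
        path.unlink()  # finished: this task will not be replayed
        done += 1
    return done
```

After a crash, calling run_pending again on the same directory only processes the tasks whose files are still there.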

The documentation and code are available in this GitHub repository: https://github.com/meldig/pdal-parallelizer

The repository also includes example pipelines and example config files to help you get started with the tool.

pdal-parallelizer is still under development. Here are the planned improvements:

- Use the network aspect of Dask to run processing on a cluster of machines
- Use buffers in COPC processing
- Integrate the process-copc function into the process-pipelines function
- Support more reader formats

Of course, if you have any suggestions for improvement or run into any problem with pdal-parallelizer, you can open an issue on the GitHub repository.

Regards,
Clément Alba