[Live-demo] Review of GeoKettle Overview doc

Cameron Shorter cameron.shorter at gmail.com
Sun Aug 1 05:49:48 EDT 2010


Thierry,
Excellent docs, thank you. A user will get a very good idea about what 
GeoKettle can do from this explanation.

I've made a few minor syntax changes which don't change meaning at all.


  GeoKettle¶ <cid:part1.09080200.09040308 at gmail.com>


    Business Intelligence¶ <cid:part2.07060001.01060603 at gmail.com>

GeoKettle is a "spatially-enabled" version of Pentaho Data Integration 
(Kettle) <http://www.pentaho.com/products/data_integration/>. It is a 
powerful, metadata-driven spatial ETL (Extract, Transform and Load) tool 
dedicated to the integration of different data sources for building and 
updating geospatial databases and data warehouses.

GeoKettle enables the Extraction of data from data sources, the 
Transformation of data in order to correct errors, make some data 
cleansing, change the data structure, make them compliant to defined 
standards, and the Loading of transformed data into a target DataBase 
Management System (DBMS), GIS file, or geospatial web service.

GeoKettle is particularly useful when a user wants to automate complex 
and repetitive data processing without producing any specific code, to 
make conversions between various data formats, to migrate data from one 
DBMS to another, to perform some data feeding tasks into various DBMS, 
to populate analytical data warehouses for decision support purposes, etc.

In the geospatial domain, Geokettle compares to FME, a proprietary 
spatial ETL tool edited by Safe Software. GeoKettle is stable, fast, 
standards compliant, with hundreds of functions and read/write support 
for many file formats, services and DBMS. GeoKettle is used by diverse 
organisations from around the world, including governmental agencies, 
banks, insurance and geospatial system integrators.


      Core Features¶ <cid:part3.03080300.00010407 at gmail.com>

    * Extract data from:
          o 35+ database types: MySQL, PostgreSQL, Oracle, ...
          o XML files
          o XLS files
          o Xbase files (dBase, Foxpro, etc)
          o File systems information
          o Generated data
          o MS Access files
          o LDAP
          o Geospatial data formats: Shapefile, ...
    * Transformation of data:
          o Engine based data transfer (no code generator)
          o Looking up data in databases, files or memory
          o Calculating
          o Scripting: Javascript, SQL, RegExp
          o Splitting
          o Mapping
          o Selecting
          o Partitioning
          o Filtering
          o Merging
          o Joining
          o Duplicating
          o Clustering (MPP)
          o Pivotting
          o Geospatial data analysis and processing
    * Load data into a target format:
          o Database loads
          o Data warehouse population
          o Partitioned loading
          o Bulk loading
          o Parallel loading
          o Clustering
    * Environment:
          o Full GUI named "Spoon" to edit every transformation options
          o Command line tools: execute jobs and transformations
          o Web server: remote execution and clustering perfect in cloud
            computing environment for very large datasets processing
          o Programming API for Java
          o Plugin eco-system


      Implemented Standards¶ <cid:part4.02070206.09050105 at gmail.com>

    * OGC standards compliant (SFS)


      Details¶ <cid:part5.00040200.09030307 at gmail.com>

*Website:* http://www.geokettle.org/

*Licence:* GNU Lesser General Public License (LGPL) version 2.1

*Software Version:* 3.2.0-20090609

*Supported Platforms:* Windows, Linux, Mac, Solaris

*API Interfaces:* Java, Javascript

*Support:* http://www.spatialytics.org & http://www.spatialytics.com

-- 
Cameron Shorter
Geospatial Director
Tel: +61 (0)2 8570 5050
Mob: +61 (0)419 142 254

Think Globally, Fix Locally
Geospatial Solutions enhanced with Open Standards and Open Source
http://www.lisasoft.com

-------------- next part --------------
Skipped content of type multipart/related


More information about the Live-demo mailing list