[OSGeo-Discuss] Automatic geocoding of PDF documents

Stephen Woodbridge woodbri at swoodbridge.com
Tue Jan 17 12:24:15 EST 2012


Ahhh! found the reference I was looking for. This will show relevant links:

https://www.google.com/#q="geo+search"+egnor

Hope this is useful.

-Steve

On 1/17/2012 11:58 AM, Stephen Woodbridge wrote:
> Here are some more links that you might find useful.
>
> http://www.biomedcentral.com/1471-2105/10/385
> http://www.ijcte.org/papers/005.pdf
> http://www.e-perimetron.org/Vol_4_1/Martins_et_al.pdf
> http://nltk.googlecode.com/svn/trunk/doc/book/ch07.html
>
> I can not find a potentially excelent reference that was done by about 5
> years ago by maybe a GSoC student that I think then hired on with
> Google, but the basically he wrote a document parser that looked for
> location references in the text and then tagged the document with
> locations and lat/longs. If I remember correctly it as a gazetteer based
> system and it is open source and was online somewhere also.
>
> -Steve
>
> On 1/13/2012 6:00 PM, slesage wrote:
>> Hi,
>>
>> does anybody knows about some opensource software dedicated to automatic
>> geocoding of text documents ? The idea of that "black box" would be:
>> * give, as an input, a text document or a PDF,
>> * receive, as an output, a list of place names with their coordinates /
>> a map of POI corresponding to that places.
>>
>> Using the geonames database (http://www.geonames.org/), the solution
>> appears to be only a fulltext search, that could be done using Lucene
>> (https://lucene.apache.org/java/docs/index.html).
>>
>> I found the metacarta solution
>> (http://www.metacarta.com/products-platform-geotag.htm) but couldn't
>> find any opensource solution.
>>
>> Thanks for your suggestions.
>>
>> Sylvain Lesage.
>> _______________________________________________
>> Discuss mailing list
>> Discuss at lists.osgeo.org
>> http://lists.osgeo.org/mailman/listinfo/discuss
>
> _______________________________________________
> Discuss mailing list
> Discuss at lists.osgeo.org
> http://lists.osgeo.org/mailman/listinfo/discuss



More information about the Discuss mailing list