[OSGeo-Discuss] Automatic geocoding of PDF documents

slesage slesage at geo.gob.bo
Tue Jan 17 14:44:28 EST 2012


Thank you,

very useful for us, as are the links to geocoding/geosearch tools 
from other list members. We will review the state of the art, and 
as soon as we have written something substantial, I will send the 
reference here.

Sylvain Lesage

On 2012-01-17 13:24, Stephen Woodbridge wrote:
> Ahhh! found the reference I was looking for. This will show relevant 
> links:
>
> https://www.google.com/#q="geo+search"+egnor
>
> Hope this is useful.
>
> -Steve
>
> On 1/17/2012 11:58 AM, Stephen Woodbridge wrote:
>> Here are some more links that you might find useful.
>>
>> http://www.biomedcentral.com/1471-2105/10/385
>> http://www.ijcte.org/papers/005.pdf
>> http://www.e-perimetron.org/Vol_4_1/Martins_et_al.pdf
>> http://nltk.googlecode.com/svn/trunk/doc/book/ch07.html
>>
>> I cannot find a potentially excellent reference that was done about 5
>> years ago by maybe a GSoC student who I think was then hired by
>> Google. Basically, he wrote a document parser that looked for
>> location references in the text and then tagged the document with
>> locations and lat/longs. If I remember correctly, it was a
>> gazetteer-based system, it is open source, and it was also online
>> somewhere.
>>
>> -Steve
>>
>> On 1/13/2012 6:00 PM, slesage wrote:
>>> Hi,
>>>
>>> does anybody know of any open source software dedicated to automatic
>>> geocoding of text documents? The idea of that "black box" would be:
>>> * give, as input, a text document or a PDF,
>>> * receive, as output, a list of place names with their coordinates /
>>> a map of POIs corresponding to those places.
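
For the output side of that "black box", one possible shape is a GeoJSON FeatureCollection, which most web mapping tools can display as POIs. This is only a sketch: the function name `to_geojson` and the (name, lat, lon) tuple layout are assumptions made for illustration, and the sample coordinates are approximate.

```python
import json


def to_geojson(places):
    """Build a GeoJSON FeatureCollection from (name, lat, lon) tuples.

    The tuple layout is an assumption made for this sketch.
    """
    features = [
        {
            "type": "Feature",
            # GeoJSON stores positions as [longitude, latitude].
            "geometry": {"type": "Point", "coordinates": [lon, lat]},
            "properties": {"name": name},
        }
        for name, lat, lon in places
    ]
    return {"type": "FeatureCollection", "features": features}


if __name__ == "__main__":
    # Approximate coordinates, for illustration only.
    sample = [("La Paz", -16.50, -68.15)]
    print(json.dumps(to_geojson(sample), indent=2))
```
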
>>>
>>> Using the geonames database (http://www.geonames.org/), the solution
>>> appears to be just a full-text search, which could be done using
>>> Lucene (https://lucene.apache.org/java/docs/index.html).
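
A minimal sketch of that lookup idea, in plain Python rather than Lucene: it scans the text for the longest word sequences that appear in a gazetteer. The tiny `GAZETTEER` dictionary is a hypothetical stand-in for the real geonames data, with approximate coordinates, and `find_places` is a name invented for this example. It does exact matching only, so it does not handle ambiguous names or spelling variants.

```python
import re

# Hypothetical mini-gazetteer: lowercase name -> (lat, lon).
# Approximate values, standing in for a real geonames extract.
GAZETTEER = {
    "la paz": (-16.50, -68.15),
    "cochabamba": (-17.39, -66.16),
    "santa cruz de la sierra": (-17.78, -63.18),
}

# Longest gazetteer entry, in words; bounds the lookup window.
MAX_WORDS = max(len(name.split()) for name in GAZETTEER)


def find_places(text):
    """Return (name, lat, lon) tuples for gazetteer names found in text.

    At each position the longest matching word sequence wins, so
    "santa cruz de la sierra" is preferred over any shorter entry.
    """
    words = re.findall(r"\w+", text.lower())
    found = []
    i = 0
    while i < len(words):
        for n in range(min(MAX_WORDS, len(words) - i), 0, -1):
            candidate = " ".join(words[i:i + n])
            if candidate in GAZETTEER:
                lat, lon = GAZETTEER[candidate]
                found.append((candidate, lat, lon))
                i += n  # skip past the matched words
                break
        else:
            i += 1  # no entry starts at this word
    return found


if __name__ == "__main__":
    for place in find_places("The road from La Paz to Cochabamba was closed."):
        print(place)
```

With the full geonames data, the dictionary lookup would be replaced by a query against an index, which is where a full-text engine comes in.
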
>>>
>>> I found the MetaCarta solution
>>> (http://www.metacarta.com/products-platform-geotag.htm) but couldn't
>>> find any open source solution.
>>>
>>> Thanks for your suggestions.
>>>
>>> Sylvain Lesage.
>>> _______________________________________________
>>> Discuss mailing list
>>> Discuss at lists.osgeo.org
>>> http://lists.osgeo.org/mailman/listinfo/discuss


