Perdido: Python library for geoparsing and geocoding French texts - Université de Pau et des Pays de l'Adour Access content directly
Conference Papers Year :

Perdido: Python library for geoparsing and geocoding French texts

Abstract

This paper introduces the Perdido Python library for geoparsing and geocoding French texts. The architecture of the Perdido Geoparser, which includes three layers: back-office, API, and Python library, is outlined. We also provide details on the methods used in the development of the processing chain and the various tasks covered, such as named entity recognition and classification (NERC), and toponym resolution. Lastly, we showcase the different features of the Python library and explain how to use it. The library is built as an overlay using API services, enabling users to manipulate, visualize, and export the results of geoparsing and geocoding. A Jupyter notebook is also provided to demonstrate all the functionalities implemented in the library.
Fichier principal
Vignette du fichier
GeoExT___ECIR_2023.pdf (460.04 Ko) Télécharger le fichier
Origin : Publisher files allowed on an open archive
Licence : CC BY - Attribution

Dates and versions

hal-04049794 , version 1 (28-03-2023)

Licence

Attribution

Identifiers

  • HAL Id : hal-04049794 , version 1

Cite

Ludovic Moncla, Mauro Gaio. Perdido: Python library for geoparsing and geocoding French texts. First International Workshop on Geographic Information Extraction from Texts (GeoExT), Apr 2023, Dublin, Ireland. ⟨hal-04049794⟩
12 View
17 Download

Share

Gmail Facebook Twitter LinkedIn More