-
Ticket to strategize getting a sample of full text articles into the toponym resolution pipeline.
-
We should consider implementing the different linking algorithms in subclasses of `Linker`.
This would avoid redundant logic based on String parameters like:
```python
def run(self, dict_mention:…
-
The • character appears relatively frequently in our newspaper data, and the toponym resolution pipeline doesn't no how to handle it. This causes the API to return an error.
E.g.
Input:
```
{'se…
-
Add requests here for research requirements
-
If you call the `run_sentence` function on an empty string (`""`) or whitespace (`" "`) you receive the following error:
```
>>> geoparser.run_sentence(" ")
UnboundLocalError …
-
Antrag: "Die automatische Extraktion unterstützt hochgeladene Geodaten, Links zu ausgewählten Diensten, und die Herleitung von Geodaten über Titel und Abstract des Artikels mit einem Gazetteer."
--…
nuest updated
11 months ago
-
Goal: document processing times for different settings in the pipeline (esp. DeezyMatch vs. perfectmatch)
Use sample set of articles from _The Sun_ for this test.
TASKS
- [x] confirm different…
-
The `resources/` subdirectory is .gitignored due to the large size of the data files on which T-Res depends, but there is currently no shared storage location containing a canonical set of resources.
…
-
I understand that the address labels are based on OpenCageData's adress-format, but I feel that the people behind lack a complete understanding of gridion city planning (https://en.wikipedia.org/wiki/…
-
It would be nice to have IJ-digraphs for Dutch and other texts which use it.
These are some quick and ugly versions I made, I can supply them in vector format if needed as a base (.eps?)
![ij](htt…