issues
search
impresso
/
impresso-text-acquisition
🛠️ Python library to import OCR data in various formats into the canonical JSON format defined by the Impresso project.
https://impresso.github.io/impresso-text-acquisition/
GNU Affero General Public License v3.0
7
stars
2
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Feature/add rebuilt
#134
piconti
closed
1 month ago
1
Add the code for rebuilt to impresso-text-acquisition
#133
piconti
closed
1 month ago
0
[BCUL] - Reingest years 1866 and 1867 of `TouSuIl`
#132
piconti
opened
3 months ago
0
[FedGaz] - Duplicated pages
#131
piconti
opened
3 months ago
0
[BNL - Lux importer] Investigate and fix the logical matching of physical articles and content-items
#130
piconti
opened
6 months ago
1
Bcul acquisition
#129
piconti
closed
6 months ago
0
[CI-CD] Create GH Actions for releases and uploading package to Pypi
#128
piconti
opened
6 months ago
0
Bugfix invalid ci metadata
#127
piconti
closed
6 months ago
1
RERO 1 (Olive) - Incorrect coordinates to rescale
#126
piconti
opened
8 months ago
2
Data preparation: partition rebuilt data with Run AI
#125
e-maud
closed
4 months ago
1
Create generic canonical data patching script
#124
piconti
closed
7 months ago
1
KB importer
#123
piconti
opened
11 months ago
2
v1.0.0 – Revive and bugfixes
#122
piconti
closed
11 months ago
0
Update Text-importer dependencies and documenation, and fix small associated bugs
#121
piconti
closed
11 months ago
0
ONB importer
#120
e-maud
opened
1 year ago
1
Check BCUL IIIF endpoint
#119
e-maud
closed
8 months ago
6
BCUL importer
#118
e-maud
closed
6 months ago
3
[Various Importers] Inconsistencies in iiif links and coordinates in code and JSONs
#117
piconti
closed
6 months ago
18
Re-ingest canonical for modified importers
#116
piconti
closed
6 months ago
1
Bump certifi from 2019.6.16 to 2022.12.7
#115
dependabot[bot]
closed
6 months ago
1
Bump py from 1.8.0 to 1.10.0
#114
dependabot[bot]
closed
6 months ago
1
Bump mistune from 0.8.4 to 2.0.3
#113
dependabot[bot]
closed
6 months ago
1
Pipfile cannot be locked currently
#112
aflueckiger
opened
4 years ago
0
introduce a file lock to prevent overwriting of issues
#111
aflueckiger
closed
4 years ago
0
Consistency issues in FedGazDe data
#110
mromanello
closed
4 years ago
3
prevent data inconsistencies due to the partioning of issues by dask
#109
aflueckiger
closed
4 years ago
0
partitioning by dask may lead to severe data inconsistencies
#108
aflueckiger
closed
4 years ago
1
PR to merge current master into this branch
#107
aflueckiger
closed
4 years ago
0
Bnf en acquisition
#106
ehoelzl
closed
4 years ago
0
[Rero importer] invalid issue JSON output
#105
mromanello
closed
6 months ago
0
[BNF importer] invalid issue JSON output
#104
mromanello
closed
6 months ago
1
change in BNL's ARK-based URLs
#103
mromanello
closed
6 months ago
3
missing IIIF links for some BNF-EN newspaper issue
#102
mromanello
opened
4 years ago
2
Fix bnf
#101
ehoelzl
closed
4 years ago
0
Consistency issue BNF data
#100
ehoelzl
closed
4 years ago
0
Hotfix/swa
#99
ehoelzl
closed
4 years ago
0
integrity issues in SWA data
#98
mromanello
closed
4 years ago
3
Hotfix/swa
#97
ehoelzl
closed
4 years ago
0
integrity issues in BNF data
#96
mromanello
closed
4 years ago
0
`handelstztg` integrity issues
#95
mromanello
closed
4 years ago
0
Disentangle the generic tetml importer and the fedgaz importer
#94
aflueckiger
closed
4 years ago
0
merging TETML/FedGaz importers
#93
aflueckiger
closed
4 years ago
1
incomplete ingestion when chunk-size is defined
#92
aflueckiger
closed
4 years ago
1
Tetml importer
#91
aflueckiger
closed
4 years ago
1
exception when installing library
#90
mromanello
opened
4 years ago
5
Newspaper information on genealogy seems weird
#89
simon-clematide
opened
4 years ago
0
L'essor 2006-2015 has terrible text recognition
#88
simon-clematide
opened
4 years ago
0
Bnf acquisition
#87
ehoelzl
closed
4 years ago
0
BNL-Bugfix
#86
ehoelzl
closed
4 years ago
0
Bl acquisition
#85
ehoelzl
closed
4 years ago
0
Next