NationalLibraryOfNorway/meteor
A python module and REST API for automatic extraction of metadata from PDF files
Apache License 2.0
11 stars, 2 forks
issues
More robust URL verification using HEAD requests
#30
osma
closed
6 days ago
1
WIP: Metadata extraction using LLM API service
#29
osma
opened
3 months ago
1
fix: Edit dimo pvc claim name to new one (TT-1606)
#28
Sindrir
closed
4 months ago
0
build: Update ci-file, remove linux tag
#27
MariusLevang
closed
6 months ago
0
ci: deploy to stage from main only
#26
pierrebeauguitte
closed
7 months ago
0
fix: use correct pvc name
#25
pierrebeauguitte
closed
7 months ago
0
ci: build and deploy from runner (TT-1536)
#24
pierrebeauguitte
closed
7 months ago
0
fix: look for 'e'/'p' to choose electronic standard number (TT-944)
#23
pierrebeauguitte
closed
8 months ago
0
fix: Show publisher in frontend also when name not found in registry (TT-1060)
#22
pierrebeauguitte
closed
8 months ago
0
Support for LLM backend: implementation plan
#21
osma
opened
9 months ago
3
perf: Use flag to skip images in page blocks (TT-1336)
#20
pierrebeauguitte
closed
10 months ago
0
ci: Add tag/version to Docker push job. Move CI jobs to single file (TT-1334)
#19
pierrebeauguitte
closed
10 months ago
0
feat: Identify outdated authority posts and sort them last
#18
pierrebeauguitte
closed
11 months ago
0
chore: update dependencies (TT-1235)
#17
fredrikmonsen
closed
12 months ago
0
feat: Add support for OCR / ALTO XML files (TT-1083)
#16
pierrebeauguitte
closed
1 year ago
0
fix: catch apostrophes in author names (TT-989)
#15
fredrikmonsen
closed
1 year ago
0
feat: remove dashes from ISBN (TT-1125)
#14
fredrikmonsen
closed
1 year ago
0
TT-1082: Add env parameter to specify REST service path
#13
pierrebeauguitte
closed
1 year ago
0
TT-1037: Create dummy pdf report for tests
#12
fredrikmonsen
closed
1 year ago
0
TT-1072: Move (most) configuration to pyproject file
#11
pierrebeauguitte
closed
1 year ago
0
TT-1065: Add gielladetect as an optional dependency
#10
pierrebeauguitte
closed
1 year ago
0
TT-1042: Read language codes for initialized files from .env
#9
fredrikmonsen
closed
1 year ago
0
EXT-7: Handle LangDetectException gracefully, return correct HTTP status codes
#8
pierrebeauguitte
closed
1 year ago
0
Language detection causes traceback with documents that don't have a text layer
#7
osma
closed
1 year ago
0
TT-1045: Replace pytextcat language classifier with langdetect module
#6
pierrebeauguitte
closed
1 year ago
0
Clarify license: Apache vs. GPL
#5
osma
closed
1 year ago
1
Feature request: Language detection for Finnish and Swedish
#4
osma
closed
1 year ago
2
TT-1039: Update instructions for diff-script
#3
fredrikmonsen
closed
1 year ago
0
TT-1035: Replace DIMO terminology with simple terms
#2
pierrebeauguitte
closed
1 year ago
0
TT-1034: Add github action for lint, typecheck, and test
#1
pierrebeauguitte
closed
1 year ago
0