issues
search
qurator-spk
/
mods4pandas
Extract the MODS/ALTO metadata of a bunch of METS/ALTO files into pandas DataFrames for data analysis
Apache License 2.0
11
stars
0
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Include top-level structMap type in mods_info?
#46
mikegerber
opened
2 months ago
0
Can't export page info due to OOM
#45
mikegerber
opened
3 months ago
2
Review `--help`
#44
mikegerber
opened
3 months ago
0
ERROR:mods4pandas:Exception in digisam_mets/PPN188698431X.xml: No structMap[@TYPE='PHYSICAL'] found (but not a multivolume work)
#43
mikegerber
opened
3 months ago
0
Add time to log output
#42
mikegerber
opened
3 months ago
0
Switch to GitHub Actions
#41
mikegerber
opened
3 months ago
0
Factor out CSV + Excel support
#40
mikegerber
closed
3 months ago
0
Don't use absolute path for "mets_file" column
#39
mikegerber
opened
3 months ago
0
ERROR:mods4pandas:Exception in digisam_mets/PPN746979266.xml: Specification mandates value for attribute x, line 3, column 786189 (PPN746979266.xml, line 3)
#38
mikegerber
closed
3 months ago
1
Set proper dtypes (compatible with Parquet)
#37
mikegerber
opened
3 months ago
0
Move on to Parquet format
#36
mikegerber
closed
3 months ago
1
PPN863433170: No smlinks to parent elements in structMap?
#35
mikegerber
opened
3 months ago
0
dtype for indicator variable
#34
mikegerber
closed
3 months ago
1
ValueError: numpy.dtype size changed, may indicate binary incompatibility
#33
mikegerber
closed
3 months ago
5
Migrate to pyproject.toml
#32
mikegerber
closed
3 months ago
2
Merge feat/page_info
#31
mikegerber
closed
3 months ago
1
Group names given in the MODS-file according to given roles to reduce number of columns
#30
joergleh
opened
11 months ago
1
Test on Python 3.12
#29
mikegerber
closed
3 months ago
2
Don't use the Python namespace qurator
#28
mikegerber
closed
3 months ago
1
Documentation of the fields exported
#27
mikegerber
opened
1 year ago
0
Structure information
#26
mikegerber
opened
1 year ago
16
Use Case: Aggregate number of pages/canvases across multiple METS derived from search query
#25
ch-sander
opened
1 year ago
8
Missing subject/topic, genre
#24
mikegerber
opened
1 year ago
3
Missing information from the original METS/MOTS
#23
mikegerber
opened
1 year ago
12
One or more element has unexpected attributes: mods:recordIdentifier source="dnb-ppn"
#22
mikegerber
closed
1 year ago
3
Add missing information for "original" PPNs
#21
BibWiss
closed
1 year ago
7
Group name columns by role
#20
mikegerber
opened
1 year ago
0
README should show some results
#19
mikegerber
opened
1 year ago
0
alto4pandas: LANG + language
#18
mikegerber
opened
2 years ago
0
Use test data in qurator/modstool/tests/data
#17
mikegerber
opened
2 years ago
0
Review XXXs and TODOs
#16
mikegerber
opened
2 years ago
0
Better name for altotool
#15
mikegerber
opened
2 years ago
1
Smarter handling of namespaces
#14
mikegerber
opened
2 years ago
0
Review imports
#13
mikegerber
opened
2 years ago
0
Integration of ALTO metadata
#12
mikegerber
closed
2 years ago
19
Fix tests on Python 3.10
#11
mikegerber
closed
2 years ago
0
Update docs
#10
mikegerber
opened
2 years ago
0
Optionally export to Excel/CSV
#9
mikegerber
closed
2 years ago
0
Improve documentation of TagGroup and mods_to_dict
#8
mikegerber
opened
2 years ago
0
More than one instance: <mods:shelfLocator>
#7
mikegerber
closed
2 years ago
2
ValueError: Unknown tag "{http://www.loc.gov/mods/v3}partName"
#6
mikegerber
closed
2 years ago
0
Document columns
#5
mikegerber
opened
2 years ago
1
Handle multiple mods:role/mods:roleTerm
#4
mikegerber
closed
2 years ago
1
Multiple language tags vs multiple languageTerm tags
#3
mikegerber
closed
2 years ago
1
Include METS metadata
#2
mikegerber
opened
2 years ago
2
MODS "name" changes
#1
mikegerber
opened
2 years ago
3