welfare-state-analytics/blm-corpus
Code and issues related to Bonniers litterära magasin at KB lab
issues
Map article ID to TOCs
#42
MansMeg
opened
5 months ago
0
Add metadata to the files for Humlab
#41
MansMeg
opened
5 months ago
2
Update output files from topic model runs
#40
MansMeg
opened
5 months ago
1
Minor fix in novels metadata
#39
MansMeg
opened
6 months ago
0
Fix metadata for novel-corpus
#38
MansMeg
opened
7 months ago
0
Demo for LB, novels and BLM
#37
MansMeg
opened
7 months ago
0
Store the TOC in an XML-format for each BLM edition
#36
MansMeg
opened
10 months ago
0
Add true page for editions to metadata
#35
liamtabib
closed
1 year ago
4
Potential issues with BLM curation
#34
liamtabib
closed
1 year ago
2
Segment reviews
#33
liamtabib
opened
1 year ago
9
Create a mapping between article ids and the table of contents files
#32
MansMeg
closed
1 year ago
4
Create a blm-corpus-supplementary-material repo
#31
MansMeg
closed
1 year ago
7
Take a random sample of 20 pages and 3 rows per page
#30
MansMeg
closed
1 year ago
1
Unit test that all ids are unique
#29
MansMeg
closed
1 year ago
3
Add page numbers to all pages
#28
MansMeg
closed
1 year ago
3
Extract and structure registers and table of contents into XML files
#27
MansMeg
opened
1 year ago
2
Minor issues to fix after demonstration
#26
MansMeg
closed
1 year ago
2
Handle Header after Header (discuss this issue later)
#25
magan6
opened
1 year ago
1
Change the <pb>
#24
magan6
closed
1 year ago
1
Re-OCR the BLM corpus
#23
magan6
closed
1 year ago
1
Create a Python file/library
#22
magan6
opened
1 year ago
1
Add to test suite: check that all UUIDs are unique
#21
magan6
closed
1 year ago
7
Add designer information to the metadata
#20
MansMeg
closed
1 year ago
15
Move handling of number of words and headers metadata to functions in the library
#19
MansMeg
closed
1 year ago
2
Setup a quality annotation for the segmentation of articles
#18
magan6
opened
1 year ago
0
Add UUID to <Headers>, <Page_headers> and <p>
#17
magan6
closed
1 year ago
2
Lost edition BLM 2002:1
#16
magan6
closed
1 year ago
2
Metadata: add number of words and number of headers to each edition
#15
magan6
closed
1 year ago
1
Identify page headers
#14
MansMeg
closed
1 year ago
4
Lost edition 1940:7 (issue KB-lab)
#13
magan6
closed
1 year ago
1
Add test to the test suite
#12
magan6
closed
1 year ago
1
Add tests to the test suite
#11
MansMeg
closed
1 year ago
4
Add consistency check in the test suite
#10
MansMeg
closed
1 year ago
0
Add uuid for each page in the corpus
#9
MansMeg
closed
1 year ago
2
Setup a quality annotation for the headers
#8
MansMeg
closed
1 year ago
1
Identify the reviews from the segmented articles
#7
magan6
opened
1 year ago
0
Identification of authors in the corpus
#6
magan6
opened
1 year ago
1
Create a metadata file on editions
#5
MansMeg
closed
1 year ago
6
Segment articles
#4
MansMeg
closed
1 year ago
1
Adding page breaks with betalab links to the pages in the corpus
#3
MansMeg
closed
1 year ago
2
Identification of headers in the corpus
#2
MansMeg
closed
1 year ago
1
Create a test suite
#1
MansMeg
closed
1 year ago
1