issues
search
Princeton-CDH
/
ppa-django
Princeton Prosody Archive v3.x - Python/Django web application
http://prosody.princeton.edu
Apache License 2.0
4
stars
2
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Bump webpack from 5.90.0 to 5.94.0
#667
dependabot[bot]
opened
5 days ago
0
Bump micromatch and gulp
#666
dependabot[bot]
opened
1 week ago
0
write code so that all note types in eebo-xml appear at bottom of page instead of inline
#665
mnaydan
opened
2 weeks ago
0
PPA page indexing enhancement - index all records from one source (e.g. Hathi only); handle control-c
#662
mnaydan
closed
3 weeks ago
2
revise Gale image URLs handling (use image urls from Gale API)
#661
jerielizabeth
closed
2 weeks ago
7
Suppress linguistic items
#660
jerielizabeth
opened
1 month ago
0
check ppa subset of eebo-tcp xml contents (div types, inline elements, check for notes)
#659
rlskoeser
closed
2 weeks ago
12
Use Gale image url instead of image id to embed images
#658
rlskoeser
closed
1 month ago
0
Improve index pages: graceful exit on ctrl-c, index all from one source
#657
rlskoeser
closed
1 month ago
0
EEBO-TCP import
#656
rlskoeser
closed
1 month ago
0
Bump ws and socket.io
#655
dependabot[bot]
closed
1 month ago
1
Page indexing refactor
#654
rlskoeser
closed
2 months ago
0
Add developer notes for updating hathitrust, reindexing, and exporting text corpus
#653
rlskoeser
closed
3 months ago
0
modify the generate_corpus.py script to include timestamp in output file name
#652
mnaydan
closed
3 weeks ago
1
one-time script to extract subset of EEBO-TCP xml files and MARC records
#651
rlskoeser
closed
3 months ago
2
Grab images and text from HathiTrust for the entire corpus at the same time
#664
mnaydan
opened
3 months ago
0
Release v3.12.1
#650
rlskoeser
closed
4 months ago
0
Text-corpus script revisions
#649
rlskoeser
closed
4 months ago
0
Code to generate plain text page content from EEBO-TCP XML #641
#648
rlskoeser
closed
4 months ago
1
Improve search for hyphenated words wrapped around lines
#647
rlskoeser
closed
4 months ago
0
As a developer, I want a script that will download the HathiTrust page images for the PPA corpus from the HathiTrust image server.
#663
mnaydan
opened
4 months ago
0
Updates for solr9 compatibility
#646
rlskoeser
closed
4 months ago
0
work with PUL to investigate load balancer timeout possibility
#645
rlskoeser
closed
3 months ago
0
work with PUL to get ppa application logs into datadog
#644
rlskoeser
opened
4 months ago
0
Support full-text indexing for EEBO-TCP content
#643
mnaydan
closed
1 month ago
1
Write a version of the EEBO-TCP import that does metadata only
#642
mnaydan
closed
1 month ago
1
Write the mapping to read EEBO-TCP XML structure
#641
mnaydan
closed
4 months ago
5
Release v3.12
#640
rlskoeser
closed
4 months ago
0
prep v3.12 - software release checklist
#639
rlskoeser
closed
4 months ago
1
Admin excerpt source links, first page validation, hathitrust page urls
#638
rlskoeser
closed
4 months ago
0
additional changes related to excerpt stable id based on first original page
#637
rlskoeser
closed
4 months ago
0
address deprecation warnings and settings import issue
#636
rlskoeser
closed
5 months ago
0
Update excerpt ids and urls to use original pages to ensure they are stable ids
#635
rlskoeser
closed
5 months ago
0
New manage command to update excerpt digital page range
#634
rlskoeser
closed
5 months ago
0
Bump express from 4.18.1 to 4.19.2
#633
dependabot[bot]
closed
4 months ago
2
As a researcher, I want metadata about the PPA records for use with full-text corpus or other research.
#632
mnaydan
opened
5 months ago
0
As a team member, I want a way to generate PDFs of Editorial content so I can deposit them elsewhere.
#631
mnaydan
opened
5 months ago
0
Investigate image mining permissions for HathiTrust works
#630
mnaydan
opened
5 months ago
0
Bump webpack-dev-middleware from 5.3.3 to 5.3.4
#629
dependabot[bot]
closed
4 months ago
2
Refine "save as new" functionality for copying excerpts
#628
rlskoeser
closed
5 months ago
0
As an admin, I want to see a list of volumes that need to be manually reviewed so that I can resolve issues related to updates on HathiTrust’s end.
#627
mnaydan
opened
5 months ago
0
As an admin, I want to regularly rsync and reindex excerpts and articles that HathiTrust has updated so that we have the most current version of HathiTrust content.
#626
mnaydan
opened
5 months ago
0
As a developer, I want to one-time bulk fix HathiTrust excerpt page ranges from a spreadsheet so that we can pull correct page content when we reindex.
#625
mnaydan
closed
5 months ago
15
Feature/collect version labels
#624
laurejt
closed
5 months ago
0
v3.11.4
#623
rlskoeser
closed
5 months ago
1
Fix 1-based indexing when checking excerpt page ranges
#622
rlskoeser
closed
5 months ago
0
Implement and test 303 redirect for multiple cluster params
#621
rlskoeser
closed
5 months ago
0
Investigate scraping version labels from HathiTrust excerpts to determine frequency of rescanning/range changes
#620
mnaydan
closed
5 months ago
1
Redirect aggregated cluster URLs to main search page so that bots stop causing errors by crawling those fake URLs
#619
mnaydan
closed
5 months ago
2
Bump follow-redirects from 1.15.5 to 1.15.6
#618
dependabot[bot]
closed
5 months ago
2
Next