WBC archive served via HTTP API and as a web front-end application.
Docker Compose running the following:
macbre/wbc can fetch and convert DjVu files to an XML format that can be indexed by SphinxSE.
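As an illustration of what "XML that can be indexed" might look like, here is a minimal sketch that assumes the XML follows Sphinx's xmlpipe2 layout. The field names (title, content) and the to_xmlpipe2 helper are hypothetical and are not taken from macbre/wbc.

```python
from xml.sax.saxutils import escape


def to_xmlpipe2(documents):
    """Render (doc_id, title, content) tuples as an xmlpipe2 docset.

    Assumption: the field names "title" and "content" are illustrative only;
    the real schema used by macbre/wbc may differ.
    """
    lines = [
        '<?xml version="1.0" encoding="utf-8"?>',
        "<sphinx:docset>",
        "<sphinx:schema>",
        '<sphinx:field name="title"/>',
        '<sphinx:field name="content"/>',
        "</sphinx:schema>",
    ]
    for doc_id, title, content in documents:
        # Document ids must be unique positive integers.
        lines.append(f'<sphinx:document id="{int(doc_id)}">')
        # Escape &, < and > so that text extracted from scans stays valid XML.
        lines.append(f"<title>{escape(title)}</title>")
        lines.append(f"<content>{escape(content)}</content>")
        lines.append("</sphinx:document>")
    lines.append("</sphinx:docset>")
    return "\n".join(lines)


if __name__ == "__main__":
    print(to_xmlpipe2([(1, "Kronika", "Sample text & more")]))
```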
Run the following:
docker-compose up -d sphinx
cd app && virtualenv env -p python3.8 && source env/bin/activate && pip install -e . && ./server_debug.sh
The local instance of wbc.macbre.net should be ready at http://0.0.0.0:8080/
Each endpoint below needs to be prefixed with /api/v1 (e.g. /api/v1/search?q=foo).
GET /publications
List of all publications
GET /publications/{id}
Metadata of a given publication
GET /issues/{id}
Get all documents in a given issue
GET /documents/{id}
Get a given document
GET /documents/{id}.txt
Get a given document as a plain-text (.txt) file
GET /search?q={query}
Search within all publications
GET /suggest?q={query}
Return search suggestions
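The endpoints above could be called from Python roughly as follows. This is a sketch, not an official client: it assumes the local debug instance address, assumes that /search responds with JSON, and the helper names (fetch, search, document_text) are made up for illustration.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Local debug instance plus the /api/v1 prefix (adjust for other deployments).
API_ROOT = "http://0.0.0.0:8080/api/v1"


def fetch(path, query=None, opener=urlopen):
    """GET an API path and return the response body as text."""
    url = API_ROOT + path
    if query:
        url += "?" + urlencode(query)
    with opener(url) as response:
        return response.read().decode("utf-8")


def search(query, opener=urlopen):
    """Call GET /search?q={query}. Assumption: the response body is JSON."""
    return json.loads(fetch("/search", {"q": query}, opener))


def document_text(doc_id, opener=urlopen):
    """Call GET /documents/{id}.txt and return the plain text."""
    return fetch(f"/documents/{doc_id}.txt", opener=opener)
```

With the server running, search("foo") requests /api/v1/search?q=foo. The opener parameter exists only so the HTTP call can be replaced with a stub in tests.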
acme.sh --issue -d wbc.macbre.net --stateless --force
http://s3.macbre.net/wbc/kronika_gazeta_wielkiego_ksiestwa.xml.gz (indexed by macbre/wbc)
Run make index to index the XML file in Sphinx:
using config file '/opt/sphinx/conf/sphinx.conf'...
indexing index 'wbc'...
collected 11980 docs, 246.9 MB
sorted 35.1 Mhits, 100.0% done
total 11980 docs, 246858497 bytes
total 318.765 sec, 774419 bytes/sec, 37.58 docs/sec
total 97 reads, 1.865 sec, 2095.4 kb/call avg, 19.2 msec/call avg
total 1650 writes, 0.733 sec, 390.8 kb/call avg, 0.4 msec/call avg
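The throughput figures in the indexer output follow from the totals it prints. A small sketch that recomputes them from the log text (the indexing_rates helper is hypothetical, and the regular expressions assume the exact line format shown above):

```python
import re


def indexing_rates(log_text):
    """Return (docs_per_sec, bytes_per_sec) recomputed from the indexer totals."""
    totals = re.search(r"total (\d+) docs, (\d+) bytes", log_text)
    timing = re.search(r"total ([\d.]+) sec,", log_text)
    if not totals or not timing:
        raise ValueError("indexer summary lines not found")
    docs, size = int(totals.group(1)), int(totals.group(2))
    seconds = float(timing.group(1))
    return docs / seconds, size / seconds


if __name__ == "__main__":
    sample = (
        "total 11980 docs, 246858497 bytes\n"
        "total 318.765 sec, 774419 bytes/sec, 37.58 docs/sec\n"
    )
    docs_per_sec, bytes_per_sec = indexing_rates(sample)
    print(f"{docs_per_sec:.2f} docs/sec, {bytes_per_sec:.0f} bytes/sec")
```

The recomputed byte rate can differ from the printed one by a few bytes per second, because the elapsed time in the log is rounded to milliseconds.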