issues
search
andrew-thox
/
pb-journalist
Responsible for scraping sites
Eclipse Public License 1.0
0
stars
0
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Command to manually scrape site
#23
andrew-thox
opened
7 years ago
0
RSS Media/Thumbnails
#22
andrew-thox
opened
7 years ago
0
Queue may not exist
#21
andrew-thox
opened
7 years ago
1
Not processing RSS feed
#20
andrew-thox
opened
8 years ago
1
Avro articles are using a dynamic schema.
#19
andrew-thox
opened
8 years ago
0
Stop auto acknowlegement of queue messages
#18
andrew-thox
opened
8 years ago
0
Autodetect date format.
#17
andrew-thox
opened
8 years ago
1
Manual article additions
#16
andrew-thox
opened
8 years ago
0
Dublin Core Tags
#15
andrew-thox
opened
8 years ago
1
Other fields to potentially store
#14
andrew-thox
opened
8 years ago
0
Create some sort of debug mode.
#13
andrew-thox
opened
8 years ago
0
Record acquisition method
#12
andrew-thox
opened
8 years ago
1
Always returns the same list of articles.
#11
andrew-thox
closed
8 years ago
1
Add support for more outlets.
#10
andrew-thox
opened
8 years ago
0
Use avro
#9
andrew-thox
closed
8 years ago
1
When should migrations be run?
#8
andrew-thox
opened
8 years ago
0
Articles without time published.
#7
andrew-thox
opened
8 years ago
0
Record time article was acquired.
#6
andrew-thox
closed
8 years ago
1
Don't just run uberjar.
#5
andrew-thox
closed
8 years ago
0
Configure logging
#4
andrew-thox
opened
8 years ago
0
Almost certainly new-statesman archive adds the same articles multiple times to the queue.
#3
andrew-thox
closed
8 years ago
1
Determine what to do in the event that the queue is down?
#2
andrew-thox
opened
8 years ago
2
Get text of article
#1
andrew-thox
opened
8 years ago
1