nasa-jpl-memex/sce
Sparkler Crawl Environment - a packaged, dockerized version of http://github.com/USCDataScience/sparkler.git
http://irds.usc.edu/sparkler/
Apache License 2.0
4 stars
3 forks
issues
Solr keeps shutting down
#49
rduerr
opened
5 years ago
0
create k8s compatible deployment
#48
buggtb
opened
5 years ago
0
remove flask and switch to gunicorn or similar
#47
buggtb
opened
5 years ago
0
tidy up url paths
#46
buggtb
opened
5 years ago
0
separate the build components out of the docker file
#45
buggtb
opened
5 years ago
0
create travis build for sce
#44
buggtb
opened
5 years ago
0
Update wiki to include instructions on GUI capabilities currently not mentioned
#43
rduerr
opened
5 years ago
0
And now ./dumper.sh is not working either
#42
rduerr
opened
5 years ago
0
The crawl dashboard is not coming up
#41
rduerr
closed
5 years ago
3
Anyone seen this error before?
#40
rduerr
opened
5 years ago
3
Documentation on how to construct a keyword list needs to be added to wiki or somewhere accessible
#39
rduerr
opened
5 years ago
1
Need visual indication that a choice has been made for a given URL.
#38
rduerr
closed
5 years ago
0
allow search operators in search box
#37
sjskhalsa
opened
5 years ago
0
Retrieved 12 webpages in UI are already colored before voting for relevancy
#36
ahmadika
opened
6 years ago
3
Banana dashboard needs tweaking
#35
wmburke
opened
6 years ago
0
fix line wrap when triple digits are reached under Generate a Model
#34
wmburke
opened
6 years ago
0
Fix link to crawl dashboard for local install
#33
wmburke
opened
6 years ago
2
Add documentation into the wiki
#32
wmburke
closed
6 years ago
0
pagination on search results
#31
wmburke
opened
7 years ago
0
Develop release plan that includes solid testing
#30
wmburke
opened
7 years ago
0
Design interface to include progress reporting
#29
wmburke
opened
7 years ago
1
Establish clear work flow in the interface
#28
wmburke
opened
7 years ago
1
Send alert when the relevancy of a deep crawl changes
#27
wmburke
opened
7 years ago
0
Send alert when a site goes down
#26
wmburke
opened
7 years ago
0
Develop Crawl Alert capability
#25
wmburke
opened
7 years ago
0
Develop Crawl Alert interface
#24
wmburke
closed
7 years ago
1
Develop ability to crawl behind logins
#23
wmburke
opened
7 years ago
0
Enable user to submit info for logging in to a given site
#22
wmburke
opened
7 years ago
0
Figure out a plan
#21
wmburke
opened
7 years ago
3
Recommendation Engine
#20
wmburke
opened
7 years ago
3
Add/remove urls from the deep crawl list
#19
wmburke
opened
7 years ago
1
Adaptive fetch schedule
#18
wmburke
opened
7 years ago
0
Add -uninstall option in kickstart.sh
#17
sujen1412
opened
7 years ago
0
Add --upgrade option to kickstart.sh
#16
sujen1412
opened
7 years ago
0
Add uninstall instructions to the User Guide
#15
wmburke
closed
7 years ago
1
Kickstart.sh should report the urls more accurately
#14
wmburke
opened
7 years ago
0
Enable deep crawling using Sparkler
#13
sujen1412
closed
7 years ago
1
Configure the banana dashboard to show the data from last 10 days
#12
sujen1412
closed
7 years ago
1
Hook up monitoring on Solr and Sparkler
#11
sujen1412
opened
7 years ago
0
User Guide: Output commands section needs to be cleaned up
#10
wmburke
opened
7 years ago
0
User Guide needs a section on kickstart.sh options
#9
wmburke
opened
7 years ago
0
Output from SOLR in Elasticsearch bulk upload format
#8
sujen1412
opened
7 years ago
0
Upgrade CDR script from v3 to 3.1
#7
sujen1412
closed
7 years ago
1
Use bash tee command to redirect output to log file and console
#6
sujen1412
closed
7 years ago
1
Allow for infinite crawling
#5
sujen1412
closed
7 years ago
3
kickstart.sh -l default
#4
wmburke
closed
7 years ago
1
dumper.sh -l default
#3
wmburke
closed
7 years ago
1
sce.sh functionality
#2
wmburke
closed
7 years ago
2
Check for ports already in use
#1
sujen1412
closed
7 years ago
1