USCDataScience/sparkler
Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.
http://irds.usc.edu/sparkler/
Apache License 2.0
412 stars
143 forks
issues
Broken run script
#219
buggtb
closed
3 years ago
4
Elasticsearch for Sparkler - Factory Design Pattern
#218
slhsxcmy
closed
3 years ago
1
WIP: ISSUE-215 Elasticsearch for Sparkler - Maven Profiles
#217
KilometersFan
closed
2 years ago
3
Config File and StorageProxy Abstractions
#216
Kefaun2601
closed
3 years ago
7
Elasticsearch for Sparkler - Maven Profiles
#215
KilometersFan
opened
3 years ago
0
Elasticsearch for Sparkler - Containerization Logic
#214
nhandyal
closed
3 years ago
7
[docker] docker dev environment
#213
nhandyal
closed
3 years ago
1
Elasticsearch for Sparkler - Containerization Logic
#212
felixloesing
closed
3 years ago
4
Elasticsearch for Sparkler - Command Line Configuration
#211
Kefaun2601
closed
3 years ago
2
Bump netty-codec-http from 4.0.15.Final to 4.1.59.Final in /sparkler-core/sparkler-plugins/fetcher-chrome
#210
dependabot[bot]
closed
3 years ago
0
Sparkler Elasticsearch storage engine
#209
lewismc
closed
3 years ago
10
Bump axios from 0.18.0 to 0.21.1 in /sparkler-ui
#208
dependabot[bot]
closed
3 years ago
0
URL Injector/Config override and bug fixes
#207
buggtb
closed
3 years ago
0
Bump junit from 4.12 to 4.13.1 in /sparkler-core
#206
dependabot[bot]
closed
3 years ago
0
Changes so sparkler can be launched inside of a Databricks cluster
#205
mattvryan-github
closed
3 years ago
2
Sparkler cannot be executed on Databricks because sparkContext not pulled from sparkSession
#204
mattvryan-github
closed
3 years ago
0
Investigate pipeline frameworks
#203
buggtb
opened
3 years ago
1
Update CI so users can download built Sparkler package
#202
buggtb
opened
3 years ago
0
Update CI to deal with releases/tagging of images/artifacts
#201
buggtb
closed
3 years ago
1
Arm support
#200
buggtb
opened
3 years ago
0
Fix preview performance issues
#199
buggtb
opened
3 years ago
0
Improve deployments for different architectures
#198
buggtb
opened
3 years ago
0
Improve plugins
#197
buggtb
opened
3 years ago
0
Make storage engine pluggable
#196
buggtb
closed
3 years ago
2
Enable pagination in SCE
#195
buggtb
opened
3 years ago
0
Fix basic SCE deployment
#194
buggtb
opened
3 years ago
0
Bump jackson-databind from 2.6.5 to 2.9.10.5 in /sparkler-core/sparkler-app
#193
dependabot[bot]
closed
3 years ago
1
Bump jackson-databind from 2.6.5 to 2.9.10.5 in /core/sparkler-app
#192
dependabot[bot]
closed
3 years ago
4
update spark 3.0.1 scala 2.12
#191
thammegowda
closed
3 years ago
1
Update to Spark 3.x , scala 2.12.x
#190
thammegowda
closed
3 years ago
1
May I know How it works if we submit job in spark cluster
#189
ravituduru
opened
3 years ago
0
Dashboard for banana
#188
ravituduru
opened
3 years ago
1
silly question
#187
vwoloszyn
closed
3 years ago
2
Is there any benchmark Sparkler versus Nutch?
#186
MobinRanjbar
closed
3 years ago
2
Argument '-i -1' does not work.
#185
MobinRanjbar
opened
4 years ago
4
Move Sparkler to sbt build
#184
karanjeets
opened
4 years ago
3
Bump htmlunit from 2.26 to 2.37.0 in /sparkler-plugins/fetcher-htmlunit
#183
dependabot[bot]
closed
3 years ago
1
Bump jackson-databind from 2.6.5 to 2.9.10.4 in /sparkler-app
#182
dependabot[bot]
closed
3 years ago
5
Add fetcher-default as a plugin
#181
balashashanka
opened
4 years ago
4
Modified Readme for crawling seed-urls.txt
#180
amirhosf
closed
4 years ago
1
Bump jackson-databind from 2.6.5 to 2.9.10.3 in /sparkler-app
#179
dependabot[bot]
closed
4 years ago
2
Updates kubernetes deployment config and containers for v1.14 and v1.15
#178
RyanStonebraker
closed
4 years ago
1
Newbie question
#177
chrome83
closed
4 years ago
4
Not an issue
#176
chaitra-rs
closed
4 years ago
1
Always use the latest docker image
#175
prowave
closed
4 years ago
1
data from JS pages is not returned
#174
chaitra-rs
closed
4 years ago
1
push chrome fetcher code
#173
buggtb
closed
5 years ago
1
Fix mvn build
#172
buggtb
closed
5 years ago
0
Quiet build
#171
thammegowda
closed
5 years ago
0
Sparkler on K8S updates
#170
buggtb
closed
5 years ago
0