issues
search
codelibs
/
elasticsearch-river-web
Web Crawler for Elasticsearch
Apache License 2.0
234
stars
57
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Improve logger.debug()
#138
deka0106
opened
5 years ago
0
Windows Installer
#137
conradbm
opened
5 years ago
1
How to save an image from page
#135
moorthi07
opened
6 years ago
0
unexpected behavior of robots_txt option
#134
viktor-svirsky
opened
7 years ago
1
riverweb: command not found
#133
aratrika1
opened
7 years ago
0
lastModified header
#132
viktor-svirsky
opened
7 years ago
0
RiverWeb-2.4.0-snapshot connectivity issue
#131
mykola-shulba
opened
7 years ago
1
include_urls doesn't work
#130
viktor-svirsky
opened
7 years ago
3
Nothing happens when I run Riverweb
#129
bducharme
closed
7 years ago
0
Crawler is connecting then disconnecting??
#128
osmanra2
opened
7 years ago
1
Index objects on page instead of entire page
#127
dutchiexl
opened
7 years ago
0
IsArray does not seem to work
#126
dutchiexl
opened
7 years ago
0
How to enable retry on crawler?
#125
lmatt-bit
opened
7 years ago
0
Can use river web with ES 5.0.0 ?
#124
iDongkil
opened
8 years ago
4
Website indexation on AWS Elasticsearch service
#123
femat
opened
8 years ago
2
Website Indexation
#122
hkhail
opened
8 years ago
1
River-web and Cluster Elasticsearch
#121
hkhail
opened
8 years ago
1
Error when i run riverweb
#120
youcefboukersi
opened
8 years ago
3
Disconnected - Connection manager is shutting down
#119
hkhail
closed
8 years ago
5
Problem with news.yahoo.com
#118
rdrgporto
closed
8 years ago
1
None of the configured nodes are available
#117
devmiyax
opened
8 years ago
1
Help with URL patterns
#116
beefwad13
closed
8 years ago
2
Error message when running river-web
#115
marcshep-scribe
opened
8 years ago
4
Failure starting riverweb under Windows Server 2012
#114
ndrwchn
opened
8 years ago
1
Riverweb stops before crawling
#113
SarahBaeriswyl
opened
8 years ago
1
update ES index when the website has been changing
#112
hanasian
opened
8 years ago
1
ES Version And elasticsearch-river-web Version
#111
hanasian
opened
8 years ago
2
Property 'url' changed to 'urls' in version 2.0
#110
neilneyman
opened
8 years ago
1
None of the configured nodes are available
#109
neilneyman
opened
8 years ago
3
The max file size (1804200/1000000 is exceeded
#108
beefwad13
closed
8 years ago
1
Getting this stack trace when running RiverWebTest.java
#107
tulikaMithal
opened
8 years ago
0
EsDataService.java and EsUrlQueueService.java showing error when i import the project into eclipse
#106
tulikaMithal
opened
8 years ago
0
Error: Could not find or load main class org.codelibs.elasticsearch.web.RiverWeb (PC)
#105
osmanra2
opened
9 years ago
5
Could not find or load main class org.codelibs.elasticsearch.web.RiverWeb
#104
aroonseenamurthy
closed
9 years ago
1
Crawl page immediately when page is updated
#103
audunru
opened
9 years ago
0
Ignoring already stored URL's
#102
Choumy
opened
9 years ago
1
File System Crawling
#101
ln-lv
opened
9 years ago
7
Schema.org facetting
#100
marvink
opened
9 years ago
0
URL with Parameters
#99
marvink
closed
9 years ago
1
Retrieve elasticsearch info from properties file
#98
marevol
closed
9 years ago
0
JDK-7 ? or must use 8?
#97
jbardu
opened
9 years ago
1
ExcludeFilters are sometimes ignored
#96
LeNightHawk
opened
9 years ago
4
Improve a log message for skipping scraping
#95
marevol
closed
9 years ago
0
Riverweb stops after indexing < 200 pages
#94
neilneyman
closed
9 years ago
6
Need to poll data from crawlingUrlQueue if over maxCrawlingQueueSize
#93
marevol
closed
9 years ago
0
"target.pattern" does not work
#92
marevol
closed
9 years ago
0
Use version info from pom.xml
#91
marevol
closed
9 years ago
0
Use javascript as a default lang
#90
marevol
closed
9 years ago
0
Refactoring for overwrite/incremental options
#89
marevol
closed
9 years ago
0
Create mappings for S2Robot if they does not exist
#88
marevol
closed
9 years ago
0
Next