issues
search
BayanGroup
/
nutch-custom-search
65
stars
34
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Need help with extracting nested HTML content.
#45
Amaresh
opened
5 years ago
0
README config instructions for Mode 1, new binaries?
#44
ferrerod
opened
6 years ago
0
how to split a list of strings separated by a specific character?? is there any thing like a split function or some work around for it?
#43
m2ai
opened
6 years ago
0
Limit the extraction of outlinks
#42
wiradikusuma
closed
6 years ago
1
Nutch 2.x support
#41
tomchiverton
opened
7 years ago
12
Fix PersianDateConverter
#40
mahshad
closed
7 years ago
0
Fix PersianDateConverter
#39
mahshad
closed
7 years ago
0
Update README.md
#38
mahshad
closed
7 years ago
0
Missing LICENSE
#37
nicobrevin
closed
8 years ago
1
Not indexing data in solr
#36
aakashkag
closed
8 years ago
0
Using fragment on xml documents
#35
rohith004
closed
8 years ago
0
Using fragment on xml documents
#34
rohith004
opened
8 years ago
0
Ignore load-external-dtd declaration in xml
#33
nithingit
closed
7 years ago
1
Css :not pseudo-class doesn't work
#32
AndraIonescu
closed
7 years ago
1
Plugin doesn't work on Linux
#31
rodrigomagnoss
closed
8 years ago
0
Plugin doesn't work
#30
AndraIonescu
closed
7 years ago
3
Concatenate 2 fields into one
#29
AndraDenis
closed
7 years ago
4
Added missing TestConverter class to fix build
#28
matt-deboer
closed
9 years ago
1
How do I get nutch to crawl outlinks only and not the urls for each fragment?
#27
manjunathbharadwaj
closed
9 years ago
2
cannot testUrl
#26
virivigio
closed
9 years ago
4
Unlike in all examples, I am asked to explicitly declare a "url" field in my extractors.xml where I want to use fragments
#25
manjunathbharadwaj
closed
9 years ago
2
issue when xpath expression contains element index with more than one digit
#24
moees
opened
9 years ago
0
Images
#23
galvinm
closed
9 years ago
3
What subset of Jsoup css selector is supported?
#22
ChanderG
closed
9 years ago
5
Add indexAs property to Field and use it in nutch.ExtractorIndexingFilter
#21
richard-lund
closed
9 years ago
0
Add multiple-match mode to support easier tagging of content based on url
#20
richard-lund
closed
9 years ago
0
Parsing Javascript script tags
#19
raisindetre
closed
9 years ago
3
Bug in document inheritance?
#18
raisindetre
closed
9 years ago
3
Using for-each in an HTML Page.
#17
jaychakra
closed
9 years ago
3
Extract from PDF
#16
rugbymauri
closed
9 years ago
1
NPE trying to index
#15
dmnt3rr0r
closed
9 years ago
2
Return img alt and text
#14
phranq
closed
9 years ago
8
corrupt distribution zip
#13
rugbymauri
closed
9 years ago
3
Conditional indexing or following
#12
tahagh
opened
9 years ago
4
error when using fragment option
#11
moees
closed
9 years ago
6
sitemap protocol
#10
paulescom
closed
9 years ago
1
extraction with xpath engin
#9
moees
closed
9 years ago
8
How to use this
#8
jayasreemca
closed
9 years ago
1
Compile plugin as .job file
#7
arkka
opened
10 years ago
1
Fix plugin version
#6
mathieubouchard
closed
10 years ago
1
Implement conditional resource matching
#5
tahagh
opened
10 years ago
0
Do not join multiple values for multi-valued fields
#4
floschmedding
closed
10 years ago
2
Error: Unsupported major.minor version 51.0
#3
pepeabel
closed
10 years ago
5
TruncateTest.testBreak() seems to be wrong
#2
floschmedding
closed
10 years ago
3
Unable to index documents with Tika
#1
wwhurley
closed
10 years ago
3