committer-solr Search Results

199 results
for committer-solr

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

Norconex/crawlers #55

Text from PDF, DOC, etc files

Since it is not unusual that such types of files don't have title, author, subject, etc., I'm wondering if there is a way of capturing about (say) 100 characters or so from the beginning of the docume…

csaezl updated 9 years ago
15
Norconex/crawlers #98

Documents removed after read time-out.

Is it possible to only remove documents with 404 status code? (and also log the broken link)

OkkeKlein updated 9 years ago
14
Norconex/crawlers #56

collector.referrer-link-text field not filled

Hi, I'm trying to gater information about links: the text near che anchor. I'm using: norconex-collector-http-2.0.2.zip with openjdk-7 I have this definition: ``` text/htm…

MirtoBusico updated 9 years ago
11
Norconex/importer #4

Irregular behaviour of TextBetweenTagger

I'm using TextBetweenTagger in order to acquire HTML code from crawled pages. The configuration looks like: ``` ^.* .*$ ``` However, this has pu…

Betongsuggan updated 9 years ago
6
Norconex/crawlers #42

Java application for crawling purpose with collector-http

Hi, I need to use collector-http to get data from several sites which fulfill some regular expression and store them in a database via Java application. Is this possible with collector-http, and how …

comcrawler updated 9 years ago
4
samvera-deprecated/sufia #1358

Failed attempt to run tests

Lots of 404s? No idea what's going on but @mjgiarlo told me to create a ticket. Here is all the output: [aheadley actual-sufia 13:53:21]$ bundle exec rake spec Running RuboCop... Inspecting 441 files…

hackartisan updated 8 years ago
14
Norconex/committer-core #5

Extracting fields from metadata

At committer.commitBatch() function I try to get page's content for database writing. ``` java public class CustomCommitter extends AbstractMappedCommitter { ... @Override protected void comm…

AntonioAmore updated 10 years ago
6
Norconex/committer-core #3

Custom MySQL commiter implementation

Hello! I write my own committer implementation to put collected pages into MySQL database. As an example I've taken SolrCommiter - is it a right decision? So I inherited from AbstractMappedCommitt…

AntonioAmore updated 10 years ago
5
Norconex/committer-core #1

Committer never cleans up created folders in comitter-queue

After running continuously for quite some time on Windows, the committer will have created a lot of folders (more than 500 000 in my case). This is extremely performance degrading on Windows. Curren…

Nycander updated 10 years ago
3
Norconex/crawlers #27

Tune up collector-http for specific features

I would like to know may collector-http fit requirements for the following task (I appreciate your precious time, and read documentation first, but haven't found some nuances): A set of hundreds web …

AntonioAmore updated 10 years ago
12

上一页 1...14 15 16 17 18 19 20...20 下一页

199 results for committer-solr

199 results
for committer-solr