issues
search
CASM-Consulting
/
springcrawler
Apache License 2.0
0
stars
0
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Replace `tag-dist`
#61
casm-tom
opened
1 month ago
0
Bump ch.qos.logback:logback-classic from 1.2.3 to 1.2.13
#60
dependabot[bot]
opened
10 months ago
0
Bump ch.qos.logback:logback-core from 1.2.3 to 1.2.13
#59
dependabot[bot]
opened
10 months ago
0
Bump ch.qos.logback:logback-classic from 1.2.3 to 1.3.12
#58
dependabot[bot]
closed
10 months ago
1
Bump ch.qos.logback:logback-core from 1.2.3 to 1.3.12
#57
dependabot[bot]
closed
10 months ago
1
Bump spring-boot-starter-web from 2.1.5.RELEASE to 2.5.12
#56
dependabot[bot]
opened
2 years ago
0
link using -P doesn't import
#55
andehr
closed
3 years ago
1
unlink all
#54
andehr
closed
3 years ago
0
fix NL parser bug and change the behaviour of the delete command
#53
Punchwes
closed
3 years ago
0
Natural Language Parser Bug
#52
Punchwes
closed
3 years ago
0
[READY] fix-links
#51
simonwibberley
closed
3 years ago
0
Kill command for shell runner
#50
andehr
opened
3 years ago
0
deprecate old scheduler; make option job part of jobrunner interface
#49
andehr
closed
3 years ago
0
import-list shell command
#48
simonwibberley
closed
3 years ago
0
DontRecrawlResolver change & crawl-off-domain option
#47
andehr
closed
3 years ago
0
allow natural language date parser uses timezone property from source
#46
Punchwes
closed
3 years ago
2
[READY] Report Run IDs now source id + timestamp
#45
andehr
closed
3 years ago
0
[READY] Support multi-sourcelist crawling.
#44
andehr
closed
3 years ago
0
Prerequisite for multi-sourcelist crawling: article per match
#43
andehr
opened
3 years ago
0
Augment check-reports to deal with crawlers that didn't run
#42
andehr
opened
3 years ago
0
[READY] Re-scrape console command
#41
andehr
closed
3 years ago
3
Console command to run scrape without crawl on existing raw HTML
#40
andehr
opened
3 years ago
0
Design how we assign business keys to articles from crawler
#39
andehr
opened
3 years ago
0
Ensure that when JQM payloads fail, JQM counts them as crashed
#38
andehr
opened
3 years ago
0
Console command for getting random sample of articles from each source
#37
andehr
opened
3 years ago
0
check-reports formatting improvements
#36
andehr
opened
3 years ago
0
fix arg spec conflict
#35
andehr
closed
3 years ago
0
Dump command needs fix to arg spec
#34
andehr
closed
3 years ago
0
[WIP] codebase tidy/refactor
#33
Punchwes
closed
3 years ago
0
[READY] Email checklisting
#32
andehr
closed
3 years ago
0
Add option to have checklist command send email when something isn't passing.
#31
andehr
closed
3 years ago
0
checklist command should be sensitive to whether sitemaps enabled
#30
andehr
closed
3 years ago
0
add check-delay mechanism, check the robots.txt first
#29
Punchwes
closed
3 years ago
0
[READY] Reporting v2
#28
andehr
closed
3 years ago
0
cralwer / scraper behaviour monitoring
#27
simonwibberley
opened
3 years ago
1
batch link and unlink sources from CSV
#26
andehr
closed
3 years ago
0
source specific get jobs
#25
andehr
closed
3 years ago
0
add unique url filtering
#24
Punchwes
closed
3 years ago
0
Batch Update shell command needs to support List values
#23
andehr
opened
3 years ago
0
Shell management with admin commands
#22
Punchwes
closed
3 years ago
0
RSS Sitemaps
#21
simonwibberley
closed
3 years ago
1
[WIP] ACLEDScraper to support source rules
#20
Punchwes
closed
3 years ago
0
Further admin console commands
#19
simonwibberley
closed
3 years ago
1
[READY] Fixes issue where sitemaps were being missed
#18
simonwibberley
closed
4 years ago
0
[READY] Link & Unlink source - source list using CLI - Include dateParsing to table
#17
andehr
closed
4 years ago
2
[WIP] add shell management
#16
Punchwes
closed
4 years ago
0
[READY] Scheduler-related fixes
#15
andehr
closed
4 years ago
0
[WIP] Norconex handlers (transformer, tagger, filters) (same as scraper-add-one except this one has ACLEDScraper without support of user input selector)
#14
Punchwes
closed
3 years ago
2
predefined sitemaps, sitemap discovery optional, importing/exporting csv…
#13
andehr
closed
4 years ago
0
Decouple Scheduler and implement handling functions
#12
Punchwes
closed
4 years ago
0
Next