RichJackson/cogstack
Database - Elasticsearch realtime mapping. With NLP goodiness.
Apache License 2.0
7 stars, 2 forks
issues
Dockerize libre office
#69
hkkenneth
closed
7 years ago
0
Add options to write a dummy placeholder for documents that failed processor step
#68
hkkenneth
closed
7 years ago
0
Add options to write a dummy placeholder for documents that failed processor step
#67
hkkenneth
closed
7 years ago
1
Move PDF / thumbnail file writer to processor
#66
hkkenneth
closed
7 years ago
0
Misc fixes
#65
hkkenneth
closed
7 years ago
0
RichJackson/cogstack#54 close elasticsearch REST service with listener instead of relying on @PreDestroy
#64
hkkenneth
closed
7 years ago
0
Prevent exceptions in GATE blocking CogStack pipeline
#63
hkkenneth
closed
7 years ago
0
WIP - Comments needed - Modular/plugins approach for new item processors
#62
hkkenneth
opened
7 years ago
0
Improve logging in ElasticsearchRestDocumentWriter, ESRestService
#61
hkkenneth
closed
7 years ago
0
Improve log clarity in GATE processor
#60
hkkenneth
closed
7 years ago
0
Fix default scheduler.rate to follow cron syntax
#59
hkkenneth
closed
7 years ago
0
[BUG] webservice.fieldsToSendToWebservice does not support multiple fields
#58
hkkenneth
opened
7 years ago
0
WIP PDF form parsing with PDF Box
#57
hkkenneth
closed
7 years ago
1
Code clean up - remove import for CleanupBean when not needed
#56
hkkenneth
closed
7 years ago
0
Add newline after log messages in STDOUT
#55
hkkenneth
closed
7 years ago
0
[Investigation needed] CogStack process does not stop when elasticsearchRest profile is active
#54
hkkenneth
closed
7 years ago
2
fieldsToGate supports multiple fields as map
#53
hkkenneth
closed
7 years ago
0
Fix property defaults
#52
hkkenneth
closed
7 years ago
0
Remove targetDataSource and sourceDataSource from SingleJobLauncher
#51
hkkenneth
closed
7 years ago
0
Partitioner: correct timestamp name to partition
#50
hkkenneth
closed
7 years ago
0
Error message when scheduler.useScheduling=true
#49
hkkenneth
closed
7 years ago
2
Remove targetDataSource and sourceDataSource from SingleJobLauncher
#48
hkkenneth
closed
7 years ago
0
Remove deprecated primaryKeyAndTimeStampPartition
#47
hkkenneth
opened
7 years ago
0
Add CircleCI integration environment
#46
RichJackson
opened
7 years ago
0
Rename jsonFileWriter profile to jsonFileItemWriter
#45
hkkenneth
closed
7 years ago
0
Config cleanup
#44
hkkenneth
closed
7 years ago
0
WIP - Support arbitrary parameter for SQL INSERT statement for jdbc_out
#43
hkkenneth
closed
7 years ago
2
[Suggestion] Rename jsonFileWriter profile to jsonFileItemWriter
#42
hkkenneth
closed
7 years ago
1
Issue with nanoseconds in java.sql.Timestamp
#41
RichJackson
opened
7 years ago
0
gradle-wrapper.jar should be checked in to Git
#40
hkkenneth
closed
7 years ago
1
Biolark Download site currently unavailable. Building the biolark container will fail until it's back up
#39
RichJackson
closed
7 years ago
0
Add support for PDF Form Parsing
#38
hkkenneth
closed
7 years ago
0
buildBiolarkContainer fails on step 4/10
#37
cdlangen
opened
7 years ago
1
[Discussion] Tika 1.14 update and OCR of PDF
#36
hkkenneth
opened
7 years ago
4
DOCMAN Feature
#35
RichJackson
closed
7 years ago
2
ES Java API will be deprecated, as it may cause cluster instability
#34
RichJackson
closed
7 years ago
1
[Improvement] Add options to write a dummy placeholder for documents that failed processor step
#33
hkkenneth
opened
7 years ago
1
[Improvement] Move PDF / thumbnail file writer to processor
#32
hkkenneth
closed
7 years ago
0
Previous pull request + various bug fixes + improved logging
#31
hkkenneth
closed
7 years ago
1
[BUG] OCR cannot handle multiple page PDF
#30
hkkenneth
closed
7 years ago
0
[BUG] fieldsToBioLark configuration does not work for upper case character
#29
hkkenneth
closed
7 years ago
0
[BUG] fieldsToGate does not support multiple fields
#28
hkkenneth
closed
7 years ago
0
[BUG] Error in GATE apps will block the whole pipeline
#27
hkkenneth
closed
7 years ago
1
[Improvement] Improve logging and track time spent on GATE per document
#26
hkkenneth
closed
7 years ago
0
[BUG] gateAnnotationSets not respected in convertDocToJSON
#25
hkkenneth
closed
7 years ago
0
[BUG] fieldsToGate configuration does not work for upper case character
#24
hkkenneth
closed
7 years ago
0
Extract page count, Timing for tika/pdf/thumbnail generation, Improve error handling and logging in PDFFileItemWriter
#23
hkkenneth
closed
7 years ago
3
fix issue where configured start doesn't work if there's already a completed job in the repository
#22
RichJackson
closed
7 years ago
0
Support writing thumbnail images to static files
#21
hkkenneth
closed
7 years ago
0
Support writing PDF content to static files
#20
hkkenneth
closed
7 years ago
0