issues
search
teragrep
/
dpf_03
Teragrep Tokenizer for Apache Spark
GNU Affero General Public License v3.0
0
stars
4
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
use smallest filtersize for initial buffer
#47
elliVM
opened
1 day ago
1
Investigate and fix possible zero value filter returned from BloomFilterAggregator
#46
elliVM
opened
2 days ago
0
remove unused tokenizer parameter in BloomFilterAggregator
#45
elliVM
closed
2 days ago
0
update javadoc of finish method
#44
elliVM
closed
2 days ago
0
javadoc for finish method does not describe the finish() method purpose in BloomFilterAggregator
#43
kortemik
closed
6 hours ago
0
unused member field tokenizer in BloomFilterAggregator.scala
#42
kortemik
closed
6 hours ago
0
Combine tokenizer classes
#41
elliVM
opened
1 month ago
1
Add coverity badge to README.adoc via shields.io
#40
kortemik
closed
1 month ago
0
Update workflows
#39
elliVM
closed
1 month ago
0
update ci pipeline for central
#38
kortemik
closed
1 month ago
0
Spark 3
#37
elliVM
closed
1 month ago
0
update spark libraries to match the ones on pth_10
#36
kortemik
closed
1 month ago
0
fix disabled test
#35
kortemik
opened
1 month ago
1
Bump org.apache.spark:spark-core_2.12 from 2.4.5 to 3.3.3
#34
dependabot[bot]
closed
1 month ago
3
Add RegexTokenizerUDF
#33
elliVM
closed
1 month ago
1
update tests not to expect entanglement tokens
#32
elliVM
closed
1 month ago
0
avoid entanglement
#31
elliVM
closed
6 months ago
0
Optimize token generation
#30
ronja-ui
closed
6 months ago
0
Add readme from template
#29
elliVM
closed
9 months ago
0
Add readme.adoc
#28
ronja-ui
closed
9 months ago
1
Replace Array(Array(ByteType)) with Array(BinaryType)
#27
eemhu
closed
10 months ago
0
add .github/ISSUE_TEMPLATE/*.md and *.yml to RAT plugin exclusions
#26
eemhu
closed
10 months ago
0
fix remaining references to old Array(Array(ByteType)) instead of Array(BinaryType)
#25
eemhu
closed
10 months ago
0
change BloomFilterAggregator to use Array(BinaryType) instead of Array(Array(ByteType))
#24
eemhu
closed
10 months ago
0
Create config.yml
#23
ronja-ui
closed
11 months ago
0
Update issue templates
#22
ronja-ui
closed
11 months ago
0
use java map
#21
elliVM
closed
11 months ago
0
Fixes to aggregator
#20
elliVM
closed
11 months ago
0
Use estimate spark column to select size for bloom filter
#19
elliVM
closed
11 months ago
0
Bf udf
#18
kortemik
closed
1 year ago
0
change to Array[Byte] return value
#17
kortemik
closed
1 year ago
0
use bloom filters inside buffer
#16
elliVM
closed
1 year ago
2
Use bloom directly in aggregator
#15
StrongestNumber9
closed
1 year ago
1
Max minor tokens
#14
kortemik
closed
1 year ago
0
make maxMinorTokens constructor parametrized
#13
kortemik
closed
1 year ago
0
use blf_01 Token inside buffer
#12
elliVM
closed
1 year ago
1
Change hash map in TokenBuffer to use blf_01.Token instread of string
#11
elliVM
closed
1 year ago
2
Javadoc plugin
#10
elliVM
closed
1 year ago
1
add missing license headers
#9
elliVM
closed
1 year ago
1
Conversion to Scala
#8
elliVM
closed
1 year ago
1
Revert "aggregator returns set instead of list"
#7
kortemik
closed
1 year ago
2
aggregator returns set instead of list
#6
elliVM
closed
1 year ago
0
TokenAggregator does not provide Set but a List
#5
kortemik
closed
1 year ago
0
Removes dependencies settings pom
#4
StrongestNumber9
closed
1 year ago
1
Prepare for maven release - WIP
#3
StrongestNumber9
closed
1 year ago
0
Update junit to 5
#2
StrongestNumber9
opened
1 year ago
0
Initial public release
#1
StrongestNumber9
closed
1 year ago
0