issues
search
BitFunnel
/
mg4j-workbench
Java tools for evaluating BitFunnel performance compared to an mg4j baseline.
GNU Lesser General Public License v3.0
1
stars
2
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Support streams that include multiple segments
#35
jondgoodwin
closed
6 years ago
0
Allow document to use stream id more than once
#34
jondgoodwin
closed
6 years ago
1
utf-8 to utf-16 conversion in ChunkWordReader.next() is incorrect.
#33
MikeHopcroft
opened
7 years ago
0
LuceneQueryProcessor may be querying titles instead of text
#32
MikeHopcroft
opened
7 years ago
0
Chunks generated from Gov2 seem to have lots of duplicate DocId values.
#31
MikeHopcroft
opened
7 years ago
0
Ensure ExperimentalQueryEngine searches both text and titles
#30
MikeHopcroft
opened
7 years ago
0
Crash in ChunkDocument.tryParseStream while attempting to read filtered version of GX229
#29
MikeHopcroft
opened
7 years ago
1
README.md: Update recipes for building from chunks, chunk manifests, exporting, etc.
#28
MikeHopcroft
opened
7 years ago
0
README.md: IntelliJ instructions.
#27
MikeHopcroft
opened
7 years ago
0
IndexExporter.exportIndex() exports text index only.
#26
MikeHopcroft
opened
7 years ago
0
Description of mgj4 index configuration
#25
MikeHopcroft
opened
7 years ago
2
Description of query log preprocessing
#24
MikeHopcroft
opened
7 years ago
0
Use a ConcatenatedDocumentCollection to combine chunk files
#23
MikeHopcroft
opened
7 years ago
0
Figure out whether we can measure the size of the mg4j posting list data structure.
#22
MikeHopcroft
opened
7 years ago
0
Ensure that experimental index is non-positional.
#21
MikeHopcroft
opened
7 years ago
0
Remove the QueryPerformance class.
#20
MikeHopcroft
closed
7 years ago
0
QueryLogRunner.go should write query performance data to a file.
#19
MikeHopcroft
closed
7 years ago
0
QueryLogRunner.main() should have a command line argument for thread count.
#18
MikeHopcroft
closed
7 years ago
1
QueryLogRunner.main() should use SimpleJSAP for command line argument parsing.
#17
MikeHopcroft
closed
7 years ago
0
Conflicting timing measurements
#16
MikeHopcroft
opened
7 years ago
0
Trec terabyte topics contain characters that are illegal in mg4j queres.
#15
MikeHopcroft
closed
7 years ago
2
Index built directly from chunk differs from index built from collection.
#14
MikeHopcroft
closed
7 years ago
3
Archive files from http://mg4j.di.unimi.it/
#13
MikeHopcroft
opened
7 years ago
0
Investigate using wired BitStreamIndexReader/BitStreamHPIndexReader
#12
MikeHopcroft
opened
7 years ago
0
Query processing pipeline should reuse result array
#11
MikeHopcroft
closed
7 years ago
1
GenerateBitFunnelChunks should not hard-code field numbers
#10
MikeHopcroft
opened
7 years ago
1
Refactor GenerateBitFunnelChunks into Main() and run() methods.
#9
MikeHopcroft
opened
7 years ago
0
Index-build needs to filter out documents not in the BitFunnel shard.
#8
MikeHopcroft
opened
7 years ago
2
Collection-building pipeline should be based on gz files.
#7
MikeHopcroft
opened
7 years ago
1
README.md: Author and test step-by-step build-and-run instructions for Windows.
#6
MikeHopcroft
opened
7 years ago
0
README.md: Author and test step-by-step build-and-run instructions for OSX.
#5
MikeHopcroft
opened
7 years ago
0
README.md: Author and test step-by-step build-and-run instructions for Linux.
#4
MikeHopcroft
opened
7 years ago
0
Look into making QueryPerformance multi-threaded.
#3
MikeHopcroft
closed
7 years ago
1
Investigate whether mg4j has a mode that counts matches.
#2
MikeHopcroft
closed
7 years ago
2
Mitigate performance impact of slf4j logging
#1
MikeHopcroft
closed
7 years ago
1