migumar2 / libCSD

C++ Library implementing Compressed String Dictionaries
GNU Lesser General Public License v2.1

Reconcile with hdt-cpp implementation. #4

Open donpellegrino opened 2 years ago

donpellegrino commented 2 years ago

The HDT-C++ Repository includes an implementation of Compressed String Dictionaries embedded within its codebase at https://github.com/rdfhdt/hdt-cpp/tree/develop/libhdt/src/libdcs. The HDT-C++ CSD.h file references:

"Compressed String Dictionaries"
Nieves R. Brisaboa, Rodrigo Canovas, Francisco Claude, Miguel A. Martinez-Prieto and Gonzalo Navarro.
10th Symposium on Experimental Algorithms (SEA'2011), p.136-147, 2011.

The somewhat messy state of library use in HDT-C++ is documented at https://github.com/rdfhdt/hdt-cpp/issues/19.

The libCSD README in this repository cites the same journal article as the basis for the implementation.

The article itself references http://libcds.recoded.cl (Wayback Machine) which is now https://github.com/fclaude/libcds.

Is this Martinez-Prieto implementation distinct from the Navarro / Claude implementation? If so, how should I choose between them? Or, is this codebase an old artifact along the lifecycle of one canonical Compressed String Dictionaries implementation?

migumar2 commented 2 years ago

Hi!

I'm not sure what the current state of HDT-C++ is (I can't remember when I wrote the last line of code for it), but I think that it uses an old version of our string dictionaries (https://doi.org/10.1007/978-3-642-20662-7_12).

libCSD improves on this original work, but I think that PFC, which is the default HDT dictionary, is essentially the same. Anyway, libcds must be the same because I never migrated to libcds2, and I think that this project is dead :(.

I'm sorry I can't be of more help.

donpellegrino commented 2 years ago

Thanks.