VowpalWabbit / vowpal_wabbit

Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learning.
https://vowpalwabbit.org

pre-loaded feature dictionary in VW #4500

Open olgavrou opened 1 year ago

olgavrou commented 1 year ago

Short description

The motivating idea here is an action dictionary/catalog for Vowpal Wabbit.

Feature dictionaries can be loaded up front, mapping each id to its parsed features (we only care about the features, not the labels). Incoming examples can then reference an id instead of including the entire feature string, so we avoid parsing the same example strings many times.
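To make the idea concrete, here is a minimal, self-contained sketch of an id-to-parsed-features dictionary. It does not use VW's parser or types: `parsed_features`, `toy_parser`, and `feature_dictionary` are all hypothetical names invented for illustration, and the "name:value" text format is a simplification. The point is only that each string is parsed once at load time and later lookups by id do no parsing.

```cpp
#include <cassert>
#include <cstdint>
#include <functional>
#include <sstream>
#include <string>
#include <unordered_map>
#include <utility>
#include <vector>

// Hypothetical stand-in for parsed features: (hashed name, value) pairs.
struct parsed_features {
  std::vector<std::pair<uint64_t, float>> items;
};

// Toy parser for whitespace-separated "name" or "name:value" tokens.
// This is NOT VW's text parser. It counts calls so that the saved work is observable.
struct toy_parser {
  int calls = 0;

  parsed_features parse(const std::string& text) {
    ++calls;
    parsed_features out;
    std::istringstream in(text);
    std::string token;
    while (in >> token) {
      const auto colon = token.find(':');
      const std::string name = token.substr(0, colon);
      // A token without an explicit value gets the default value 1.
      const float value =
          colon == std::string::npos ? 1.0f : std::stof(token.substr(colon + 1));
      out.items.emplace_back(std::hash<std::string>{}(name), value);
    }
    return out;
  }
};

// Hypothetical dictionary: parse each entry once on load, then serve it by id.
class feature_dictionary {
public:
  void load(const std::string& id, const std::string& text, toy_parser& parser) {
    assert(!id.empty());
    entries_[id] = parser.parse(text);
  }

  // Returns nullptr for an unknown id; never triggers parsing.
  const parsed_features* find(const std::string& id) const {
    const auto it = entries_.find(id);
    return it == entries_.end() ? nullptr : &it->second;
  }

private:
  std::unordered_map<std::string, parsed_features> entries_;
};
```

With this shape, an incoming example would carry only an id such as "action_1" (a made-up id), and the learner would fetch the already-parsed features via `find`. How ids would be expressed in VW's input formats is left open here.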

Possible solution/implementation details

The idea is to

* VW::example_feature is a new struct that holds the VW::v_array<namespace_index> indices and the std::array<features, NUM_NAMESPACES> feature_space, which together carry the full information about an example's features, plus potentially other information needed for feature counting
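A rough sketch of what such a struct might look like, written without VW headers so it compiles on its own. Everything here is an assumption for illustration: `std::vector` stands in for `VW::v_array`, `features` is a simplified stand-in for VW's type of the same name, `NUM_NAMESPACES` is assumed to be 256, and the `num_features` field and both helper functions are hypothetical guesses at the "other information needed for feature counting".

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

using namespace_index = unsigned char;       // stand-in for VW's namespace_index
constexpr std::size_t NUM_NAMESPACES = 256;  // assumed: one slot per namespace_index value

// Simplified stand-in for VW's features type: parallel value/index arrays.
struct features {
  std::vector<float> values;
  std::vector<uint64_t> indices;

  void push_back(float value, uint64_t index) {
    values.push_back(value);
    indices.push_back(index);
  }
  std::size_t size() const { return values.size(); }
};

// Sketch of the proposed struct: just the feature-related parts of an example.
struct example_feature {
  std::vector<namespace_index> indices;                // namespaces that are in use
  std::array<features, NUM_NAMESPACES> feature_space;  // features per namespace
  std::size_t num_features = 0;                        // hypothetical cached count
};

// Hypothetical helper: add one feature, registering the namespace on first use.
inline void add_feature(example_feature& ef, namespace_index ns, uint64_t index, float value) {
  if (ef.feature_space[ns].size() == 0) { ef.indices.push_back(ns); }
  ef.feature_space[ns].push_back(value, index);
  ++ef.num_features;
}

// Hypothetical helper: recount features by walking only the namespaces in use.
inline std::size_t count_features(const example_feature& ef) {
  std::size_t total = 0;
  for (const namespace_index ns : ef.indices) { total += ef.feature_space[ns].size(); }
  assert(total == ef.num_features);  // the cached count should agree with the recount
  return total;
}
```

A dictionary entry could then be one `example_feature` per id, which would be copied or referenced when an incoming example names that id. Whether to copy or share the stored features is a design choice the issue does not settle.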

Other things to consider

RohitRathore1 commented 1 year ago

@olgavrou Is this issue open for anyone to pick up, or is your team going to work on it?