the parallelization in preprocessing_utils is extremely slow for the sequence representation, because this function gets called once for every sequence. For this case, the parallelization code is wrong, because we should be parallelizing the processing of each sequence, not parallelizaing for each context in a single sequence.
the parallelization in preprocessing_utils is extremely slow for the sequence representation, because this function gets called once for every sequence. For this case, the parallelization code is wrong, because we should be parallelizing the processing of each sequence, not parallelizaing for each context in a single sequence.