Open MLFlexer opened 1 year ago
I have the same problem. @mrphilroth is this a common problem?
i have the same problem , if you have fix it please tell me how
i have the same problem , if you have fix it please tell me how
I have not been able to find a fix for this yet, although I have not spent a lot of time on this
i had the same issue, downgraded python to 3.6 in my environment, worked like charm.
A way to fix it is to replace:
entry_name_hashed = FeatureHasher(50, input_type="string").transform([raw_obj['entry']]).toarray()[0]
with:
entry_name_hashed = FeatureHasher(50, input_type="string").transform([ [raw_obj['entry']] ]).toarray()[0]
in features.py
at line 192. In this way an iterable over iterable over raw features is obtained, as transform()
method require.
Same problem. I started a fork to curate this repo. Also my PR #108 fixes the issue
downgrade to py3.6will easily solve
A way to fix it is to replace: entry_name_hashed = FeatureHasher(50, input_type="string").transform([raw_obj['entry']]).toarray()[0] with: entry_name_hashed = FeatureHasher(50, input_type="string").transform([ [raw_obj['entry']] ]).toarray()[0]
in features.py at line 192. In this way an iterable over iterable over raw features is obtained, as transform() method require.
Can anyone provide any insight on what the intended output for the entry name hash table is supposed to be?
Using it the way it's written with Python3.6 or earlier, the FeatureHasher hashes each character in the entry string. For example, if .text
is the entry point, there are 4 bins populated in the returned hash table.
Using the fixed version, the FeatureHasher hashes the entire string, so an entry point string of .text
will return a hash table with only one bin populated.
In the grand scheme of the model, I don't know if either way has much of an impact, but it would be good to know if the authors intended the hash table to be one way or the other.
I have problems running the following commands in python:
I have installed the dependencies and tried on docker with leif versions 0.9.0, 0.10.1 and i still get the same failure:
I seems from the error msg, that the input is not the same format as expected in the vectorizor? Any fix to this?