Open georgeamccarthy opened 3 years ago
Added a feature to log number of culled proteins.
Future work
- Currently using pandas to shuffle the data. One could use the jina built in .shuffle (see cookbook). However I couldn't get this working properly.
Apparently the shuffle
method is a recent addition: https://github.com/jina-ai/jina/commit/2302e456165810c9d9f8d6df1505a0aabd2edc76
It will work if you upgrade:
pip install --upgrade jina
Great find! TODO
:)
PR type
Purpose
Why?
Feedback required over
Mentions
Future work
References
Legal