Plotting of dendrograms for hierarchical clustering and matching colors between the dendrogram clusters and the scatter plot ones.
Ability to select x and y ranges of points to be plotted.
A small fix regarding the scaling of the original data. It is now stored in a separate variable, else the features selection wouldn't work correctly since it needs the original unscaled data.
A small fix regarding the file deleted in 9504886af0bfd6e3e7945ec138a375a63ed42cb0 that was still being imported and would yield this issue :
┌──[user@packing-box]──[/mnt/share]──[experiments/exp-upx]──[improve-visualization|+1…2]──────── ────[172.17.0.4]──[15:21:08]────
$ dataset plot features upx-PE byte_0_after_ep byte_1_after_ep
/home/user/.local/lib/python3.10/site-packages/sklearn/base.py:329: UserWarning: Trying to unpickle estimator DecisionTreeClassifier from version pre-0.18 when using version 1.1.3. This might lead to breaking code or invalid results. Use at your own risk. For more info please refer to:
https://scikit-learn.org/stable/model_persistence.html#security-maintainability-limitations
warnings.warn(
Traceback (most recent call last):
File "/home/user/.opt/tools/dataset", line 3, in <module>
from pbox import *
File "/home/user/.local/lib/python3.10/site-packages/pbox/__init__.py", line 4, in <module>
from .experiment import *
File "/home/user/.local/lib/python3.10/site-packages/pbox/experiment.py", line 9, in <module>
from .learning import *
File "/home/user/.local/lib/python3.10/site-packages/pbox/learning/__init__.py", line 8, in <module>
from .dataset import *
File "/home/user/.local/lib/python3.10/site-packages/pbox/learning/dataset.py", line 8, in <module>
from .plot import *
ModuleNotFoundError: No module named 'pbox.learning.plot'
Note that with this file being deleted, it is currently not possible to plot a dataset.
This PR adds :
Note that with this file being deleted, it is currently not possible to plot a dataset.