packing-box / docker-packing-box

Docker image gathering packers and tools for making datasets of packed executables and training machine learning models for packing detection
GNU General Public License v3.0

Visualization Error : Number of manager items must equal union of block items #138

Closed: jramhani closed this issue 1 month ago

jramhani commented 2 months ago

I get this error on some datasets.

Example: dataset plot infogain datasetname1 --max-features 10

00:00:02.800 [INFO] Computing features...
  0% ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━   0/131 samples • 0:00:00 • -:--:-- • datasetname1
Process ForkPoolWorker-1:
Traceback (most recent call last):
  File "/usr/lib/python3.11/multiprocessing/process.py", line 314, in _bootstrap
    self.run()
  File "/usr/lib/python3.11/multiprocessing/process.py", line 108, in run
    self._target(*self._args, **self._kwargs)
  File "/usr/lib/python3.11/multiprocessing/pool.py", line 114, in worker
    task = get()
           ^^^^^
  File "/usr/lib/python3.11/multiprocessing/queues.py", line 367, in get
    return _ForkingPickler.loads(res)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/user/.local/lib/python3.11/site-packages/pandas/core/internals/managers.py", line 934, in __init__
    self._verify_integrity()
  File "/home/user/.local/lib/python3.11/site-packages/pandas/core/internals/managers.py", line 943, in _verify_integrity
    raise AssertionError(
AssertionError: Number of manager items must equal union of block items
# manager items: 158, # tot_items: 157
 99% ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╸ 130/131 samples • 0:00:12 • 0:00:01 • datasetname1
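
The traceback above is raised while a pool worker unpickles its task, so one way to narrow it down might be to repeat the pickle round trip in the parent process, outside the pool. The sketch below is only a diagnostic aid under stated assumptions: `roundtrip_error` and `duplicate_names` are hypothetical helpers, not part of docker-packing-box or pandas, and checking for repeated column names is just one guess at why the manager could count 158 items against 157 block items. The issue itself does not confirm the cause.

```python
import pickle
from collections import Counter


def roundtrip_error(obj):
    """Pickle and unpickle obj in-process; return the exception raised, or None.

    Hypothetical helper: multiprocessing sends tasks to workers with a
    pickle-based serializer, so a failure here may mirror the worker failure.
    """
    try:
        pickle.loads(pickle.dumps(obj, protocol=pickle.HIGHEST_PROTOCOL))
    except Exception as exc:  # AssertionError is an Exception subclass
        return exc
    return None


def duplicate_names(names):
    """Return the names that occur more than once, in first-seen order."""
    counts = Counter(names)
    return [name for name, n in counts.items() if n > 1]
```

Assuming `df` is the features DataFrame that gets handed to the pool (a name chosen here for illustration), `roundtrip_error(df)` would show whether the object fails to round-trip on its own, and `duplicate_names(list(df.columns))` would list any repeated feature names.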

Output of dataset show datasetname1:

Dataset characteristics

 • #Executables: 131
 • Format(s):    PE32, PE64
 • Packer(s):    upx
 • Size:         40MB
 • Labelled:     100.00%
 • Altered:      100.00%
 • Alterations:  rename_packer_sections
 • With files:   yes
 <<snipped>>