davidread closed this 8 years ago
Improved stats after running the tool. The report now excludes deleted resources/packages, so the figures are much better:
file size       no. files   files size (bytes)
<1 KB               6,348            3,179,212
1-10 KB            21,193           91,875,918
10-100 KB          52,340        1,808,154,330
100 KB - 1 MB      14,078        4,502,765,524
1-10 MB             5,597       19,602,547,888
10-100 MB           1,721       71,469,455,696
100 MB - 1 GB         426      115,872,674,171
1-10 GB                88      133,191,097,679
10-100 GB               0                    0
>100 GB                 0                    0
Totals:           101,791      346,541,750,418
(ckan)co@prod3 ~ () $ df -h /media/hulk/
Filesystem      Size  Used Avail Use% Mounted on
/dev/sdc1       459G  444G   15G  97% /media/hulk
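For anyone wanting to reproduce a table like the one above outside the tool, here is a minimal sketch of the banding logic. It is not the archiver's actual code: the names `BANDS` and `size_report` are made up for illustration, and the band edges assume 1 KB = 1024 bytes, which may not match what `size-report` uses.

```python
# Hypothetical sketch of a size-band report; not taken from ckanext-archiver.
# Assumption: band edges are powers of 1024 (1 KB = 1024 bytes).
KB, MB, GB = 1024, 1024 ** 2, 1024 ** 3

# (label, inclusive lower bound, exclusive upper bound or None for open-ended)
BANDS = [
    ("<1 KB", 0, KB),
    ("1-10 KB", KB, 10 * KB),
    ("10-100 KB", 10 * KB, 100 * KB),
    ("100 KB - 1 MB", 100 * KB, MB),
    ("1-10 MB", MB, 10 * MB),
    ("10-100 MB", 10 * MB, 100 * MB),
    ("100 MB - 1 GB", 100 * MB, GB),
    ("1-10 GB", GB, 10 * GB),
    ("10-100 GB", 10 * GB, 100 * GB),
    (">100 GB", 100 * GB, None),
]


def size_report(sizes):
    """Return {label: (file_count, total_bytes)} for an iterable of byte sizes."""
    report = {label: (0, 0) for label, _, _ in BANDS}
    for size in sizes:
        for label, low, high in BANDS:
            if size >= low and (high is None or size < high):
                count, total = report[label]
                report[label] = (count + 1, total + size)
                break
    return report


if __name__ == "__main__":
    # Made-up sizes, only to show the output shape.
    sample = [500, 2048, 5 * MB, 5 * MB, 3 * GB]
    for label, (count, total) in size_report(sample).items():
        print("{:<16}{:>9,}   {:>18,}".format(label, count, total))
```

Feeding it the sizes of only the non-deleted resources would correspond to the "excludes deleted resources/packages" behaviour described above.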
Changed the max file size from 2GB to 1GB, and that increased free disk space from 3% to 27%, giving a greater margin.
(ckan)co@prod3 /vagrant/src/ckanext-archiver (master) $ paster --plugin=ckanext-archiver archiver size-report
2016-05-11 17:37:04,153 DEBUG [ckanext.spatial.model.package_extent] Spatial tables defined in memory
2016-05-11 17:37:04,159 DEBUG [ckanext.spatial.model.package_extent] Spatial tables already exist
2016-05-11 17:37:04,176 DEBUG [ckanext.harvest.model] Harvest tables defined in memory
2016-05-11 17:37:04,179 DEBUG [ckanext.harvest.model] Harvest tables already exist
file size       no. files   files size (bytes)
<1 KB               6,399            3,199,988
1-10 KB            21,708           96,046,298
10-100 KB          52,670        1,822,751,526
100 KB - 1 MB      14,179        4,557,149,644
1-10 MB             5,631       19,744,994,609
10-100 MB           1,753       72,283,320,289
100 MB - 1 GB         423      113,423,795,153
1-10 GB                 0                    0
10-100 GB               0                    0
>100 GB                 0                    0
Totals:           102,763      211,931,257,507
(ckan)co@prod3 /vagrant/src/ckanext-archiver (master) $ df -h /media/hulk/
Filesystem      Size  Used Avail Use% Mounted on
/dev/sdc1       459G  333G  127G  73% /media/hulk
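As a rough cross-check of the numbers in this thread, the sketch below compares the two report totals and derives the free-space percentages from the `Use%` column. The variable names are illustrative only. It assumes `df -h` sizes are in powers of 1024. The report total and the `df` figure need not agree, since `df` measures the whole filesystem rather than just the files the report counts.

```python
# Rough arithmetic on figures quoted in this thread; names are illustrative.
GIB = 1024 ** 3

total_with_2gb_limit = 346541750418   # "Totals" bytes, report with 1-10 GB files present
total_with_1gb_limit = 211931257507   # "Totals" bytes, report after the 1GB limit

freed_bytes = total_with_2gb_limit - total_with_1gb_limit
freed_gib = freed_bytes / GIB


def free_percent(use_percent):
    """Free space as a percentage, given the Use% value that df prints."""
    return 100 - use_percent


if __name__ == "__main__":
    print("Report total dropped by {:,} bytes ({:.1f} GiB)".format(freed_bytes, freed_gib))
    print("Free before: {}%  after: {}%".format(free_percent(97), free_percent(73)))
```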
to discuss in sprint planning
Cleared 15GB with the first-gen tool. I expected more, but its report figures appear to include archivals of deleted datasets.