zavolanlab / bindz-rbp

RBP module for bindz, a bioinformatics tool to detect regulators' binding sites on RNA sequences.
https://github.com/zavolanlab/bindz-rbp
Apache License 2.0
7 stars 1 forks source link

Non-reproducible md5 checksums of R plots #24

Closed AngryMaciek closed 3 years ago

AngryMaciek commented 4 years ago

Describe the bug It seems that on every machine the script that plots the heatmap gets executed the md5 checksum of the output is different. The svg plot is generally the same, some minor changes appear, as:

<line x1='772.69' y1='203.91' x2='776.15' y2='203.91' style='stroke-width: 0.38; stroke: #FFFFFF; stroke-linecap: butt;' clip-path='url(#cpMC4wMHw4NjQuMDB8NTA0LjAwfDAuMDA=)' />

vs.

<line x1='772.69' y1='203.77' x2='776.15' y2='203.77' style='stroke-width: 0.38; stroke: #FFFFFF; stroke-linecap: butt;' clip-path='url(#cpMC4wMHw4NjQuMDB8NTA0LjAwfDAuMDA=)' />

The script has been tested on macOS Catalina, Ubuntu, Linux Kali and Travis CI servers. For every machine the checksums are reprodicible however while executing on another machine - the results changes. I believe this is a ggplot internal issue...

Expected behavior We would like to have a reproducible plotting script that generates exactly the same output file regardless of the machine it runs on.


For now testing the checksums of the heatmaps in the CI has been turned off.

AngryMaciek commented 4 years ago

@krish8484 , Would you like to add anything to that issue?

AngryMaciek commented 3 years ago

Miniscule plots' coordinate differences on distinct machines is not an issue of this repository.