stefmolin / data-morph

Morph an input dataset of 2D points into select shapes, while preserving the summary statistics to a given number of decimal points through simulated annealing. It is intended to be used as a teaching tool to illustrate the importance of data visualization.
https://stefaniemolin.com/data-morph/
MIT License
60 stars 16 forks source link

Add gorilla starter dataset #180

Open stefmolin opened 1 month ago

stefmolin commented 1 month ago

Use the data from the A hypothesis is a liability paper (if available and permitted) as a new starter dataset.

We should preserve the classes (the sex field) to use for coloring points in the future.

See also: https://stefaniemolin.com/data-morph/stable/custom_datasets.html

AlexTheHopper commented 1 month ago

@epbennetts and I will be working on this over the next day or so :)

epbennetts commented 1 month ago

Yes, we are working on this together

AlexTheHopper commented 1 month ago

Hi Steph! @epbennetts and I have had a try at this but we're not sure if the final plots are clean enough. Let us know what you think - otherwise we'd be keen to try with another starting shape (perhaps a more symmetrical one?)

gorilla_to_all1.zip gorilla_to_all2.zip

stefmolin commented 1 month ago

This looks like a great start, @AlexTheHopper and @epbennetts. It is not expected that every starter dataset will work for every target shape. Be sure to reduce the number of points used for the gorilla though – it looks very dense and the results indicate you didn't run for enough iterations.