kdahlquist / GRNmap

Gene Regulatory Network modeling and parameter estimation
BSD 3-Clause "New" or "Revised" License
4 stars 3 forks source link

L-curve analysis with varied network sizes, optimization parameters, data #351

Closed kdahlquist closed 5 years ago

kdahlquist commented 7 years ago

@kdahlquist is going to run L-curve analyses in the Seaver 120 computer lab as follows:

Note that because of the problem of getting slightly different results on different computers, the results are not 100% comparable, but there isn't anything we can do about that right now.

kdahlquist commented 7 years ago
kdahlquist commented 7 years ago

The good news is that the L-curves I ran on the first two files took about 28 hours to run (instead of a week!), so I won't need to monopolize all the computers, I can just run my tests sequentially. Link to the results files has been appended to the previous post. I plan to run 2 analyses per day until I exhaust the permutations I want to try.

kdahlquist commented 7 years ago

Runs have been completed. Data and notes can be found in the Dahlquist Lab repository here:

https://github.com/kdahlquist/DahlquistLab/tree/master/data/Summer2017/L-curve

The compiled data with the run ID, alpha, LSE, and penalty values can be found here:

https://github.com/kdahlquist/DahlquistLab/blob/master/data/Summer2017/L-curve/compiled_L-curve_data_2017-06-21_updated.xlsx

bklein7 commented 6 years ago

An updated version of the R script for generating L-curves has been uploaded to the Dahlquist Lab repository: https://github.com/kdahlquist/DahlquistLab/blob/master/R_scripts/L-Curves.Rd. Although this code is still a work in progress, the following features have been successfully added:

(Updated originally posted on September 5th to issue #354.)

bklein7 commented 6 years ago

A newly updated version of the R script for generating L-curves has been uploaded to the Dahlquist Lab repository: https://github.com/kdahlquist/DahlquistLab/blob/master/R_scripts/L-Curves.R. This version includes the following updates:

Using this iteration of the script, new L-curves were produced for all analysis sets previously generated in this issue (#351). All L-curves were plotted on a fixed scale. The PowerPoint file containing these results can be accessed here: https://github.com/kdahlquist/DahlquistLab/blob/master/documents/L-Curve-Analysis_Varied-Optimization-Data-Size_BK20170909.pptx.

kdahlquist commented 6 years ago

Loose ends from today's meeting:

bklein7 commented 6 years ago

The latest version of the R script for generating L-curves can be found here: https://github.com/kdahlquist/DahlquistLab/blob/master/R_scripts/L-Curves.R. This update added hard-coded color/pattern combinations for 42 distinct lines when plotting multiple L-curves on the same graph.

An updated L-curve analysis PowerPoint presentation can be found here: https://github.com/kdahlquist/DahlquistLab/blob/master/documents/L-Curve-Analysis_Varied-Optimization-Data-Size_BK20170909.pptx. This version addresses the following issues from last week's meeting:

Finally, I did some research on AIC curves that we can discuss during today's meeting (see #360).

bklein7 commented 6 years ago

The "L-curves" R script has been commented out and can now be considered complete until further features are proposed: https://github.com/kdahlquist/DahlquistLab/blob/master/R_scripts/L-Curves.R.

bklein7 commented 6 years ago

The L-curve powerpoint presentation was updated to include proper labeling of the step-down experiment graph (LSE vs. parameter #): https://github.com/kdahlquist/DahlquistLab/blob/master/documents/L-Curve-Analysis_Varied-Optimization-Data-Size_BK20170909.pptx.

kdahlquist commented 5 years ago

This has been completed.