iiasa / message_ix

The integrated assessment and energy systems model MESSAGEix
https://docs.messageix.org
Apache License 2.0
121 stars 153 forks source link

Numerical properties of the LP optimization problem generated from message_ix #179

Open MarekMakowski opened 5 years ago

MarekMakowski commented 5 years ago

We would like to improve the numerical properties of the LPPs the message_ix sends to LP solvers. IMHO this requires a concerted treatment of at least the following issues:

  1. Model specification: split dense rows and columns; especially important for the IPM (barrier) lpmethod, which is usually the best for large LPs (provided they have sparse Cholesky). Note that just one very dense row or column may dramatically increase the matrix density and thus the number of multiplications in each iteration. Although some solvers split dense rows/cols but the modelers certainly can do it much better.
  2. Data cleansing (cleaning) in our DB should support management of "correct" ranges of data values and the corresponding units. The data import from the DB and its conversion to the LP parameters should include replacing "non-substantial" ("small") coefficients by zeros and warnings about the "large" coefficients. Large values of bounds/rhs should be verified, and - if appropriate - replaced by the infinity (i.e., "no bound"). A utility for the basic diagnostics of the LP matrix would be very helpful.
  3. Diagnostics of the LPP numerical properties. This is a pretty complex issue, so let's start with a simple element, i.e., the condition number (usually denoted by Kappa) that might be provided by GAMS and/or solvers. I have not found any GAMS function/option for computing Kappa (maybe it still provides it but my very limited knowledge of GAMS was not enough to find it). However, I've found the cplex binary option called quality, which outputs info (including Kappa) on the solution quality. Even if it works then it can only help in evaluation of the solution sensitivity (I guess it provides the Kappa of the basis). Kappa of the Cholesky would much better help in evaluation of the matrix numerical properties. If GAMS indeed does not provide such a function then may the R does (I guess chances are good and hopefully colleagues experienced in R can quickly check this).

Summing-up: these issues call for a careful consideration. The open question however is, if we will find enough motivation to deal with these topics.

danielhuppmann commented 5 years ago

thank you @MarekMakowski for the detailed description!

it seems that quality can easily be called via the GAMS CPLEX options

MarekMakowski commented 5 years ago

thanks Daniel for the comment. Well… your comment shows that I’ve succeeded to hide this piece:

...the cplex binary option called quality, which outputs info (including Kappa) on the solution quality. well beyond (maybe too much?) details. Sorry 😢

I’ll experiment with the cplex quality option although it appears to provide limited info, i.e., on the sensitivity of the “optimal” solution. Therefore, I think, we should still try to get the Kappa of the QR of the AA^T (maybe someone remembers the algebra better than I do and thus can suggest a better method for evaluation of an LPP numerical properties). The best I was able to find last night is the hint that R provides the kappa(…) function, which appears to do this. The IMSL library has, of course, several functions for the condition number calculations but … although IIASA maintained the IMSL in 1980s, I am afraid it would be time consuming to try to make IMSL operational (and I am not sure that this old Ftn library would handle the Indus-model-size matrix). This is why I suggest to explore the expertise of colleagues who know the R.

However, the kappa will most probably only confirm what we already guess. Therefore, I still think that the most effective for us would be to try our best to be closed to the classical advice: keep the abs-values of LPP coefficients within the [0.01, 100] range. I know… nowadays this is commonly considered “impossible” but we all know there is a very good reason beyond this advice. We also know that generating models with coefficients within a “reasonable” range takes a lot of time which is our scarce resource. However, I am convinced that this is one of the best “investments” towards the quality of our models.

Bests, Marek

On Mar 21, 19, at 7:17, Daniel Huppmann notifications@github.com wrote:

thank you @MarekMakowski for the detailed description!

it seems that quality can easily be called via the GAMS CPLEX options

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub, or mute the thread.

MarekMakowski commented 5 years ago

As promised, I've explored the Kappa (cplex option: quality 1)

image

Adding the aggressive scaling cplex option (scaind 1) improves the Kappa but it is still far too high to trust the results; moreover, the infeasibilities remain:

image
MarekMakowski commented 5 years ago

@khaeru (#127 copied(?) here): I am not sure if the barrier (or any other lpmethod) is the best default. Maybe an annotated cplex.opt file with all defaults and the suggested [for trying] options commented will be a better (although not easier to use) solution?