jkcshea / ivmte

An R package for implementing the method in Mogstad, Santos, and Torgovitsky (2018, Econometrica).
GNU General Public License v3.0

Why is the minimum criterion so high with the direct MTR approach? #208

Closed: johnnybonney closed this issue 2 years ago

johnnybonney commented 2 years ago

I have been getting very high values of the minimum criterion when using the direct MTR approach.

My recollection is that if I specify the IV-like moments myself, the minimum criterion is an L1 measure of how close we can get to matching those moments (given the specified constraints on the MTRs). However, when I use the direct MTR approach, I have been getting minimum criteria in the tens of thousands, even with a binary outcome and no covariates.
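
For reference, the moment-based specification I have in mind is something along these lines, using the same data and MTR specification as in the example below (I may be misremembering the exact ivlike/components syntax):

args_moments <- list(data = dt,  # dt as constructed in the example below
                     target = "ate",
                     ivlike = Y ~ D + factor(Z),  # IV-like regression that supplies the moments
                     components = l(D),           # match the coefficient on D
                     propensity = D ~ factor(Z),
                     m0 = ~uSplines(knots = c(0.25, 0.5, 0.75), degree = 3),
                     m1 = ~uSplines(knots = c(0.25, 0.5, 0.75), degree = 3))

res_moments <- do.call(ivmte, args_moments)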

It must be that either (i) the direct MTR method uses a different definition of the minimum criterion that I'm not familiar with, (ii) I am goofing up somewhere, or (iii) ivmte is doing something weird.

Here is an example:

library(data.table)
library(ivmte)

set.seed(1)

N <- 10000

# true MTR functions
m0 <- function(u) 0.37 + 0.5*u - 0.19*u^2
m1 <- function(u) 0.45 + 0.63*u - 0.47*u^2

# propensity scores for each value of the instrument
pscores <- c(0.31, 0.55, 0.79, 0.83)

# simulate data
U <- runif(N)
Z <- sample(1:4, N, replace = TRUE)
P <- pscores[Z]
D <- as.integer(U <= P)

# potential outcomes consistent with the MTRs
eps <- runif(N)
Y1  <- as.integer(eps <= m1(U))
Y0  <- as.integer(eps <= m0(U))
Y   <- D * Y1 + (1 - D) * Y0

dt <- data.table(Y = Y,
                 D = D,
                 Z = Z)

# run IVMTE
args <- list(data = dt,
             target = "ate",
             propensity = D ~ factor(Z),
             outcome = "Y",
             m0 = ~uSplines(knots = c(0.25, 0.5, 0.75), degree = 3),
             m1 = ~uSplines(knots = c(0.25, 0.5, 0.75), degree = 3),
             noisy = TRUE,
             initgrid.nu = 5000,
             audit.nu = 10000,
             criterion.tol = 0)

res <- do.call(ivmte, args)

This yields a minimum criterion of 2367.948 and bounds of [-0.09251298, 0.2420311].

In addition, if I increase N to 50000 and rerun the code, I get a minimum criterion of 11887.48 and bounds of [-0.2021548, 0.2500426]. This pattern holds in general: the minimum criterion scales roughly linearly with N, and it seems to change the bounds non-trivially. (While I would not expect a larger data set to improve the fit of the moments per se, it seems odd that the measure of fit gets worse as N grows.)
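
For completeness, the N = 50000 rerun is just the same code with the sample size changed:

# rerun with a larger sample; everything else is identical to the example above
set.seed(1)
N <- 50000
U <- runif(N)
Z <- sample(1:4, N, replace = TRUE)
P <- pscores[Z]
D <- as.integer(U <= P)
eps <- runif(N)
Y <- D * as.integer(eps <= m1(U)) + (1 - D) * as.integer(eps <= m0(U))
args$data <- data.table(Y = Y, D = D, Z = Z)
res50k <- do.call(ivmte, args)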

a-torgovitsky commented 2 years ago

The criteria are different and not comparable.

With the moment-based approach, you are trying to match the moments themselves. So you get 0 if and only if the moments are matched perfectly.

With the regression approach, you are trying to minimize the sum of squared residuals. Even if you have specified the conditional mean perfectly, you still won't get 0 unless the variance of the residuals is 0.
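
To see this concretely with the simulated data above (just an illustration of the point, not what ivmte computes internally):

# true conditional mean E[Y | D, Z] implied by the DGP in the example above
cmean <- function(d, p) {
  if (d == 1) integrate(m1, 0, p)$value / p        # E[Y1 | U <= p]
  else        integrate(m0, p, 1)$value / (1 - p)  # E[Y0 | U > p]
}
fit <- mapply(cmean, D, P)
sum((Y - fit)^2)  # far from 0, even though the conditional mean is exactly right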

As for the increase with N, my guess is that @jkcshea coded it in a way that drops the 1/N factor in the criterion function, so it isn't converging to anything as N grows. That's why you're seeing something about 5x as large when you make N 5x as large.

@jkcshea, it probably makes more sense to use the normalized criterion. It could potentially also help computational stability (although that seems unlikely). Do you have time to change it? It would probably be quick, but I know you are busy, so feel free to defer.
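
Continuing the illustration above, the two versions of the fit measure behave like this:

# unnormalized vs. normalized fit, reusing fit from the snippet above
sum((Y - fit)^2)   # grows roughly linearly in N (about 5x bigger with 5x the data)
mean((Y - fit)^2)  # converges to the average residual variance as N grows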

jkcshea commented 2 years ago

Sorry for the confusion; we had agreed several times in #198 to normalize the criterion. While the function is indeed doing this, I was still reporting the SSR as the criterion. This is now corrected, and the reported criterion should no longer vary with the sample size!

a-torgovitsky commented 2 years ago

Easy fix, thanks!