SciML / NeuralPDE.jl

Physics-Informed Neural Networks (PINN) Solvers of (Partial) Differential Equations for Scientific Machine Learning (SciML) accelerated simulation
https://docs.sciml.ai/NeuralPDE/stable/

Nonlinear second-order boundary value problems #203

Closed asdasdqwdqwfsdf closed 2 years ago

asdasdqwdqwfsdf commented 3 years ago

Hi @ChrisRackauckas and @KirillZubov

I am trying to solve a nonlinear second-order boundary value problem in "NeuralPDE.jl", but the numerical result is still not correct.

The nonlinear second-order model in "NeuralPDE.jl" looks like:

using NeuralPDE, Flux, ModelingToolkit, GalacticOptim, Optim, DiffEqFlux,Plots
@parameters x, t
@variables u(..)
@derivatives Dxx''~x
@derivatives Dx'~x
@derivatives Dtt''~t
@derivatives Dt'~t

# 2D PDE (the Allen-Cahn equation)
gamm = 0.0001
eq = Dt(u(x,t)) ~ gamm*Dxx(u(x,t)) - 5*u(x,t)^3 + 5*u(x,t)

# Initial and boundary conditions
bcs = [u(-1,t) ~ u(1,t),
       Dx(u(-1,t)) ~ Dx(u(1,t)),
       u(x,0) ~ x*x*cos(π*x)] 

# Space and time domains
domains = [x ∈ IntervalDomain(-1.0,1.0),
           t ∈ IntervalDomain(0.0,1.0)]
# Discretization
dx = 0.1

# Neural network
chain = FastChain(FastDense(2,16,Flux.σ),FastDense(16,16,Flux.σ),FastDense(16,1))

discretization = PhysicsInformedNN(chain,
                                   strategy = GridTraining(dx=dx))

pde_system = PDESystem(eq,bcs,domains,[x,t],[u])
prob = discretize(pde_system,discretization)
cb = function (p,l)
    println("Current loss is: $l")
    return false
end
# optimizer
opt = Optim.BFGS()
res = GalacticOptim.solve(prob,opt; cb = cb, maxiters=3000)
phi = discretization.phi

# `domains` is ordered [x, t], and the network input order matches the
# [x,t] declaration in PDESystem, so evaluate phi([x,t], ...).
xs, ts = [domain.domain.lower:dx:domain.domain.upper for domain in domains]
u_predict_contourf = reshape([first(phi([x,t],res.minimizer)) for x in xs for t in ts], length(ts), length(xs))
plot(xs, ts, u_predict_contourf, linetype=:contourf, title="predict", xlabel="x", ylabel="t")

So, for nonlinear second-order boundary value problems, how should we choose the parameters to get a stable and correct solution with "NeuralPDE.jl"?

ChrisRackauckas commented 3 years ago

What do you mean by it's not correct? How does the loss convergence look? Did you try other neural networks? That's a small neural network. Etc.

asdasdqwdqwfsdf commented 3 years ago

1.) What do you mean by it's not correct?

Answer: Run this model in "NeuralPDE.jl" and compare against the analytical/numerical results in [https://arxiv.org/pdf/2007.04542.pdf]; you will see that the result from "NeuralPDE.jl" is totally wrong.

2.) How does the loss convergence look?

Answer: The loss convergence looks very good, but the result is totally wrong. Again, you can check the analytical/numerical results in [https://arxiv.org/pdf/2007.04542.pdf].

3.) Did you try other neural networks? That's a small neural network. Etc.

Answer: I have extended it as follows: FastChain(FastDense(2,16,Flux.σ),FastDense(16,16,Flux.σ),FastDense(16,16,Flux.σ),FastDense(16,16,Flux.σ),FastDense(16,16,Flux.σ),FastDense(16,16,Flux.σ),FastDense(16,1))

Its loss convergence looks very good, but the result is totally wrong. Again, we can check the analytical result / numerical result from [https://arxiv.org/pdf/2007.04542.pdf].

As mentioned in the paper [https://arxiv.org/pdf/2007.04542.pdf], we have to use special strategies to improve the accuracy of physics-informed neural networks on problems such as this one.

Now, back to the original question: how can we choose the deep-learning parameters for a stable and correct solution in "NeuralPDE.jl", for example for this problem?

ChrisRackauckas commented 3 years ago

First of all, what is the validation you used to ensure you implemented the equation you thought you did? How did you test it?

> Answer: Run this model in "NeuralPDE.jl" and compare against the analytical/numerical results in [https://arxiv.org/pdf/2007.04542.pdf]; you will see that the result from "NeuralPDE.jl" is totally wrong.

I didn't run the code and would like you (or someone else) to share what some of the results look like, or else I won't be able to debug it for a very long time.
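
For example, one way to validate independently of the paper is to check the PDE residual of the trained network directly. A minimal sketch, assuming the phi, res, and gamm from the script above (the helper names here are illustrative, and the residual is approximated with finite differences rather than AD):

# Sketch: evaluate the trained network and approximate the PDE residual
# u_t - gamm*u_xx + 5u^3 - 5u with central finite differences.
h = 1e-3
u_net(x, t) = first(phi([x, t], res.minimizer))
dt_u(x, t)  = (u_net(x, t + h) - u_net(x, t - h)) / (2h)
dxx_u(x, t) = (u_net(x + h, t) - 2u_net(x, t) + u_net(x - h, t)) / h^2
resid(x, t) = dt_u(x, t) - gamm * dxx_u(x, t) + 5u_net(x, t)^3 - 5u_net(x, t)
# A large maximum residual in the interior means the network does not
# actually satisfy the PDE, independent of any reference solution.
maximum(abs(resid(x, t)) for x in -0.9:0.1:0.9, t in 0.1:0.1:0.9)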

> Answer: The loss convergence looks very good

That's not an answer. An answer to that would be a plot.
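
A minimal sketch of how to produce such a plot with the script above: record each loss value inside the callback and plot the history after training (the log scale is just a convenient choice here).

# Sketch: store the loss at every callback call so convergence can be
# plotted, not just printed.
losses = Float64[]
cb = function (p, l)
    push!(losses, l)
    println("Current loss is: $l")
    return false
end
res = GalacticOptim.solve(prob, opt; cb = cb, maxiters = 3000)
plot(losses, yscale = :log10, xlabel = "iteration", ylabel = "loss", legend = false)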

> Answer: I have extended it as follows: FastChain(FastDense(2,16,Flux.σ),FastDense(16,16,Flux.σ),FastDense(16,16,Flux.σ),FastDense(16,16,Flux.σ),FastDense(16,16,Flux.σ),FastDense(16,16,Flux.σ),FastDense(16,1))

That's a bad idea. Deep nets with small intermediate layers both don't parallelize well and do not satisfy universal approximation.
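
For example, a wider but shallower network along these lines (the layer sizes here are only an illustration, not a tuned choice):

# Sketch: prefer width over depth at this scale, e.g. two hidden layers
# of 64 neurons instead of six hidden layers of 16.
chain = FastChain(FastDense(2, 64, Flux.σ),
                  FastDense(64, 64, Flux.σ),
                  FastDense(64, 1))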

> opt = Optim.BFGS()

You used pure BFGS. That's a bad idea without setting allow_f_increases=true: you will hit a local optimum. Did you try running ADAM first and then finishing with BFGS?
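
A sketch of that two-stage training with the GalacticOptim API used in the script above (the learning rate and iteration counts are illustrative):

# Sketch: ADAM first to move off the initial plateau, then BFGS started
# from the ADAM minimizer for local refinement.
res = GalacticOptim.solve(prob, ADAM(0.01); cb = cb, maxiters = 2000)
prob = remake(prob, u0 = res.minimizer)
res = GalacticOptim.solve(prob, Optim.BFGS(); cb = cb, maxiters = 1000)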

ChrisRackauckas commented 3 years ago

Wait, why was this closed?

ChrisRackauckas commented 3 years ago

This is most likely a user issue, but I think it's fine to keep it open and turn it into a tutorial. @KirillZubov, do you want to investigate this model over the next few weeks?

KirillZubov commented 3 years ago

> This is most likely a user issue, but I think it's fine to keep it open and turn it into a tutorial. @KirillZubov, do you want to investigate this model over the next few weeks?

Yeah, sure.

ChrisRackauckas commented 2 years ago

This works now with the correct training setup.