platisd / indoor-navigation-system

[WIP] An indoor navigation system to guide users towards their colleagues' desks
Apache License 2.0
15 stars 13 forks source link

Server core-dumps after ~16 minutes of operation #68

Closed platisd closed 6 years ago

platisd commented 6 years ago

Description

The ins_server core-dumps after approximately 16 minutes of operation, regardless of whether it is handling incoming data or not. The defect was reproducible 100% of the times.

Ways to reproduce

Have ins_server executing for long amounts of time.

Related logs

terminate called after throwing an instance of 'std::runtime_error' what(): No route to host Aborted (core dumped)

Definition of Done

The ins_server can run for indefinite amounts of time.

samup4web commented 6 years ago

@platisd hmm, what is your setup like? I tried running the ins_server on my machine and it runs for over 1 hour, and also accepted request thereafter. See the time-stamp prints in the screenshot below. screenshot

I will investigate further...

samup4web commented 6 years ago

I still have the server running, I will see how long before I reproduce the problem.

platisd commented 6 years ago

How did it go? Weird... otherwise it must be some local weird error, since I could consistently reproduce it.

samup4web commented 6 years ago

Still running at 19:38 ...

platisd commented 6 years ago

Weird... Let's close this for now and I will open it again if I have a different angle on it.

samup4web commented 6 years ago

@lexious89 can you try to run the server on your box and let's see if you can reproduce the problem.

samup4web commented 6 years ago

I manage to reproduce this issue. I will be investigating further...

platisd commented 6 years ago

Interesting, keep us posted! :+1:

samup4web commented 6 years ago

Strangely enough, I am on my second attempt to reproduce the problem after the first crash with no success. Will be testing on both my ubuntu box and rasberrypi.

samup4web commented 6 years ago

On a side note, @platisd are you on slack?

platisd commented 6 years ago

I sure am! 😛

Send invite to my work mail if you're making a channel

samup4web commented 6 years ago

Done.

samup4web commented 6 years ago

OK, couldn't get it to crash. So, am making a new run on rasberrypi now.

This will be running all night, so, fingers crossed.

platisd commented 6 years ago

What is your setup like? Are you using the wifi_module.ino without letting it sleep?

samup4web commented 6 years ago

Yes, modified wifi_module to not sleep and send data continuously with delay(20000);. I am using NodeMCU device for this. For the server, I am running on a fresh installation of rasberrypi.

samup4web commented 6 years ago

Strange that INS service has been running all night/day and still running (on rasberrypi)

I had to press reset button on ins_node this morning because it wasn't sending out data. I guess due to network connection.

platisd commented 6 years ago

I will also try to debug this when I find the time. Any suggestions on where to add extra logs?

samup4web commented 6 years ago

Okay, great! As for extra logs, none that I can think of now.

samup4web commented 6 years ago

Closing this issue since we haven't observed this issue lately. This can be re-opened again if problem reoccur

platisd commented 6 years ago

I agree. That being said, I think it could make sense to invest in making a Docker image to make sure everyone is running things in the same environment. :+1:

samup4web commented 6 years ago

Haha, nice idea. Taking it to another level :smile_cat: