CPU 100% with tornado 4.5.3?

SylphiaWindy commented 6 years ago

We have built a app with tornado 4.5.3, along with pynsq 0.8.2(which was not compatible with tornado 5), running on CentOS 7.4 with default kernel 3.10.0-693.21.1.el7.x86_64. CPU usage came up to 100% soon after app started, with ~100 connections. We are using tornado's TCPServer class and IOStream.read_bytes(4096, partial=True) to implement our async tcp server, something like

        while True:
            try:
                buf = await self._stream.read_bytes(4096, partial=True)
            except:
                self.close()
                return None
            return buf

Many lines like read(370, 0x2e47d80, 11) = -1 EAGAIN (Resource temporarily unavailable) in strace result, and strace -c result was:

% time     seconds  usecs/call     calls    errors syscall
------ ----------- ----------- --------- --------- ----------------
 37.95    0.892246           4    245309    243731 read
 35.95    0.845239           3    242459         2 ioctl
 19.15    0.450160           4    119424           epoll_wait
  4.58    0.107586          13      7996       590 futex
  0.46    0.010713           3      3263      1395 recvfrom
  0.43    0.010007           5      1908           sendto
  0.33    0.007788           3      2700           epoll_ctl
  0.30    0.006951           4      1638           write
  0.19    0.004412           3      1746           fstat
  0.17    0.004001           2      1711      1705 lseek
  0.10    0.002335           3       754       726 getpeername

A tornado issue?

SylphiaWindy commented 6 years ago

epoll_wait(3, [{EPOLLOUT, {u32=170, u64=8589934762}}, {EPOLLOUT, {u32=370, u64=8589934962}}, {EPOLLOUT, {u32=491, u64=8589935083}}, {EPOLLOUT, {u32=361, u64=8589934953}}], 1023, 11) = 4
read(361, 0x2dc4213, 5)                 = -1 EAGAIN (Resource temporarily unavailable)
ioctl(361, FIONBIO, [1])                = 0
read(491, 0x5dee050, 11)                = -1 EAGAIN (Resource temporarily unavailable)
ioctl(491, FIONBIO, [1])                = 0
read(370, 0x2e47d80, 11)                = -1 EAGAIN (Resource temporarily unavailable)
ioctl(370, FIONBIO, [1])                = 0
read(170, 0x6230b30, 11)                = -1 EAGAIN (Resource temporarily unavailable)
ioctl(170, FIONBIO, [1])                = 0
epoll_wait(3, [{EPOLLOUT, {u32=170, u64=8589934762}}, {EPOLLOUT, {u32=370, u64=8589934962}}, {EPOLLOUT, {u32=491, u64=8589935083}}, {EPOLLOUT, {u32=361, u64=8589934953}}], 1023, 10) = 4
read(361, 0x2dc4213, 5)                 = -1 EAGAIN (Resource temporarily unavailable)
ioctl(361, FIONBIO, [1])                = 0
read(491, 0x5dee050, 11)                = -1 EAGAIN (Resource temporarily unavailable)
ioctl(491, FIONBIO, [1])                = 0
read(370, 0x2e47d80, 11)                = -1 EAGAIN (Resource temporarily unavailable)

bdarnell commented 6 years ago

That's interesting. It's telling epoll that it wants to write (EPOLLOUT) to these file descriptors, but then when epoll says they're writeable all it does is read. It also looks like it's trying to read 5 or 11 bytes at a time, unlike the 4096 in your code snippet. Is there something else going on here (maybe in pynsq)?

Are you using SSL? This kind of confusion about whether a stream is reading or writing is somewhat more common in SSL code although I haven't seen a bug like that in Tornado in a very long time.

SylphiaWindy commented 6 years ago

Yes, we were using SSL, but even we disabled SSL, this issue also occurred. Socket 370 was a normal app client connection, not from pynsq.

Is it possible that my code caused this issue?

bdarnell commented 6 years ago

Is it possible that my code caused this issue?

It's always possible, especially since I haven't heard of anyone else having this problem. But if your code looks like what you've pasted above, it looks fine to me and this bug seems more likely to be somewhere inside the IOStream.

SylphiaWindy commented 6 years ago

How to write code to cause this problem: request to write but do read instead?

What I've done in my code was:

Try read some bytes (4096 max, partial read preferred)
Try parse packet header, if found, continue to read packet body bytes of length in header in the same while loop, until full packet read.
Process request, do some write back to the stream.
Go back to 1

No 5 or 11 bytes read in my code.

SylphiaWindy commented 6 years ago

Just tested, start app server, make only one client connection.

epoll_wait(3, [{EPOLLOUT, {u32=17, u64=8589934609}}], 1023, 92) = 1
read(17, 0x3486240, 11)                 = -1 EAGAIN (Resource temporarily unavailable)
ioctl(17, FIONBIO, [1])                 = 0
epoll_wait(3, [{EPOLLOUT, {u32=17, u64=8589934609}}], 1023, 92) = 1
read(17, 0x3486240, 11)                 = -1 EAGAIN (Resource temporarily unavailable)
ioctl(17, FIONBIO, [1])                 = 0

bdarnell commented 6 years ago

How to write code to cause this problem: request to write but do read instead?

I don't know. I think you'll just need to keep hacking on your application, trying to reduce the problem to the minimum reproducible code. You might try other tornado versions or other operating system versions to see if that makes a difference.

SylphiaWindy commented 6 years ago

Ok, I'll report later.

SylphiaWindy commented 6 years ago

Correction: with SSL disabled, one client connection didn't reproduce this problem, only when SSL was enabled.

SylphiaWindy commented 6 years ago

I found the cause of the problem. I've not read tornado test code before, and it turns out that handle_stream is being called just after connection made, before ssl handshake. In my code, I sent a welcome message to client in handle_stream which caused the problem. In test code there is a test_wait_for_handshake_callback which seems meet my needs. But there's a question, in my design, code in handle_stream don't care if it was a ssl connection, and wait_for_handshake is a SSLIOStream method, not an override of IOStream class. How about make waiting ssl handshake ~~inside~~ before handle_stream code?

bdarnell commented 6 years ago

Ah, that explains it. In protocols like HTTP where the client sends the first message, you wouldn't see this, but it's possible in protocols that work the other way around. Nice work tracking this down.

Is there any reason why TCPServer shouldn't call wait_for_handshake for you before calling handle_stream? I guess one reason is that if you want to block an abusive IP address you'd want to have the ability to do that before going through the relatively expensive handshake process.

Instead of calling wait_for_handshake automatically, we could change the behavior of IOStream.write() when the handshake is not completed. The data passed should be buffered and we shouldn't tell the event loop we want to write until the handshake is done. I actually thought we already did that for use with client-side connections.

SylphiaWindy commented 6 years ago

My mistake, like you said, I mean TCPServer should wait for ssl handshake completed before calling handle_stream, seems asyncio does. Yesterday, after I found the cause, the first version of my fix was:

async def handle_stream(self, io_stream, addr):
    async def handle_new_connection(stream, address):
        pass # original handle_stream code goes here
    if isinstance(io_stream, SSLIOStream):
        await io_stream.wait_for_handshake()
    await handle_new_connection(io_stream, addr)

CPU usage went down, but it lead to another big problem, too many open files. I saw over 60k fds under /proc/*/fd, most of them were socket fd, but netstat only showed ~100 established connections along with ~10 half open connections. I didn't dig deeper about how this happened, just made a second version fix:

async def handle_stream(self, io_stream, addr):
    DEFAULT_SSL_HANDSHAKE_TIMEOUT = 10
    async def handle_new_connection(stream, address):
        pass # original handle_stream code goes here
    if isinstance(io_stream, SSLIOStream):
        try:
            co = io_stream.wait_for_handshake()
            await with_timeout(IOLoop.current().time() + DEFAULT_SSL_HANDSHAKE_TIMEOUT,
                               co, quiet_exceptions=(IOError, StreamClosedError))
        except TimeoutError:
            if not io_stream.closed():
                io_stream.close()
            return
    await handle_new_connection(io_stream, addr)

Problem solved.

tornadoweb / tornado

CPU 100% with tornado 4.5.3? #2373