matrix-org / synapse

Synapse: Matrix homeserver written in Python/Twisted.
https://matrix-org.github.io/synapse
Apache License 2.0
11.79k stars 2.13k forks source link

Failed to fetch room list #12253

Open MatthieuBarthel opened 2 years ago

MatthieuBarthel commented 2 years ago

Description

I am using latest synapse docker image (v1.54), everything seems to work fine except I cannot list remote public rooms from app.element.io, it says "Failed to fetch room list" (all external homeserver fails: matrix.org, gitter.im, libera.chat). I am having this issue since I installed synapse (v1.52).

Federation seems ok: https://federationtester.matrix.org/#imatt.ch

In the synapse logs we can see it fails to verify the remote server SSL certificate (here matrix.org) :

matrix_synapse.1.hirryiykm2t0@web9    | 2022-03-18 21:13:45,364 - synapse.access.http.8008 - 427 - INFO - GET-1582 - 80.219.12.248 - 8008 - {@matthieu.barthel:imatt.ch} Processed request: 0.001sec/0.001sec (0.000sec, 0.000sec) (0.000sec/0.000sec/0) 2B 200 "GET /_matrix/client/r0/thirdparty/protocols HTTP/1.1" "Mozilla/5.0 (X11; Ubuntu; Linux x86_64; rv:98.0) Gecko/20100101 Firefox/98.0" [0 dbevts]
matrix_synapse.1.hirryiykm2t0@web9    | 2022-03-18 21:13:45,462 - synapse.http.matrixfederationclient - 649 - INFO - POST-1583 - {GET-O-2} [matrix.org] Request failed: GET matrix://matrix.org/_matrix/federation/v1/publicRooms?include_all_networks=false&limit=20: ResponseNeverReceived:[Error([('SSL routines', 'tls_process_server_certificate', 'certificate verify failed')])]
matrix_synapse.1.hirryiykm2t0@web9    | 2022-03-18 21:13:45,954 - synapse.http.matrixfederationclient - 649 - INFO - POST-1583 - {GET-O-2} [matrix.org] Request failed: GET matrix://matrix.org/_matrix/federation/v1/publicRooms?include_all_networks=false&limit=20: ResponseNeverReceived:[Error([('SSL routines', 'tls_process_server_certificate', 'certificate verify failed')])]
matrix_synapse.1.hirryiykm2t0@web9    | 2022-03-18 21:13:46,850 - synapse.http.matrixfederationclient - 649 - INFO - POST-1583 - {GET-O-2} [matrix.org] Request failed: GET matrix://matrix.org/_matrix/federation/v1/publicRooms?include_all_networks=false&limit=20: ResponseNeverReceived:[Error([('SSL routines', 'tls_process_server_certificate', 'certificate verify failed')])]
matrix_synapse.1.hirryiykm2t0@web9    | 2022-03-18 21:13:49,500 - synapse.http.matrixfederationclient - 649 - INFO - POST-1583 - {GET-O-2} [matrix.org] Request failed: GET matrix://matrix.org/_matrix/federation/v1/publicRooms?include_all_networks=false&limit=20: ResponseNeverReceived:[Error([('SSL routines', 'tls_process_server_certificate', 'certificate verify failed')])]
matrix_synapse.1.hirryiykm2t0@web9    | 2022-03-18 21:13:49,501 - synapse.http.server - 95 - INFO - POST-1583 - <XForwardedForRequest at 0x7fda095dd4c0 method='POST' uri='/_matrix/client/r0/publicRooms?server=matrix.org' clientproto='HTTP/1.1' site='8008'> SynapseError: 502 - Failed to fetch room list

The SSL verification seems fine from the docker host and the synapse container :

root@6151c1073bd8:/# openssl s_client -showcerts -connect matrix.org:443
CONNECTED(00000003)
depth=2 C = US, O = Internet Security Research Group, CN = ISRG Root X1
verify return:1
depth=1 C = US, O = Let's Encrypt, CN = R3
verify return:1
depth=0 CN = www.matrix.org
verify return:1
---
Certificate chain
 0 s:CN = www.matrix.org
   i:C = US, O = Let's Encrypt, CN = R3
-----BEGIN CERTIFICATE-----
MIIG6DCCBdCgAwIBAgISBM7BiY4r42zWwv1Dh6/5GljwMA0GCSqGSIb3DQEBCwUA
MDIxCzAJBgNVBAYTAlVTMRYwFAYDVQQKEw1MZXQncyBFbmNyeXB0MQswCQYDVQQD
EwJSMzAeFw0yMjAxMjUwNzU4MDlaFw0yMjA0MjUwNzU4MDhaMBkxFzAVBgNVBAMT
Dnd3dy5tYXRyaXgub3JnMIICIjANBgkqhkiG9w0BAQEFAAOCAg8AMIICCgKCAgEA
rb/4AkYQ7UoG3vUOH2fLjP2wVPKQoRz4knAi+MnELLsSeqjYA2alp/7kDx2RmeP8
VbSUicl68/uK8FZWmuWyuf5+Fg+zDJi6iOz+mdGxwjIVxbgu55LSpy+6G5Vrlqxo
LGtvFyrcvKHNmfTsF5qa795xDXDLIpIuw7z4n0ufk9QLsAArhK4bzXgKkSKJDIZa
ds57rSJHjiefhzGwPTB78V3VZaagS4vQCt5Le1WrjLQbE0UujZ40q6+kzVaUWGWk
FOH0n+U2zM+eaOJTPUwz/+bxUBMilk4Y8LKnVXwTR7ffcEVuNvTwsq4SOt80Nkac
vDE+EWPPseM5PEvEesqzf2r7xByhOD9DjXZVxyxie1Yqadl4ORKLLMydLgSFrtZw
nGU/gdIJESWxouVoFSwQ9Mt8eMef0p7KwIy1cu+rWdVgqT0A9PsIuubuMyy/9xIJ
V5QDyw2gcVEL1qZptRXQfk1hJPdOwcxTONiMTJVnemgwGhNFrkHfdGyfW7Ll71bZ
B+JPC+euTS01DImDDOy8jCaWSzMw2qzqHKkc9MjScybV92x4mP66fVTGBkA+GAw3
uFFRf+L8gfjS5pABkUFUU7fuyZGDGtjMrEYkqVnUfj7MGuDYPpqGgrsttNMZLz/U
tdNXWMyiP0jzQ5Zs2TNOcdU6OCCm6rrsFoTogzCaDf8CAwEAAaOCAw8wggMLMA4G
A1UdDwEB/wQEAwIFoDAdBgNVHSUEFjAUBggrBgEFBQcDAQYIKwYBBQUHAwIwDAYD
VR0TAQH/BAIwADAdBgNVHQ4EFgQUnbZw115TEUazQk/fV8AEmuR0RjAwHwYDVR0j
BBgwFoAUFC6zF7dYVsuuUAlA5h+vnYsUwsYwVQYIKwYBBQUHAQEESTBHMCEGCCsG
AQUFBzABhhVodHRwOi8vcjMuby5sZW5jci5vcmcwIgYIKwYBBQUHMAKGFmh0dHA6
Ly9yMy5pLmxlbmNyLm9yZy8wgd0GA1UdEQSB1TCB0oINYWJvdXQucmlvdC5pbYIR
YXJld2VyZWFkeXlldC5jb22CDmxmcy5tYXRyaXgub3JnghhtYXRyaXgtY2xpZW50
Lm1hdHJpeC5vcmeCHG1hdHJpeC1mZWRlcmF0aW9uLm1hdHJpeC5vcmeCCm1hdHJp
eC5vcmeCCW1hdHJpeC50b4IKbW9kdWxhci5pbYIHcmlvdC5pbYIPc3BlYy5tYXRy
aXgub3Jnggl2ZWN0b3IuaW2CDnd3dy5tYXRyaXgub3Jngg53d3cubW9kdWxhci5p
bTBMBgNVHSAERTBDMAgGBmeBDAECATA3BgsrBgEEAYLfEwEBATAoMCYGCCsGAQUF
BwIBFhpodHRwOi8vY3BzLmxldHNlbmNyeXB0Lm9yZzCCAQUGCisGAQQB1nkCBAIE
gfYEgfMA8QB3AEHIyrHfIkZKEMahOglCh15OMYsbA+vrS8do8JBilgb2AAABfpB0
70sAAAQDAEgwRgIhAN8Hoqf4wDDec4giZvK7FBtzPKHT9zbKsGnER9GXIjhYAiEA
/2ziF5cJrajoTlY8gXyvj4sPMRBFpbr7DyP0wsEpwR8AdgApeb7wnjk5IfBWc59j
pXflvld9nGAK+PlNXSZcJV3HhAAAAX6QdO8zAAAEAwBHMEUCIAjlIo2o9cnuu18E
FsCj4FvdMiUCksUtcEZjATBc4L5lAiEA1CKPT9aA3nFTy63Hm3MKdHV197EhRbBe
WflSmrl71jIwDQYJKoZIhvcNAQELBQADggEBAB3rzyTkM8Dh26oR7xyVGOVPWlHz
KFvyvvKEc84LE3DV31tPFKXwXnSATEUnjKlSTIQ72UxaqdIfBiUUikmk5qHzTNeh
ZHKEUqGzq4Tt7yqZSXSp+jfyTRbaLBI7FmAKZoEEO9m4ef8wvFXel2zxHpYtb7eG
VSJXEMR8c1N1+GgaKNryHh7aTFNz1jibctV9nPjHgpP6XJU+9EgdBvcoTisbrLjZ
JOgaNT6CFO2DWk7ule+R4vNiiSq+baqEn3S10ouPcv4TWe6JWxHtwdj3wQUHx8aW
bCeAmbtg5TFQZAT0Zq90U+EASwTByyZJ4rFYmjdm5902Vr+UXTQI692LWrM=
-----END CERTIFICATE-----
 1 s:C = US, O = Let's Encrypt, CN = R3
   i:C = US, O = Internet Security Research Group, CN = ISRG Root X1
-----BEGIN CERTIFICATE-----
MIIFFjCCAv6gAwIBAgIRAJErCErPDBinU/bWLiWnX1owDQYJKoZIhvcNAQELBQAw
TzELMAkGA1UEBhMCVVMxKTAnBgNVBAoTIEludGVybmV0IFNlY3VyaXR5IFJlc2Vh
cmNoIEdyb3VwMRUwEwYDVQQDEwxJU1JHIFJvb3QgWDEwHhcNMjAwOTA0MDAwMDAw
WhcNMjUwOTE1MTYwMDAwWjAyMQswCQYDVQQGEwJVUzEWMBQGA1UEChMNTGV0J3Mg
RW5jcnlwdDELMAkGA1UEAxMCUjMwggEiMA0GCSqGSIb3DQEBAQUAA4IBDwAwggEK
AoIBAQC7AhUozPaglNMPEuyNVZLD+ILxmaZ6QoinXSaqtSu5xUyxr45r+XXIo9cP
R5QUVTVXjJ6oojkZ9YI8QqlObvU7wy7bjcCwXPNZOOftz2nwWgsbvsCUJCWH+jdx
sxPnHKzhm+/b5DtFUkWWqcFTzjTIUu61ru2P3mBw4qVUq7ZtDpelQDRrK9O8Zutm
NHz6a4uPVymZ+DAXXbpyb/uBxa3Shlg9F8fnCbvxK/eG3MHacV3URuPMrSXBiLxg
Z3Vms/EY96Jc5lP/Ooi2R6X/ExjqmAl3P51T+c8B5fWmcBcUr2Ok/5mzk53cU6cG
/kiFHaFpriV1uxPMUgP17VGhi9sVAgMBAAGjggEIMIIBBDAOBgNVHQ8BAf8EBAMC
AYYwHQYDVR0lBBYwFAYIKwYBBQUHAwIGCCsGAQUFBwMBMBIGA1UdEwEB/wQIMAYB
Af8CAQAwHQYDVR0OBBYEFBQusxe3WFbLrlAJQOYfr52LFMLGMB8GA1UdIwQYMBaA
FHm0WeZ7tuXkAXOACIjIGlj26ZtuMDIGCCsGAQUFBwEBBCYwJDAiBggrBgEFBQcw
AoYWaHR0cDovL3gxLmkubGVuY3Iub3JnLzAnBgNVHR8EIDAeMBygGqAYhhZodHRw
Oi8veDEuYy5sZW5jci5vcmcvMCIGA1UdIAQbMBkwCAYGZ4EMAQIBMA0GCysGAQQB
gt8TAQEBMA0GCSqGSIb3DQEBCwUAA4ICAQCFyk5HPqP3hUSFvNVneLKYY611TR6W
PTNlclQtgaDqw+34IL9fzLdwALduO/ZelN7kIJ+m74uyA+eitRY8kc607TkC53wl
ikfmZW4/RvTZ8M6UK+5UzhK8jCdLuMGYL6KvzXGRSgi3yLgjewQtCPkIVz6D2QQz
CkcheAmCJ8MqyJu5zlzyZMjAvnnAT45tRAxekrsu94sQ4egdRCnbWSDtY7kh+BIm
lJNXoB1lBMEKIq4QDUOXoRgffuDghje1WrG9ML+Hbisq/yFOGwXD9RiX8F6sw6W4
avAuvDszue5L3sz85K+EC4Y/wFVDNvZo4TYXao6Z0f+lQKc0t8DQYzk1OXVu8rp2
yJMC6alLbBfODALZvYH7n7do1AZls4I9d1P4jnkDrQoxB3UqQ9hVl3LEKQ73xF1O
yK5GhDDX8oVfGKF5u+decIsH4YaTw7mP3GFxJSqv3+0lUFJoi5Lc5da149p90Ids
hCExroL1+7mryIkXPeFM5TgO9r0rvZaBFOvV2z0gp35Z0+L4WPlbuEjN/lxPFin+
HlUjr8gRsI3qfJOQFy/9rKIJR0Y/8Omwt/8oTWgy1mdeHmmjk7j1nYsvC9JSQ6Zv
MldlTTKB3zhThV1+XWYp6rjd5JW1zbVWEkLNxE7GJThEUG3szgBVGP7pSWTUTsqX
nLRbwHOoq7hHwg==
-----END CERTIFICATE-----
 2 s:C = US, O = Internet Security Research Group, CN = ISRG Root X1
   i:O = Digital Signature Trust Co., CN = DST Root CA X3
-----BEGIN CERTIFICATE-----
MIIFYDCCBEigAwIBAgIQQAF3ITfU6UK47naqPGQKtzANBgkqhkiG9w0BAQsFADA/
MSQwIgYDVQQKExtEaWdpdGFsIFNpZ25hdHVyZSBUcnVzdCBDby4xFzAVBgNVBAMT
DkRTVCBSb290IENBIFgzMB4XDTIxMDEyMDE5MTQwM1oXDTI0MDkzMDE4MTQwM1ow
TzELMAkGA1UEBhMCVVMxKTAnBgNVBAoTIEludGVybmV0IFNlY3VyaXR5IFJlc2Vh
cmNoIEdyb3VwMRUwEwYDVQQDEwxJU1JHIFJvb3QgWDEwggIiMA0GCSqGSIb3DQEB
AQUAA4ICDwAwggIKAoICAQCt6CRz9BQ385ueK1coHIe+3LffOJCMbjzmV6B493XC
ov71am72AE8o295ohmxEk7axY/0UEmu/H9LqMZshftEzPLpI9d1537O4/xLxIZpL
wYqGcWlKZmZsj348cL+tKSIG8+TA5oCu4kuPt5l+lAOf00eXfJlII1PoOK5PCm+D
LtFJV4yAdLbaL9A4jXsDcCEbdfIwPPqPrt3aY6vrFk/CjhFLfs8L6P+1dy70sntK
4EwSJQxwjQMpoOFTJOwT2e4ZvxCzSow/iaNhUd6shweU9GNx7C7ib1uYgeGJXDR5
bHbvO5BieebbpJovJsXQEOEO3tkQjhb7t/eo98flAgeYjzYIlefiN5YNNnWe+w5y
sR2bvAP5SQXYgd0FtCrWQemsAXaVCg/Y39W9Eh81LygXbNKYwagJZHduRze6zqxZ
Xmidf3LWicUGQSk+WT7dJvUkyRGnWqNMQB9GoZm1pzpRboY7nn1ypxIFeFntPlF4
FQsDj43QLwWyPntKHEtzBRL8xurgUBN8Q5N0s8p0544fAQjQMNRbcTa0B7rBMDBc
SLeCO5imfWCKoqMpgsy6vYMEG6KDA0Gh1gXxG8K28Kh8hjtGqEgqiNx2mna/H2ql
PRmP6zjzZN7IKw0KKP/32+IVQtQi0Cdd4Xn+GOdwiK1O5tmLOsbdJ1Fu/7xk9TND
TwIDAQABo4IBRjCCAUIwDwYDVR0TAQH/BAUwAwEB/zAOBgNVHQ8BAf8EBAMCAQYw
SwYIKwYBBQUHAQEEPzA9MDsGCCsGAQUFBzAChi9odHRwOi8vYXBwcy5pZGVudHJ1
c3QuY29tL3Jvb3RzL2RzdHJvb3RjYXgzLnA3YzAfBgNVHSMEGDAWgBTEp7Gkeyxx
+tvhS5B1/8QVYIWJEDBUBgNVHSAETTBLMAgGBmeBDAECATA/BgsrBgEEAYLfEwEB
ATAwMC4GCCsGAQUFBwIBFiJodHRwOi8vY3BzLnJvb3QteDEubGV0c2VuY3J5cHQu
b3JnMDwGA1UdHwQ1MDMwMaAvoC2GK2h0dHA6Ly9jcmwuaWRlbnRydXN0LmNvbS9E
U1RST09UQ0FYM0NSTC5jcmwwHQYDVR0OBBYEFHm0WeZ7tuXkAXOACIjIGlj26Ztu
MA0GCSqGSIb3DQEBCwUAA4IBAQAKcwBslm7/DlLQrt2M51oGrS+o44+/yQoDFVDC
5WxCu2+b9LRPwkSICHXM6webFGJueN7sJ7o5XPWioW5WlHAQU7G75K/QosMrAdSW
9MUgNTP52GE24HGNtLi1qoJFlcDyqSMo59ahy2cI2qBDLKobkx/J3vWraV0T9VuG
WCLKTVXkcGdtwlfFRjlBz4pYg1htmf5X6DYO8A4jqv2Il9DjXA6USbW1FzXSLr9O
he8Y4IWS6wY7bCkjCWDcRQJMEhg76fsO3txE+FiYruq9RUWhiF1myv4Q6W+CyBFC
Dfvp7OOGAN6dEOM4+qR9sdjoSYKEBpsr6GtPAQw4dy753ec5
-----END CERTIFICATE-----
---
Server certificate
subject=CN = www.matrix.org

issuer=C = US, O = Let's Encrypt, CN = R3

---
No client certificate CA names sent
Peer signing digest: SHA256
Peer signature type: RSA-PSS
Server Temp Key: X25519, 253 bits
---
SSL handshake has read 5218 bytes and written 382 bytes
Verification: OK
---
New, TLSv1.3, Cipher is TLS_AES_256_GCM_SHA384
Server public key is 4096 bit
Secure Renegotiation IS NOT supported
Compression: NONE
Expansion: NONE
No ALPN negotiated
Early data was not sent
Verify return code: 0 (ok)

I believe the SSL verification issue is somewhere related to python in the docker image.

Steps to reproduce

My synapse installation is probably pretty common, I use keycloak as authentication provider with a self signed certificate but I don't see any relation.

I get the error from Element when I try to list public rooms :

image

Version information

If not matrix.org:

MatthieuBarthel commented 2 years ago

I just tried to make a request from python inside the synapse container and it works :thinking:

root@6151c1073bd8:/# python
Python 3.9.10 (main, Mar  2 2022, 04:31:58) 
[GCC 10.2.1 20210110] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> import requests
>>> requests.get('https://matrix.org')
<Response [200]>
DMRobertson commented 2 years ago

Hi @MatthieuBarthel, have you changed any TLS-related settings in synapse's configuration?

MatthieuBarthel commented 2 years ago

Hi @DMRobertson, thanks your help. I don't see anything special in homeserver.yaml. Maybe it could be related to SSL_CERT_FILE environment variable which points to my CA public key ? Here my docker-compose.yml :

services:
  postgres:
    image: postgres:14
    environment:
      - POSTGRES_DB=synapse
      - POSTGRES_USER=synapse
      - POSTGRES_PASSWORD=xxxxx
      - POSTGRES_INITDB_ARGS=--locale C --encoding UTF8
    volumes:
      - ./postgresdata:/var/lib/postgresql/data

  synapse:
    image: matrixdotorg/synapse:v1.54.0 
    environment:
      - SYNAPSE_REPORT_STATS=no
      - SYNAPSE_SERVER_NAME=imatt.ch
      - SSL_CERT_FILE=/data/iMattRootCA.crt
    deploy:
      labels:
        - traefik.enable=true
        - traefik.http.routers.matrix.rule=Host(`matrix.imatt.ch`)
        - traefik.http.routers.matrix.entrypoints=websecure
        - traefik.http.routers.matrix.middlewares=securityHeaders@file
        - traefik.http.services.matrix.loadbalancer.server.port=8008
    networks:
      - default
      - traefik
    volumes:
      - ./synapse:/data

networks:
  traefik:
    external: true

If I remove this environment variable synapse won't start, so I cannot test easily with my current setup. I'll try to make a test installation in the next days. Thanks again

DMRobertson commented 2 years ago

I wonder if setting that environment var is somehow preventing openssl from using the system CA certs in /etc/ssl/certs. I don't think that should be the case from my reading of https://www.openssl.org/docs/manmaster/man3/SSL_CTX_load_verify_locations.html ---but perhaps I'm mistaken.

If I remove this environment variable synapse won't start

What error message(s) do you get if you do this?

MatthieuBarthel commented 2 years ago

What error message(s) do you get if you do this?

I says it cannot initialize the OIDC provider (which has the self signed certificate) :

matrix_synapse.1.wpn85n6vbswn@web9    | Traceback (most recent call last):
matrix_synapse.1.wpn85n6vbswn@web9    |   File "/usr/local/lib/python3.9/site-packages/synapse/app/_base.py", line 227, in wrapper
matrix_synapse.1.wpn85n6vbswn@web9    |     await cb(*args, **kwargs)
matrix_synapse.1.wpn85n6vbswn@web9    |   File "/usr/local/lib/python3.9/site-packages/synapse/app/homeserver.py", line 371, in start
matrix_synapse.1.wpn85n6vbswn@web9    |     await oidc.load_metadata()
matrix_synapse.1.wpn85n6vbswn@web9    |   File "/usr/local/lib/python3.9/site-packages/synapse/handlers/oidc.py", line 123, in load_metadata
matrix_synapse.1.wpn85n6vbswn@web9    |     raise Exception(
matrix_synapse.1.wpn85n6vbswn@web9    | Exception: Error while initialising OIDC provider 'oidc-keycloak'
matrix_synapse.1.wpn85n6vbswn@web9    | Error during startup:
matrix_synapse.1.wpn85n6vbswn@web9    | Traceback (most recent call last):
matrix_synapse.1.wpn85n6vbswn@web9    |   File "/usr/local/lib/python3.9/site-packages/synapse/handlers/oidc.py", line 119, in load_metadata
matrix_synapse.1.wpn85n6vbswn@web9    |     await p.load_metadata()
matrix_synapse.1.wpn85n6vbswn@web9    |   File "/usr/local/lib/python3.9/site-packages/synapse/handlers/oidc.py", line 436, in load_metadata
matrix_synapse.1.wpn85n6vbswn@web9    |     return await self._provider_metadata.get()
matrix_synapse.1.wpn85n6vbswn@web9    |   File "/usr/local/lib/python3.9/site-packages/synapse/util/caches/cached_call.py", line 136, in get
matrix_synapse.1.wpn85n6vbswn@web9    |     return await self._cachedcall.get()
matrix_synapse.1.wpn85n6vbswn@web9    |   File "/usr/local/lib/python3.9/site-packages/synapse/util/caches/cached_call.py", line 106, in get
matrix_synapse.1.wpn85n6vbswn@web9    |     self._result.raiseException()
matrix_synapse.1.wpn85n6vbswn@web9    |   File "/usr/local/lib/python3.9/site-packages/twisted/python/failure.py", line 475, in raiseException
matrix_synapse.1.wpn85n6vbswn@web9    |     raise self.value.with_traceback(self.tb)
matrix_synapse.1.wpn85n6vbswn@web9    |   File "/usr/local/lib/python3.9/site-packages/twisted/internet/defer.py", line 1656, in _inlineCallbacks
matrix_synapse.1.wpn85n6vbswn@web9    |     result = current_context.run(
matrix_synapse.1.wpn85n6vbswn@web9    |   File "/usr/local/lib/python3.9/site-packages/twisted/python/failure.py", line 489, in throwExceptionIntoGenerator
matrix_synapse.1.wpn85n6vbswn@web9    |     return g.throw(self.type, self.value, self.tb)
matrix_synapse.1.wpn85n6vbswn@web9    |   File "/usr/local/lib/python3.9/site-packages/synapse/util/caches/cached_call.py", line 126, in _wrapper
matrix_synapse.1.wpn85n6vbswn@web9    |     return await f()
matrix_synapse.1.wpn85n6vbswn@web9    |   File "/usr/local/lib/python3.9/site-packages/synapse/handlers/oidc.py", line 448, in _load_metadata
matrix_synapse.1.wpn85n6vbswn@web9    |     metadata_response = await self._http_client.get_json(url)
matrix_synapse.1.wpn85n6vbswn@web9    |   File "/usr/local/lib/python3.9/site-packages/synapse/http/client.py", line 591, in get_json
matrix_synapse.1.wpn85n6vbswn@web9    |     body = await self.get_raw(uri, args, headers=actual_headers)
matrix_synapse.1.wpn85n6vbswn@web9    |   File "/usr/local/lib/python3.9/site-packages/synapse/http/client.py", line 676, in get_raw
matrix_synapse.1.wpn85n6vbswn@web9    |     response = await self.request("GET", uri, headers=Headers(actual_headers))
matrix_synapse.1.wpn85n6vbswn@web9    |   File "/usr/local/lib/python3.9/site-packages/synapse/http/client.py", line 443, in request
matrix_synapse.1.wpn85n6vbswn@web9    |     response = await make_deferred_yieldable(request_deferred)
matrix_synapse.1.wpn85n6vbswn@web9    |   File "/usr/local/lib/python3.9/site-packages/twisted/internet/defer.py", line 857, in _runCallbacks
matrix_synapse.1.wpn85n6vbswn@web9    |     current.result = callback(  # type: ignore[misc]
matrix_synapse.1.wpn85n6vbswn@web9    |   File "/usr/local/lib/python3.9/site-packages/synapse/http/client.py", line 789, in _timeout_to_request_timed_out_error
matrix_synapse.1.wpn85n6vbswn@web9    |     raise RequestTimedOutError("Timeout waiting for response from remote server")
matrix_synapse.1.wpn85n6vbswn@web9    | synapse.http.RequestTimedOutError: 504: Timeout waiting for response from remote server

I wonder if setting that environment var is somehow preventing openssl from using the system CA certs in /etc/ssl/certs. I don't think that should be the case from my reading of https://www.openssl.org/docs/manmaster/man3/SSL_CTX_load_verify_locations.html ---but perhaps I'm mistaken.

Yes I can now confirm that, I removed OIDC config, removed SSL_CERT_FILE and created a local user, the SSL verification issue is gone.

I also think this not the normal behavior, SSL_CERT_FILE usage is documented here: https://github.com/matrix-org/synapse/blob/develop/docs/openid.md

MatthieuBarthel commented 2 years ago

I chained my CA with this list of certificates, which fixes the issue: https://raw.githubusercontent.com/certifi/python-certifi/master/certifi/cacert.pem

I still believe there is some bug because openssl verification and even requests from python were working in my tests with SSL_CERT_FILE set my CA.

DMRobertson commented 2 years ago

Can we double-check your homeserver.yaml config again? Do you have any of the following set?

MatthieuBarthel commented 2 years ago

No I have none of them set, here is my homeserver.yaml with comments/blank lines removed :

modules:
server_name: "imatt.ch"
pid_file: /data/homeserver.pid
public_baseurl: https://matrix.imatt.ch/
presence:
listeners:
  - port: 8008
    tls: false
    type: http
    x_forwarded: true
    resources:
      - names: [client, federation]
        compress: false
manhole_settings:
limit_remote_rooms:
templates:
retention:
caches:
  per_cache_factors:
database:
  name: psycopg2
  txn_limit: 10000
  args:
    host: postgres
    user: synapse
    password: xxx
    database: synapse
    cp_min: 5
    cp_max: 10
log_config: "/data/imatt.ch.log.config"
media_store_path: "/data/media_store"
url_preview_accept_language:
oembed:
turn_uris: [ "turns:turn.imatt.ch?transport=udp", "turns:turn.imatt.ch?transport=tcp" ]
turn_shared_secret: "xxx"
turn_user_lifetime: 86400000
turn_allow_guests: True
registration_shared_secret: "xxx"
account_threepid_delegates:
metrics_flags:
report_stats: false
room_prejoin_state:
macaroon_secret_key: "xxx"
form_secret: "xxx"
signing_key_path: "/data/imatt.ch.signing.key"
old_signing_keys:
trusted_key_servers:
  - server_name: "matrix.org"
saml2_config:
  sp_config:
  user_mapping_provider:
    config:
oidc_providers:
  - idp_id: keycloak
    idp_name: "Keycloak Server"
    issuer: "https://keycloak.imatt.ch/auth/realms/iMatt"
    client_id: "synapse"
    client_secret: "xxx"
    scopes: ["openid", "profile"]
    user_mapping_provider:
      config:
        localpart_template: "{{ user.preferred_username }}"
        display_name_template: "{{ user.name }}"
cas_config:
sso:
password_config:
   policy:
ui_auth:
email:
  email:
  smtp_host: smtp.example.com
  smtp_port: 465
  smtp_user: username
  smtp_pass: xxxx
  notif_from: "Matrix <xxx@imatt.ch>"
  require_transport_security: true
push:
user_directory:
stats:
opentracing:
redis:

Many thanks :)

DMRobertson commented 2 years ago

I still believe there is some bug because openssl verification and even requests from python were working in my tests with SSL_CERT_FILE set my CA.

Note that the requests module trusts the certificates in certifi. One has to set the environment variable REQUESTS_CA_BUNDLE to use a different set of CA certs.

DMRobertson commented 2 years ago

I'm not fully sure what's going on here. My reading of the OpenSSL docs and source code suggests that the SSL_CERT_FILE works in addition to the SSL_CERT_PATH.

Frustratingly, I can't see anything in Synapse or Twisted that would explain the behaviour you're seeing.

A few desperate suggestions:

MatthieuBarthel commented 2 years ago

Thanks for the suggestions, I tried to dig but without success unfortunately.

You could try to inject your certificate into the container in /etc/ssl/certs and not set SSL_CERT_FILE. I'm not sure if that's easy to do with your Docker setup.

I did it by mounting my CA as a docker volume, it didn't work (keycloak SSL verification failed on start). Then I did it by copying my CA in a custom docker image (cloned the repo and made the change in docker/Dockerfile in both build stages before "pip install"), the result is the same. I also tried to build it with multiple python versions.

You could try explicitly setting SSL_CERT_PATH=/etc/ssl/certs in addition to SSL_CERT_FILE=/data/....

I tried but it didn't change anything.

I made a few more tests from the container, with just SSL_CERT_FILE pointing to my CA :

root@35b0f204a09a:/# echo $SSL_CERT_FILE
/data/iMattRootCA.crt

Openssl seems to verify well both matrix.org and keycloak.imatt.ch (signed by my CA) :

root@35b0f204a09a:/# openssl s_client -showcerts -connect keycloak.imatt.ch:443
...
---
SSL handshake has read 16575 bytes and written 414 bytes
Verification: OK
---

root@35b0f204a09a:/# openssl s_client -showcerts -connect matrix.org:443
---
SSL handshake has read 5218 bytes and written 382 bytes
Verification: OK
---

From a python console, only Matrix gets verified :

root@35b0f204a09a:/# python
Python 3.9.10 (main, Mar  2 2022, 04:31:58) 
[GCC 10.2.1 20210110] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> import requests
>>> requests.get('https://matrix.org', verify=True)
<Response [200]>
>>> requests.get('https://keycloak.imatt.ch', verify=True)
...
urllib3.exceptions.MaxRetryError: HTTPSConnectionPool(host='keycloak.imatt.ch', port=443): Max retries exceeded with url: / (Caused by SSLError(SSLCertVerificationError(1, '[SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: unable to get local issuer certificate (_ssl.c:1129)')))

# if I specify my CA path explicitly, only my domain gets verified :
>>> requests.get('https://keycloak.imatt.ch',verify='/data/iMattRootCA.crt')
<Response [200]>
>>> requests.get('https://matrix.org',verify='/data/iMattRootCA.crt')
...
ssl.SSLCertVerificationError: [SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: unable to get local issuer certificate (_ssl.c:1129)

With SSL_CERT_PATH=/etc/ssl/certs the results are the same.

I saw in the synapse source code that a library called grequests is also used in addition of requests, I tried to import it but it fails (ModuleNotFoundError), I don't know if it is possible to open an python console "scoped" to the synapse environment.

MatthieuBarthel commented 2 years ago

oops my bad, I forgot to run update-ca-certificates after copying the CA, it's working with the custom docker image

anoadragon453 commented 2 years ago

It is possible to open a python interpreter inside a live, running Synapse process through use of Synapse's manhole, which is probably the most accurate environment. Do be careful if you're experimenting inside your production instance though :)

I saw in the synapse source code that a library called grequests is also used in addition of requests, I tried to import it but it fails (ModuleNotFoundError)

As far as I can see, this only used for a script (jitsimeetbridge.py) in the contrib directory. Synapse won't be using it for federation requests.

MatthieuBarthel commented 2 years ago

Thanks @anoadragon453, I restored my previous configuration (using SSL_CERT_FILE), so federation was broken again, opened an interpreter with manhole and I tried to request matrix.org and keycloak.imatt.ch, the results were the same that I had with the python interpreter (only matrix.org is verified), this is strange because in fact matrix boots fine with my custom certificate and fails to connect to matrix.org.

A simple way to fix the issue would be to update the doc to remove SSL_CERT_FILE usage, people would simply have to copy custom CA(s) into /usr/local/share/ca-certificates/, and to run update-ca-certificates before starting synapse.

In the docker image, it would need to be run in entrypoint.sh before the start command, it could be triggered only if a specific environment variable exists.

If this solution seems fine I believe I can submit a MR.

erikjohnston commented 2 years ago

So I think the issue here is related to Twisted doing something odd with the default trust stores: https://twistedmatrix.com/trac/ticket/9209#ticket. It notes that it should work fine if we install ca-certificates, which the docker image doesn't currently. @MatthieuBarthel could you try installing ca-certificates in the docker image and seeing if that fixes things? If it does then that feels like the "right" fix to me

MatthieuBarthel commented 2 years ago

I tried it, unfortunately it still gives the same result: my CA works but not the rest. FYI, I've just set SSL_CERT_FILE and used this custom image:

FROM matrixdotorg/synapse:v1.57.0

RUN apt-get update && \
    apt-get install -y ca-certificates && \
    rm -rf /var/lib/apt/lists/*

Do not hesitate if you want me to do others tests. Thanks :)

nakoo commented 2 years ago

I tried it, unfortunately it still gives the same result: my CA works but not the rest. FYI, I've just set SSL_CERT_FILE and used this custom image:

FROM matrixdotorg/synapse:v1.57.0

RUN apt-get update && \
    apt-get install -y ca-certificates && \
    rm -rf /var/lib/apt/lists/*

Do not hesitate if you want me to do others tests. Thanks :)

I have similar issue with Dendrite (the new matrix server). My current guess is matrix doesn't work very well with docker overlay network. I haven't faced any issue when running bridge mode.