We have a CI pipeline that runs once a day to check if it needs to renew any of our certificates. In the last couple of days, it did want to renew at least one, a cert with several Subject Alternative Names which were wildcards. However the challenges kept failing. We tried to diagnose if there was something was wrong on our end, but didn't really find anything. For example I set up the same TXT record by hand, and didn't see any issues in looking it up from various public DNS servers.
The job ran again this morning, and now it went fine; nothing changed in our setup since the last failure.
So I was wondering if perhaps there was some known failure in looking up DNS records from the Let's Encrypt side, that affected us in the last few days.
My domain is: [*.cloud.syseleven.net, *.infra.sys11cloud.net, *.infrabk.sys11cloud.net, *.infrabl.sys11cloud.net, *.infrafe.sys11cloud.net, cloud.syseleven.de]
I ran this command:
["lego", "--accept-tos", "--dns", "designate", "--path", "/tmp/lego", "--dns.resolvers", "8.8.8.8", "--dns.resolvers", "1.1.1.1", "--server=https://acme-v02.api.letsencrypt.org/directory", "--email", "noreply@syseleven.de", "--key-type", "rsa4096", "-d", "*.cloud.syseleven.net", "-d", "*.infra.sys11cloud.net", "-d", "*.infrabk.sys11cloud.net", "-d", "*.infrabl.sys11cloud.net", "-d", "*.infrafe.sys11cloud.net", "-d", "cloud.syseleven.de", "renew", "--preferred-chain", "ISRG Root X1"]
It produced this output:
(output from ansible; line breaks added for better readability)
fatal: [localhost]: FAILED! => {"changed": true, "cmd": ["lego", "--accept-tos", "--dns", "designate", "--path", "/tmp/lego", "--dns.resolvers", "8.8.8.8", "--dns.resolvers", "1.1.1.1", "--server=https://acme-v02.api.letsencrypt.org/directory", "--email", "noreply@syseleven.de", "--key-type", "rsa4096", "-d", "*.cloud.syseleven.net", "-d", "*.infra.sys11cloud.net", "-d", "*.infrabk.sys11cloud.net", "-d", "*.infrabl.sys11cloud.net", "-d", "*.infrafe.sys11cloud.net", "-d", "cloud.syseleven.de", "renew", "--preferred-chain", "ISRG Root X1"], "delta": "0:08:24.702166", "end": "2022-11-30 16:16:11.198601", "msg": "non-zero return code", "rc": 1, "start": "2022-11-30 16:07:46.496435", "stderr": "2022/11/30 16:07:47 [INFO] [*.cloud.syseleven.net] acme: Trying renewal with 731 hours remaining
2022/11/30 16:07:47 [INFO] renewal: random delay of 7m36.394249225s
2022/11/30 16:15:24 [INFO] [*.cloud.syseleven.net, *.infra.sys11cloud.net, *.infrabk.sys11cloud.net, *.infrabl.sys11cloud.net, *.infrafe.sys11cloud.net, cloud.syseleven.de] acme: Obtaining bundled SAN certificate
2022/11/30 16:15:25 [INFO] [*.cloud.syseleven.net] AuthURL: https://acme-v02.api.letsencrypt.org/acme/authz-v3/182124365647
2022/11/30 16:15:25 [INFO] [*.infra.sys11cloud.net] AuthURL: https://acme-v02.api.letsencrypt.org/acme/authz-v3/182218481187
2022/11/30 16:15:25 [INFO] [*.infrabk.sys11cloud.net] AuthURL: https://acme-v02.api.letsencrypt.org/acme/authz-v3/182182570677
2022/11/30 16:15:25 [INFO] [*.infrabl.sys11cloud.net] AuthURL: https://acme-v02.api.letsencrypt.org/acme/authz-v3/182124365677
2022/11/30 16:15:25 [INFO] [*.infrafe.sys11cloud.net] AuthURL: https://acme-v02.api.letsencrypt.org/acme/authz-v3/182182570687
2022/11/30 16:15:25 [INFO] [cloud.syseleven.de] AuthURL: https://acme-v02.api.letsencrypt.org/acme/authz-v3/182316532597
2022/11/30 16:15:25 [INFO] [*.cloud.syseleven.net] acme: authorization already valid; skipping challenge
2022/11/30 16:15:25 [INFO] [*.infrabl.sys11cloud.net] acme: authorization already valid; skipping challenge
2022/11/30 16:15:25 [INFO] [*.infra.sys11cloud.net] acme: authorization already valid; skipping challenge
2022/11/30 16:15:25 [INFO] [*.infrabk.sys11cloud.net] acme: authorization already valid; skipping challenge
2022/11/30 16:15:25 [INFO] [*.infrafe.sys11cloud.net] acme: authorization already valid; skipping challenge
2022/11/30 16:15:25 [INFO] [cloud.syseleven.de] acme: Could not find solver for: tls-alpn-01
2022/11/30 16:15:25 [INFO] [cloud.syseleven.de] acme: Could not find solver for: http-01
2022/11/30 16:15:25 [INFO] [cloud.syseleven.de] acme: use dns-01 solver
2022/11/30 16:15:25 [INFO] [cloud.syseleven.de] acme: Preparing to solve DNS-01
2022/11/30 16:15:25 [INFO] [cloud.syseleven.de] acme: Trying to solve DNS-01
2022/11/30 16:15:26 [INFO] [cloud.syseleven.de] acme: Checking DNS record propagation using [8.8.8.8:53 1.1.1.1:53]
2022/11/30 16:15:36 [INFO] Wait for propagation [timeout: 10m0s, interval: 10s]
2022/11/30 16:15:36 [INFO] [cloud.syseleven.de] acme: Waiting for DNS record propagation.
2022/11/30 16:15:46 [INFO] [cloud.syseleven.de] acme: Waiting for DNS record propagation.
2022/11/30 16:15:56 [INFO] [cloud.syseleven.de] acme: Waiting for DNS record propagation.
2022/11/30 16:16:09 [INFO] [cloud.syseleven.de] acme: Cleaning DNS-01 challenge
2022/11/30 16:16:10 [INFO] Skipping deactivating of valid auth: https://acme-v02.api.letsencrypt.org/acme/authz-v3/182124365647
2022/11/30 16:16:10 [INFO] Skipping deactivating of valid auth: https://acme-v02.api.letsencrypt.org/acme/authz-v3/182218481187
2022/11/30 16:16:10 [INFO] Skipping deactivating of valid auth: https://acme-v02.api.letsencrypt.org/acme/authz-v3/182182570677
2022/11/30 16:16:10 [INFO] Skipping deactivating of valid auth: https://acme-v02.api.letsencrypt.org/acme/authz-v3/182124365677
2022/11/30 16:16:10 [INFO] Skipping deactivating of valid auth: https://acme-v02.api.letsencrypt.org/acme/authz-v3/182182570687
2022/11/30 16:16:11 [INFO] Deactivating auth: https://acme-v02.api.letsencrypt.org/acme/authz-v3/182316532597
2022/11/30 16:16:11 error: one or more domains had a problem:
[cloud.syseleven.de] acme: error: 400 :: urn:ietf:params:acme:error:dns :: DNS problem: SERVFAIL looking up TXT for _acme-challenge.cloud.syseleven.de - the domain's nameservers may be malfunctioning", "stderr_lines":
["2022/11/30 16:07:47
(and the same output basically again)
My web server is (include version): N/A, since we do DNS challenges
The operating system my web server runs on is (include version):
My hosting provider, if applicable, is: we are the hosting provider
I can login to a root shell on my machine (yes or no, or I don't know): yes, in theory we can ssh into the CI job but it's a pain
I'm using a control panel to manage my site (no, or provide the name and version of the control panel): no
The version of my client is (e.g. output of certbot --version
or certbot-auto --version
if you're using Certbot): We don't use certbot but LEGO and we used v4.9.1.
Here is output from today, which worked:
TASK [lego : Renew certificate for global-wildcard] ****************************
task path: /builds/openstack/cert-automation/ansible/roles/lego/tasks/renew-certificates.yml:76
changed: [localhost] => {"changed": true, "cmd": ["lego", "--accept-tos", "--dns", "designate", "--path", "/tmp/lego", "--dns.resolvers", "8.8.8.8", "--dns.resolvers", "1.1.1.1", "--server=https://acme-v02.api.letsencrypt.org/directory", "--email", "noreply@syseleven.de", "--key-type", "rsa4096", "-d", "*.cloud.syseleven.net", "-d", "*.infra.sys11cloud.net", "-d", "*.infrabk.sys11cloud.net", "-d", "*.infrabl.sys11cloud.net", "-d", "*.infrafe.sys11cloud.net", "-d", "cloud.syseleven.de", "renew", "--preferred-chain", "ISRG Root X1"], "delta": "0:02:37.323749", "end": "2022-12-01 04:09:11.576960", "rc": 0, "start": "2022-12-01 04:06:34.253211", "stderr": "2022/12/01 04:06:35 [INFO] [*.cloud.syseleven.net] acme: Trying renewal with 719 hours remaining
2022/12/01 04:06:35 [INFO] renewal: random delay of 1m45.643985623s
2022/12/01 04:08:21 [INFO] [*.cloud.syseleven.net, *.infra.sys11cloud.net, *.infrabk.sys11cloud.net, *.infrabl.sys11cloud.net, *.infrafe.sys11cloud.net, cloud.syseleven.de] acme: Obtaining bundled SAN certificate
2022/12/01 04:08:22 [INFO] [*.cloud.syseleven.net] AuthURL: https://acme-v02.api.letsencrypt.org/acme/authz-v3/182124365647
2022/12/01 04:08:22 [INFO] [*.infra.sys11cloud.net] AuthURL: https://acme-v02.api.letsencrypt.org/acme/authz-v3/182218481187
2022/12/01 04:08:22 [INFO] [*.infrabk.sys11cloud.net] AuthURL: https://acme-v02.api.letsencrypt.org/acme/authz-v3/182182570677
2022/12/01 04:08:22 [INFO] [*.infrabl.sys11cloud.net] AuthURL: https://acme-v02.api.letsencrypt.org/acme/authz-v3/182124365677
2022/12/01 04:08:22 [INFO] [*.infrafe.sys11cloud.net] AuthURL: https://acme-v02.api.letsencrypt.org/acme/authz-v3/182182570687
2022/12/01 04:08:22 [INFO] [cloud.syseleven.de] AuthURL: https://acme-v02.api.letsencrypt.org/acme/authz-v3/182511576557
2022/12/01 04:08:22 [INFO] [*.cloud.syseleven.net] acme: authorization already valid; skipping challenge
2022/12/01 04:08:22 [INFO] [*.infrabl.sys11cloud.net] acme: authorization already valid; skipping challenge
2022/12/01 04:08:22 [INFO] [*.infra.sys11cloud.net] acme: authorization already valid; skipping challenge
2022/12/01 04:08:22 [INFO] [*.infrabk.sys11cloud.net] acme: authorization already valid; skipping challenge
2022/12/01 04:08:22 [INFO] [*.infrafe.sys11cloud.net] acme: authorization already valid; skipping challenge
2022/12/01 04:08:22 [INFO] [cloud.syseleven.de] acme: Could not find solver for: tls-alpn-01
2022/12/01 04:08:22 [INFO] [cloud.syseleven.de] acme: Could not find solver for: http-01
2022/12/01 04:08:22 [INFO] [cloud.syseleven.de] acme: use dns-01 solver
2022/12/01 04:08:22 [INFO] [cloud.syseleven.de] acme: Preparing to solve DNS-01
2022/12/01 04:08:23 [INFO] [cloud.syseleven.de] acme: Trying to solve DNS-01
2022/12/01 04:08:23 [INFO] [cloud.syseleven.de] acme: Checking DNS record propagation using [8.8.8.8:53 1.1.1.1:53]
2022/12/01 04:08:33 [INFO] Wait for propagation [timeout: 10m0s, interval: 10s]
2022/12/01 04:08:33 [INFO] [cloud.syseleven.de] acme: Waiting for DNS record propagation.
2022/12/01 04:08:43 [INFO] [cloud.syseleven.de] acme: Waiting for DNS record propagation.
2022/12/01 04:08:53 [INFO] [cloud.syseleven.de] acme: Waiting for DNS record propagation.
2022/12/01 04:09:09 [INFO] [cloud.syseleven.de] The server validated our request
2022/12/01 04:09:09 [INFO] [cloud.syseleven.de] acme: Cleaning DNS-01 challenge
2022/12/01 04:09:09 [INFO] [*.cloud.syseleven.net, *.infra.sys11cloud.net, *.infrabk.sys11cloud.net, *.infrabl.sys11cloud.net, *.infrafe.sys11cloud.net, cloud.syseleven.de] acme: Validations succeeded; requesting certificates
2022/12/01 04:09:11
[INFO] [*.cloud.syseleven.net] Server responded with a certificate for the preferred certificate chains \"ISRG Root X1\".", "stderr_lines": ["2022/12/01 04:06:35