Let's Encrypt's DNS resolution process

jeromegn · May 31, 2022, 1:25pm

Hey there!

Would it be possible to get a detailed explanation of how Let's Encrypt resolves hostnames via DNS?

We have to pre-verify certificate authorizations before asking Let's Encrypt to verify them. Else we'd run into rate limits very fast (and we did, at first). So essentially we have to "emulate" how Let's Encrypt resolves hostnames.

Right now we do the following: we use both the SOA and NS of a hostname to resolve a hostname. This works in most cases, but sometimes it doesn't and yet Let's Encrypt is still able to verify an authorization if I manually push it through. This led me to believe LE is doing something differently.

For example, we had one failing our pre-verification check this morning because the SOA timed out.

Do you use the SOA to resolve the hostname? If so, do you have a timeout setup? If this fails, do you rely on the domain's NS only?

This would help us relieve some pain for our users.

jvanasco · May 31, 2022, 2:28pm

Have you seen https://unboundtest.com ? It's an unofficial system put up by ISRG staff. If you search "unbound" in this forum, you'll see a few threads that should answer your question.

petercooperjr · May 31, 2022, 3:15pm

I don't believe they look at the SOA at all to find the authoritative servers, it just uses the NS records as delegated from the DNS root.

If you want the really detailed specifics, I think what you'd need to do is to look at Unbound and at the Boulder source code.

github.com

letsencrypt/boulder/blob/main/bdns/dns.go

package bdns

import (
	"context"
	"encoding/base64"
	"errors"
	"fmt"
	"net"
	"strconv"
	"strings"
	"sync"
	"time"

	"github.com/jmhodges/clock"
	"github.com/miekg/dns"
	"github.com/prometheus/client_golang/prometheus"

	blog "github.com/letsencrypt/boulder/log"
	"github.com/letsencrypt/boulder/metrics"
)

This file has been truncated. show original

webprofusion · June 2, 2022, 3:29am

Worth stating the obvious that all of your NS for the domain have to give a valid response, not just one. So if you are writing changes to DNS before validation you need to ensure all NS have the same response (and they can all reply to a CAA record query - so no NXDOMAIN and no SERVFAILs will be tolerated during validation).

Yesterday one of my users who had a domain with Google Cloud DNS was returning a SERVFAIL response on the CAA record check, which was presumably a transient failure behind the scenes at Google, so it seems everyone is capable of getting this stuff wrong.

jeromegn · June 7, 2022, 12:12pm

I switched our logic to only check NS (and not SOA anymore).

It's working fine for us, however there are still issues from time to time.

This morning, our pre-verification process failed because we couldn't resolve the NS servers to IPs. We recursively resolve them from the root servers.

ns1.theserver.com.au
ns2.theserver.com.au
ns3.theserver.com.au

None of our attempts returned any record for these hostnames from our servers. Same issue from my local computer (using different DNS servers).

I still tried to push through the authoritzation via Let's Encrypt. It did work. This is puzzling me.

I can resolve the hostname (appeal.the9livesproject.org) to an IP, no problem. Is that all we should be testing? Perhaps in addition to CAA records...

rg305 · June 7, 2022, 12:55pm

What shows?:

nslookup ns1.theserver.com.au r.au
nslookup ns2.theserver.com.au s.au
nslookup ns3.theserver.com.au t.au

MikeMcQ · June 7, 2022, 1:01pm

There will always be transient problems when comms are involved. Taking a cue from the Let's Debug test site, you could check

Resolve hostname IP to A and/or AAAA
Connect to http://(domain)/.well-known/acme-challenge/YourSpecialToken
You should expect http 404. You could warn about any other http codes but still try cert request. If request times out then maybe not even try cert request. I assume you are doing http challenges. Note LE Servers will connect using IPv6 if AAAA record in hostname DNS else IPv4 if just A. This check is most likely to have transient errors.
If find CAA record check valid value

Let's Debug does more than that but its purpose is different. You might consider checking Let's Encrypt Server operational status like it does too.

Personally, I would only check the most likely items causing problems. You are more familiar with your clients than I am so know those best.

system · July 7, 2022, 1:01pm

This topic was automatically closed 30 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
How does DNS Validation work behind the scenes Help	3	1092	April 25, 2018
Is Let's Encrypt right for me? Help	6	1047	May 17, 2018
Let's Encrypt can't seem to resolve my domain Help	4	1862	February 18, 2018
What's DNS uses Let's Encrypt to generate certs? Help	6	921	December 22, 2021
Help getting let's encrypt to validate domain registered with namecheap on Synology Help	17	2854	April 16, 2020

Let's Encrypt's DNS resolution process

Related topics