Intermittent failures during finalize: Unable to meet CA SCT embedding requirements

mmelo-yottaa · April 17, 2023, 12:07pm

Hi!

Our nightly regressions test our ACME client, one test creates then revokes a certificate using LE Staging. The tests have run without errors for many weeks, our client hasn't changed during that time.

Starting 16 April we are seeing intermittent failures during certificate create, while waiting for finalize, specifically this:

data:{
  "status": "invalid",
  "expires": "2023-04-23T19:38:53Z",
  "identifiers": [
    {
      "type": "dns",
      "value": "www.appoptimization.com"
    }
  ],
  "authorizations": [
    "https://acme-staging-v02.api.letsencrypt.org/acme/authz-v3/5952795884"
  ],
  "finalize": "https://acme-staging-v02.api.letsencrypt.org/acme/finalize/18175989/8297400174",
  "error": {
    "type": "urn:ietf:params:acme:error:serverInternal",
    "detail": "Error finalizing order :: Unable to meet CA SCT embedding requirements",
    "status": 500
  }
}

Most of the certificate ops executed by our nightly regressions using LE Stage pass, however there have been 7 of the finalizing-order failures since 2023-04-16 19:40:27 UTC. The client polls for finalize status to change from status "processing" to a completed status - in failure cases it generally has to wait about 90 seconds before the 500 error is returned.

Any ideas why these intermittent failures have shown in the past two days?

Thanks!

petercooperjr · April 17, 2023, 12:39pm

@lestaff: Throwing this one at you.

mcpherrinm · April 17, 2023, 2:45pm

I've identified the issue with CT submission in staging and will resolve this today.

mmelo-yottaa · April 17, 2023, 2:48pm

thank you !!

mcpherrinm · April 17, 2023, 4:27pm

This is fixed now.

The configuration for the "Sapling 2023h2" CT log was incorrect, which resulted in all our SCTs going to other test logs. Two of those other test logs became overloaded, causing slow staging finalizes and intermittent failures.

I've correctly configured Sapling 2023h2, and everything should be better now. This would have started at 2023-07-15T00:00Z when the Sapling 2023h1 log shard ended.

orangepizza · April 17, 2023, 4:35pm

kinda surprising even staging certs are capable of overloading other logs.
how much LE stageing signs per day?

mmelo-yottaa · April 17, 2023, 5:21pm

i manually ran the regression test that failed per above, it still fails but much quicker now. once domain validation succeeds and finalize order starts, the client receives

data:{
  "status": "processing",
  "expires": "2023-04-24T17:11:44Z",
  "identifiers": [
    {
      "type": "dns",
      "value": "www.appoptimization.com"
    }
  ],
  "authorizations": [
    "https://acme-staging-v02.api.letsencrypt.org/acme/authz-v3/5952795884"
  ],
  "finalize": "https://acme-staging-v02.api.letsencrypt.org/acme/finalize/18175989/8311141904"
}

then a retry 5 seconds later receives:

data:{
  "status": "invalid",
  "expires": "2023-04-24T17:11:44Z",
  "identifiers": [
    {
      "type": "dns",
      "value": "www.appoptimization.com"
    }
  ],
  "authorizations": [
    "https://acme-staging-v02.api.letsencrypt.org/acme/authz-v3/5952795884"
  ],
  "finalize": "https://acme-staging-v02.api.letsencrypt.org/acme/finalize/18175989/8311141904",
  "error": {
    "type": "urn:ietf:params:acme:error:serverInternal",
    "detail": "Error finalizing order",
    "status": 500
  }
}

any ideas? thanks!

mcpherrinm · April 17, 2023, 5:30pm

Yeah, it's still (basically) the same problem, just we're failing certificate linting after successfully submitting now. I missed a spot in my config change and posted here a little too early. Will be a few more minutes.

mmelo-yottaa · April 17, 2023, 5:30pm

np, and no rush, appreciate the attention you are giving this!

Osiris · April 17, 2023, 5:38pm

Sapling is also a staging CT log, so maybe LE has allocated less resources to it? Just speculating here though

mcpherrinm · April 17, 2023, 5:54pm

I had to restart a service to pick up the new configs, I think it's good now. I will verify.

Staging submits to three "Log Operators":

Sapling, our public CT log. This is the one that we ran off the end of the valid configured shards and I fixed.
Google's Testflume log. This one has been fine.
Some internal-only test logs, based on boulder's ct-test-srv. These are the ones that started getting very slow. I'm not sure why yet.

mmelo-yottaa · April 17, 2023, 6:55pm

three runs of the previously-failed regressions, three PASS'es now, it looks much better to me... thanks again!

zrohop · April 21, 2023, 7:53am

Could you also post the public key of the Sapling 2023h2 log? I couldn't find it on ct-logs.

mcpherrinm · April 21, 2023, 7:20pm

This is in the format of google's CT log list json files

        {
          "description": "Let's Encrypt 'Sapling2023h2' log",
          "log_id": "7audHd2Dc5Wf9SqI5Gu0vMPEzE12imDM/042LX+41mg=",
          "key": "MFkwEwYHKoZIzj0CAQYIKoZIzj0DAQcDQgAEbdCykRsTPRgfjKVQvINRLJk3gy+2qNKOU48bo/sWO0ko75S92C+PBDxsqMEd0YpCYYLogCt2LAK/U4H7UwHsjA==",
          "url": "https://sapling.ct.letsencrypt.org/2023h2/",
          "mmd": 86400,
          "temporal_interval": {
            "start_inclusive": "2023-06-15T00:00:00Z",
            "end_exclusive": "2024-01-15T00:00:00Z"
          },
          "state": {
            "usable": {
              "timestamp": "2023-05-01T00:00:00Z"
            }
          }
        },

I think somebody has a ticket to update the website, it just hasn't happened yet.

system · May 21, 2023, 7:21pm

This topic was automatically closed 30 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
What do you do when an SCT submission fails Issuance Tech	3	2166	June 30, 2018
Unable to meet CA SCT embedding requirements Help	7	858	September 11, 2022
Unable to renew: "Unable to meet CA SCT embedding requirements" Help	5	2657	August 29, 2018
Error finalizing order Server	5	1381	November 2, 2018
Error finalizing order :: Unable to meet CA SCT embedding requirement Help	5	183	June 28, 2025

Intermittent failures during finalize: Unable to meet CA SCT embedding requirements

Related topics