Certbot Nginx Plugin Failing Intermittently - Issues Persist After Updating to 2.11.0
Environment
- Certbot version: 2.11.0 (upgraded during troubleshooting)
- Web server: Nginx
- Application: Django
- Installation method: certbot --nginx installer
Background
We've been successfully using Let's Encrypt with our platform's custom domain functionality for about 1.5 years without issues. Our setup involves Nginx and Django, with certificates managed via the Certbot Nginx plugin.
Current Issue
Over the past couple weeks, we've been experiencing intermittent failures with the Certbot Nginx installer. In an attempt to resolve these issues, we upgraded to Certbot 2.11.0, but the problems still persist.
Key observations:
- Some domains fail consistently while others work perfectly
- Let's Debug tool confirms correct DNS configuration for failing domains
- No recent changes to our Nginx configurations
- Manual verification using webroot method succeeds, but we'd prefer to maintain our current approach
Troubleshooting Steps Taken
- Upgraded Certbot to latest version (2.11.0)
- Verified DNS configuration using Let's Debug tool
- Confirmed Nginx configurations are unchanged
- Tested certificate issuance with webroot method (successful)
- Verified process works for some domains but fails for others under identical conditions
Questions
- Is there a way to prevent Certbot from cleaning up its temporary files/configurations after challenges? This would help us debug the Nginx configuration changes it makes during the ACME challenge.
- Has anyone encountered similar intermittent failures with the Nginx plugin lately?
Any guidance on additional debugging approaches or potential solutions would be greatly appreciated.