Hi everyone,
I am seeing an intermittent backup issue and would like to understand what might be causing it.
I have a pool of 5 servers. One of them is the master, and it also stores the backups. Backups generally work fine across all servers and sites. However, once in a while a single site fails while all other sites on the same server back up successfully.
The failure is not consistent. On the next run, the same site may back up successfully even though nothing has changed.
This is the error message I get when it fails:
Enhance backup failed: Error { kind: Internal, msg: "Cannot connect to https://IP:50000, check firewall" }
What makes this confusing is:
- The firewall configuration is unchanged
- Other sites on the same server back up successfully at the same time
- The issue happens randomly and only affects one site at a time
Has anyone seen similar behavior in a multi-server setup with a master node? Any ideas on what logs or checks I should focus on would be appreciated.
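One check I am considering, in case it is useful to anyone answering: since the error points at a failed connection to port 50000, repeated plain TCP connection attempts from the affected server to the master might show whether the port is intermittently unreachable at the network level. Below is a minimal sketch of that idea in Python. The function names `probe` and `probe_repeatedly` are my own, and this only tests whether a TCP connection can be opened. It does not use or emulate anything Enhance-specific, so a clean result here would not rule out a problem higher up (TLS, the backup service itself, load on the master).

```python
import socket
import time


def probe(host, port, timeout=3.0):
    """Attempt a single TCP connection and return (ok, detail)."""
    start = time.monotonic()
    try:
        # create_connection raises OSError (or a subclass such as a
        # timeout or "connection refused") when the connection fails.
        with socket.create_connection((host, port), timeout=timeout):
            elapsed = time.monotonic() - start
            return True, f"connected in {elapsed:.3f}s"
    except OSError as exc:
        return False, f"{type(exc).__name__}: {exc}"


def probe_repeatedly(host, port, attempts=5, delay=1.0, timeout=3.0):
    """Probe several times and return a list of (timestamp, ok, detail)."""
    results = []
    for i in range(attempts):
        ok, detail = probe(host, port, timeout)
        results.append((time.strftime("%Y-%m-%d %H:%M:%S"), ok, detail))
        if i < attempts - 1:
            time.sleep(delay)
    return results
```

Something like `probe_repeatedly("MASTER_IP", 50000, attempts=60, delay=5.0)` (with `MASTER_IP` replaced by the real address) could be left running around the backup window, and the timestamps of any failed attempts compared against the time of the failed backup.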
Thanks in advance.