Health Checks
All health check endpoints operate on port 2346 by default. This behavior can
be changed via the --healthcheck-address flag. All endpoints expose a
verbose optional query parameter that can be used to get more detailed
information about the node's health status.
Endpoints return a 200 OK status code if the node is healthy and a
503 Service Unavailable status code if the node is not healthy.
- /livez
- /readyz
- /healthz
/livez
Liveness probes determine whether or not an application running in a container is in a healthy state. The idea behind a liveness probe is that it fails for prolonged period of time, then the application should be restarted. In our case, we require:
- The node is not in an error state (i.e., boot-looping)
- At least 1 peer is connected (without peers, the node is isolated and cannot sync)
If any of these conditions are not met, the node is not healthy. If this happens for a prolonged period of time, the application should be restarted.
curl "http://127.0.0.1:2346/livez?verbose"
Sample lively response:
[+] sync ok
[+] peers connected⏎
Sample not lively response:
[+] sync ok
[!] no peers connected
/readyz
Readiness probes determine whether or not a container is ready to serve requests. The goal is to determine if the application is fully prepared to accept traffic. In our case, we require:
- The node is in sync with the network
- The current epoch of the node is not too far behind the network
- The RPC server is running
- The Ethereum mappings are up to date (if chain indexer in enabled)
If any of these conditions are not met, the node is not ready to serve requests.
curl "127.0.0.1:2346/readyz?verbose"
Sample ready response:
[+] sync complete
[+] epoch up to date
[+] rpc server running
[+] f3 running⏎
Sample not ready response:
[!] sync incomplete
[!] epoch outdated
[+] rpc server running
[+] f3 running⏎
/healthz
This endpoint is a combination of the /livez and /readyz endpoints, except
that the node doesn't have to be fully synced. Deprecated in the Kubernetes
world, but still used in some setups.
curl "http://127.0.0.1:2346/healthz?verbose"
Sample healthy response:
[+] epoch up to date
[+] rpc server running
[+] sync ok
[+] peers connected
[+] f3 running⏎
Sample unhealthy response:
[!] epoch outdated
[+] rpc server running
[+] sync ok
[+] peers connected
[+] f3 running⏎