Skip to main content
Version: Current

Operations - Consul Cluster

The Consul cluster mode uses Hashicorp's Consul for service discovery.

Combining this with the use of an MQTT broker gives you a dynamically scalable Genesis app with automatic failover.

Prerequisites

Services

To operate the Genesis low-code platform within an HA setup using Hashicorp's Consul for Clustering, the following services must be available:

  • A Database - A database needs to be hosted independently from the platform
  • A Consul Cluster - A Consul cluster with an agent on each of the hosts running a Genesis app
  • An MQTT Broker - Any message queue broker that supports MQTT, such as Mosquitto or RabbitMQ
important

The Genesis low-code platform can run with Consul without MQTT using ZeroMQ.

However, this is not recommended as MQTT is required for dynamic scaling.

Automatic failover

When Consul is used, processes that are set to primaryOnly failover automatically to the secondary node when the primary node becomes unresponsive. (Failover must be performed manually if you are using the standard Genesis Clustering service.)

Networking

When using Consul mode, all Genesis services within an app need to be able to access all the other Genesis services in the cluster on their defined ports.

The default Genesis services and their ports are:

ServicePort
GENESIS_AUTH_CONSOLIDATOR8005
GENESIS_AUTH_MANAGER8001
GENESIS_AUTH_PERMS8003
GENESIS_CLUSTER9000
GENESIS_EVALUATOR9015
GENESIS_ROUTER9017

See Server configuration - service definitions for information on how to set the ports for user-defined services, and how to override the ports dynamically.

The Consul Agents must also be able to access all the services on their defined ports.

Configuring the app

Basic configuration

If a Consul agent is running locally to the app on the default port of 8500, then the only config required is:

genesis-system-definition.kts
systemDefinition {
global {
...
item(name = "ClusterMode", value = "CONSUL")
...
}
}

Extended configuration

ItemDescriptionDefault
ConsulHostAddressIP or hostname of the Consul Agentlocalhost
ConsulHostPortHTTP Port for the Consul Agent8500
ConsulConnectionTimeoutSecondsSet the timeout for connecting to the Consul Agent in seconds10
ConsulReadTimeoutSecondsSet the timeout for reading from the Consul Agent in seconds10
ConsulWriteTimeoutSecondsSet the timeout for writing to the Consul Agent in seconds10
ConsulWatchSecondsHow long to tell the Consul server to wait for new key/value cache values8
ConsulTtlSecondsTimeout for the TTL Consul checks15
ttlPingSecondsHow often to run the TTL checks and update Consul with the stateConsulTtlSeconds / 2
deregisterCriticalServiceAfterMinsTime to wait after a TTL check goes critical before it deregisters the service in minutes30
clusterServiceAffinityGroupGroup apps for preferred routing. Useful for multi-region set-ups""
ConsulServiceNamePatternAllows you to template your Consul service names with {{PROCESS_NAME}} eg. my_app_{{PROCESS_NAME}}null
ConsulNamespaceFormatSets Consul namespace (only used in Consul Enterprise)null

Troubleshooting

Request fails with 5xx when trying to reach a process that no longer exists

Scenario: This might happen when a resource is moved from one process to another. For example, a requestReply originally served from REQ_REP_SERVICE_A is now moved to REQ_REP_SERVICE_B, but consul still holds a reference that is being served by the former.

Solution: Check the consul console (or cli) to see if there is still a resourceMap (under KeyValues) for the process that no longer exists. If there is, delete it. Then restart the Genesis services.