Recovery of Kyvos cluster from Kyvos Manager

Recovery of Kyvos cluster from Kyvos Manager

✅ Enterprise: AWS, Azure, GCP, and On-Premises


Kyvos Manager provides a dedicated interface for disaster recovery of all cloud clusters. The dedicated wizard provides a guided flow for performing disaster recovery across all cloud clusters. Disaster recovery applies to the following cases:

  • Cross Region Disaster Recovery: Recover the cluster in a region other than the primary cluster's deployment region, which was impacted by a disaster.

  • Same Region Disaster Recovery: Recover cluster in a same region in which primary cluster was deployed. The entire primary region is not impacted due to disaster rather few such resources got impacted due to any reason. Restoring of those resources is expected via performing disaster recovery within the same region.

Step Name

Conditional

Implicit

Cross Region Recovery (CRR)

Same Region Recovery (SRR)

Single Node Standard (SNS)

 

Recovery Resources Creation

 

 

Automated

Manual

Automated

Manual

 

 

Configure Kyvos Manager Server Details

No

Yes

Yes

Yes

Yes

Yes

Yes

 

Uninstall Zookeeper

Yes

No

Yes (Conditional)

Yes (Conditional)

Yes (Conditional)

Yes (Conditional)

No

Applicable only if Zookeeper is deployed on multiple node

Delete Offline Nodes

No

No

Yes

Yes

Yes

Yes

Yes

 

Change Node Authentication

No

No

Yes

Yes

Yes

Yes

Yes

 

Configure Load Balancer

Yes

No

Yes. If not automated & found in stack 

Yes

No

No

CRR: Yes. If not automated & found in stack 

 

Add Nodes

No

No

Yes

Yes

Yes

Yes

Yes

Applicable only in Single node addition allowed in case of SNS cluster

Switch To Replica

Yes

No

Yes

Yes

No

No

CRR: Yes, SRR: No

Applicable only in Cross Region DR

Switch Repository

Yes

No

Yes

Yes

Yes (Conditional)

Yes (Conditional)

Yes

  • Applicable only for installed repository that has the same region recovery.

  • Always applicable in cross region recovery

Configure Compute Cluster

Yes

No

Yes

Yes

No

No

CRR: Yes, SRR: No

Applicable only in cross-region DR if the Compute type is set as Kyvos Native (i.e. No-Spark deployments)

Configure Compute Master

Yes

No

Yes (if applicable)

Yes

No

No

 

 Applicable only in Azure.

Configure Cloud Functions

Yes

No

No if found in stack (then applicable to manual creation)

Yes

No

No

No

Applicable only in cross-region DR

Add Kyvos Manager Instance

Yes

No

Yes (Conditional)

Yes (Conditional)

No

No

No

Applicable only in cross-region DR when Kyvos Manager HA was enabled

Install Zookeeper 

No

No

Yes

Yes

Yes

Yes

Yes

 

Points to know

  • From Kyvos 2026.3 onwards, for AWS, you can enable automated disaster recovery by configuring recovery mode to Auto in DR CFT inputs.
    For more details, see the section, Disaster Recovery on AWS.

  • As a prerequisite, keep the following things handy during disaster recovery, depending on what is affected in your cluster.

    • New certificates are applicable if existing settings (domain/subdomain) are changed after recovery.

    • Get the new license by sending the license request file in case of production.

Manual Mode: Disaster recovery through the guided flow on Kyvos Manager

Important

  • For AWS, disaster recovery can be performed in either Manual or Auto mode, depending on the recovery mode configured in the DR CFT inputs. However, for Azure and GCP, disaster recovery can only be performed through guided flow on Kyvos Manager.

  • The disaster recovery options for each cloud may vary depending on the primary deployment.

  1. Log on to the Kyvos Manager portal.

  2. After logging on to the restored Kyvos Manager, you are redirected to the Disaster Recovery page, which shows the current node status and cluster restoration steps.

  3. The step Configure Kyvos Manager Server Details is started automatically.

  4. Click the Uninstall button corresponding to Step: Uninstall Zookeeper. The option to Uninstall Zookeeper is displayed only when Zookeeper is deployed on multiple nodes.

    1. On the displayed confirmation dialog box, provide your Kyvos Manager password, and click the Uninstall button. A new browser tab is opened, showing uninstall Zookeeper operation details and status. You may switch back to the Disaster Recovery browser tab once the operation is completed.

  5. Click the Delete button corresponding to Step: Delete Offline Nodes. From the Delete Offline Nodes dialog box, select the nodes you want to delete and provide your Kyvos Manager Password. Note that you will see only the Offline nodes in this list.

    1. Click the Delete button.

      • Once deleted, nodes cannot be retrieved.

      • You may switch back to the Disaster Recovery browser tab.

      • Once the operation is completed, you will see the status shown on the Operations page.

  6. Click the Change button corresponding to Step: Change Node Authentication. On the Change Node Authentication dialog,

    1. Add the node IP to authenticate SSH connections for all Kyvos cluster nodes.

    2. Select the following options to Authenticate the node:

      • Saved credentials: Use this option to use the saved credentials.

      • New credentials: Use this option to use the new credentials. Upload a new private key or enter password for node authentication.

    3. Click Validate.

    4. Enter your Kyvos Manager password.

  7. Click the Configure button corresponding to Step: Configure Load Balancer. The Load Balancer Configuration dialog is displayed.

Note

The option is displayed only when the wizard-based (manual) backup and recovery is selected.

  1. Click the Add button corresponding to Step: Add Nodes. On the Add Kyvos Core Services Nodes dialog box, click the Next button.
    Ensure that you can add multiple new nodes as per your license. Once done, provide your Kyvos Manager Password, and click the Add button. A new browser tab is opened, showing add node operation details and status. You may switch back to the Disaster Recovery browser tab.
    Once the operation is completed, you will see the status shown in the following figure. At this point, you will be able to perform the next step for switching to replica.

  2. Click the Switch button corresponding to Step: Switch to Replica. You will be prompted to switch to replicated resources, such as bucket name, secret and Kyvos Repository. Click Switch. Resources created for disaster recovery are prepopulated. You need to review the details on the Manage Kyvos Repository page and click ok and then save the configurations.

  3. Click the Switch button corresponding to Step: Switch Repository. You will be redirected to the Manage Kyvos Repository page. Refer to the Manage Kyvos Repository section to learn more.

  4. Click the Configure button corresponding to Step: Configure Compute Cluster. You are redirected to the Compute Cluster page. On the page, information of the resources which you have created in secondary region are auto populated. You can also modify the details as needed.

  5. Click the Configure button corresponding to Step: Configure Compute Master. The Compute master is only supported with Kubernetes in Azure. In Kyvos 2026.3, Kubernetes is not supported with automated DR.

  6. Click the Configure button corresponding to Step: Configure Cloud Functions. Fill in the required details on the Cloud Functions dialog box and click Save.

Note

  • The option is displayed only when the wizard-based (manual) backup and recovery is selected.

  • The Cloud Functions configuration option is not displayed for Single Node Standard Disaster Recovery deployments.

  1. Click the Add button corresponding to Step: Kyvos Manager Instances. The Add Kyvos Manager Service dialog is displayed.

  2. Click the Install button corresponding to Step: Install Zookeeper. Enter your Kyvos Manager password.

  3. Click the Home icon to go to the Home page of Kyvos Manager. On this page, verify that all nodes are running after disaster recovery.

Auto Mode: Disaster recovery via Kyvos Manager for AWS

For AWS, disaster recovery can be performed in either Manual or Auto mode, depending on the recovery mode configured in the DR CFT inputs. In Auto mode, no user intervention is required. The system automatically executes the necessary recovery steps. If a failure occurs during automated execution, the system switches to Manual mode and automatically returns to Auto mode once the issue is resolved.

  1. Log on to the Kyvos Manager portal.

  2. When you log on to the restored Kyvos Manager, you will be automatically redirected to the Disaster Recovery page. The system is automatically executing the required recovery steps.

  3. Click the Home icon to go to the Home page of Kyvos Manager. On this page, verify that all nodes are running after disaster recovery.

    image-20260324-171343.png

 

Copyright Kyvos, Inc. 2026. All rights reserved.