
AWS GPU: Install Kommander

You have installed the Konvoy component and created a cluster. Now install Kommander, which gives you access to the UI and lets you attach new or existing clusters to monitor.

Prerequisites

  • Ensure you have reviewed all Prerequisites for Install.

  • Ensure you have a default StorageClass.

  • Note the name of the cluster where you want to install Kommander. If you do not know the cluster name, use kubectl get clusters -A to display and find it.
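
One way to confirm the default StorageClass prerequisite is to look for the (default) marker that kubectl prints next to the name of the default class. The following is a minimal sketch; the helper name has_default_storageclass is hypothetical and is not part of DKP or kubectl.

```shell
# Succeeds if the `kubectl get storageclass` listing read from stdin
# marks at least one class with "(default)".
has_default_storageclass() {
  grep -q -F '(default)'
}

# Usage against a live cluster (requires kubectl and a valid kubeconfig):
#   kubectl get storageclass | has_default_storageclass \
#     && echo "default StorageClass found" \
#     || echo "no default StorageClass" >&2
```

The helper only inspects text, so you can also run it on a saved copy of the listing.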

Create and customize your Kommander Installer Configuration File

  1. Set the environment variable for your cluster:

    CODE
    export CLUSTER_NAME=<your-management-cluster-name>
  2. Copy the kubeconfig file of your Management cluster to your local directory:

    CODE
    dkp get kubeconfig -c ${CLUSTER_NAME} > ${CLUSTER_NAME}.conf
  3. Create a configuration file for the deployment:

    CODE
    dkp install kommander --init > kommander.yaml 
  4. If required, customize your kommander.yaml.
    See Kommander Customizations for customization options. These include Custom Domains and Certificates, HTTP proxy, External Load Balancer, GPU utilization, and Rook Ceph customization (pre-provisioned environments).

  5. If your cluster uses a custom AWS VPC and requires an internal load balancer, set the Traefik annotation to create an internal-facing ELB:

    CODE
    apps:
      traefik:
        enabled: true
        values: |
          service:
            annotations:
              service.beta.kubernetes.io/aws-load-balancer-internal: "true"
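
Before continuing, it can help to confirm that the files produced in steps 2 and 3 exist and are not empty, since a failed redirect leaves an empty file behind. This is a minimal sketch; the function name require_nonempty is hypothetical.

```shell
# Return 0 only if every file passed as an argument exists and is non-empty.
# Reports the first offending file on stderr.
require_nonempty() {
  for f in "$@"; do
    if [ ! -s "$f" ]; then
      echo "missing or empty: $f" >&2
      return 1
    fi
  done
}

# Usage after steps 1 to 3 (assumes CLUSTER_NAME is exported as in step 1):
#   require_nonempty "${CLUSTER_NAME}.conf" kommander.yaml
```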

Enable GPU Resources

  1. In the same kommander.yaml file, enable the NVIDIA platform services:

    CODE
    apps:
      nvidia-gpu-operator:
        enabled: true
  2. Append the correct Toolkit version based on your OS:

    The NVIDIA Container Toolkit allows users to run GPU-accelerated containers. The toolkit includes a container runtime library and utilities that automatically configure containers to use NVIDIA GPUs. Its version must match your base operating system.

    CentOS 7.9/RHEL 7.9
    If you’re using CentOS 7.9 or RHEL 7.9 as the base operating system for your GPU-enabled nodes, set the toolkit.version parameter in your Kommander Installer Configuration file (kommander.yaml) to the following:

    CODE
    kind: Installation
    apps:
      nvidia-gpu-operator:
        enabled: true
        values: |
          toolkit:
            version: v1.13.1-centos7

    RHEL 8.4/8.6 and SLES 15 SP3
    If you’re using RHEL 8.4/8.6 or SLES 15 SP3 as the base operating system for your GPU-enabled nodes, set the toolkit.version parameter in your Kommander Installer Configuration file (kommander.yaml) to the following:

    CODE
    kind: Installation
    apps:
      nvidia-gpu-operator:
        enabled: true
        values: |
          toolkit:
            version: v1.13.1-ubi8

    Ubuntu 18.04 and 20.04
    If you’re using Ubuntu 18.04 or 20.04 as the base operating system for your GPU-enabled nodes, set the toolkit.version parameter in your Kommander Installer Configuration file (kommander.yaml) to the following:

    CODE
    kind: Installation
    apps:
      nvidia-gpu-operator:
        enabled: true
        values: |
          toolkit:
            version: v1.13.1-ubuntu20.04
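
The mapping from base operating system to toolkit version described in this section can be summarized as a small lookup. This is only a sketch: the function name and the short OS keys (centos7, rhel8, and so on) are hypothetical labels chosen for illustration, while the version strings are the ones listed above.

```shell
# Print the toolkit.version value for a given base OS key.
# The OS keys are illustrative labels, not values read by DKP.
toolkit_version_for() {
  case "$1" in
    centos7|rhel7)           echo "v1.13.1-centos7" ;;      # CentOS 7.9 / RHEL 7.9
    rhel8|sles15)            echo "v1.13.1-ubi8" ;;         # RHEL 8.4/8.6, SLES 15 SP3
    ubuntu18.04|ubuntu20.04) echo "v1.13.1-ubuntu20.04" ;;  # Ubuntu 18.04 and 20.04
    *) echo "unsupported OS key: $1" >&2; return 1 ;;
  esac
}

# Usage:
#   toolkit_version_for rhel8
```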

Enable DKP Catalog Applications and Install Kommander

If you want to enable DKP Catalog applications after installing DKP, see Enable DKP Catalog Applications after Installing DKP.

  1. In the same kommander.yaml file used in the previous section, add these values for dkp-catalog-applications:

    CODE
    apiVersion: config.kommander.mesosphere.io/v1alpha1
    kind: Installation
    catalog:
      repositories:
        - name: dkp-catalog-applications
          labels:
            kommander.d2iq.io/project-default-catalog-repository: "true"
            kommander.d2iq.io/workspace-default-catalog-repository: "true"
            kommander.d2iq.io/gitapps-gitrepository-type: "dkp"
          gitRepositorySpec:
            url: https://github.com/mesosphere/dkp-catalog-applications
            ref:
              tag: v2.7.0

    Warning: If you only want to enable catalog applications in an existing configuration, add these values to your existing installer configuration file so that your Management cluster’s settings are maintained.

  2. Use the customized kommander.yaml to install Kommander:

    CODE
    dkp install kommander --installer-config kommander.yaml --kubeconfig=${CLUSTER_NAME}.conf
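
The snippets on this page each start from a top-level key such as apps:. If they are pasted into kommander.yaml one after another, the same key can end up appearing twice instead of being merged under a single key. The sketch below is a simple text check, not a YAML parser, that you could run before the install command; the function name is hypothetical.

```shell
# Print every top-level key (a key starting in column 0) that appears
# more than once in the YAML file given as $1. Prints nothing if none repeat.
duplicate_top_level_keys() {
  grep -E '^[A-Za-z][A-Za-z0-9_-]*:' "$1" | cut -d: -f1 | sort | uniq -d
}

# Usage:
#   duplicate_top_level_keys kommander.yaml
```

Any output means the entries under the repeated key should be merged into a single block.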

Tips and recommendations

  • The --kubeconfig=${CLUSTER_NAME}.conf flag ensures that you install Kommander on the correct cluster. For alternatives, see Provide Context for Commands with a kubeconfig File.

  • Applications can take longer than expected to deploy and cause the installation to time out. Add the --wait-timeout <time to wait> flag with a duration (for example, 1h) to allow more time for the applications to deploy.

  • If the Kommander installation fails, or you wish to reconfigure applications, rerun the install command to retry.
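
The tips above can be combined into a single invocation. The following sketch only prints the resulting command so that you can review it before running it. The function name is hypothetical, the flags are the ones named in this section, and the 1h default is the example value from the tip above.

```shell
# Print the install command for a given cluster name and optional wait timeout.
# $1: management cluster name, $2: wait timeout (defaults to 1h).
kommander_install_cmd() {
  cluster="$1"
  wait_timeout="${2:-1h}"
  echo "dkp install kommander --installer-config kommander.yaml --kubeconfig=${cluster}.conf --wait-timeout ${wait_timeout}"
}

# Usage (assumes CLUSTER_NAME is exported as in the first section):
#   kommander_install_cmd "${CLUSTER_NAME}" 2h
```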

Next Step:

AWS GPU: Verify Install and Log in the UI
