AWS with GPU Install
This section provides instructions to install DKP in an AWS non-air-gapped GPU environment.
If you have not already done so, refer to the Get Started section of the documentation for:
AWS Prerequisites
Before you begin using Konvoy with AWS, you must:
Follow the steps to create permissions and roles on the Minimal Permissions and Role to Create Clusters page.
Create Cluster IAM Policies and Roles.
Export the AWS region where you want to deploy the cluster:
export AWS_REGION=us-west-2
Export the AWS profile with the credentials you want to use to create the Kubernetes cluster:
export AWS_PROFILE=<profile>
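Before continuing, it can help to confirm that both variables are set in the current shell. The following is a minimal sketch, not part of the DKP tooling: check_aws_env is a hypothetical helper name introduced here for illustration, and it only checks that the two variables exported above are non-empty.

```shell
# Hypothetical helper (not a DKP or AWS command): reports which of the
# two variables exported above are missing from the current shell.
check_aws_env() {
  status=0
  [ -n "${AWS_REGION:-}" ] || { echo "AWS_REGION is not set" >&2; status=1; }
  [ -n "${AWS_PROFILE:-}" ] || { echo "AWS_PROFILE is not set" >&2; status=1; }
  return "$status"
}

# Example usage, assuming the AWS CLI is installed and the profile exists:
#   check_aws_env && aws sts get-caller-identity
```

If the helper succeeds, running aws sts get-caller-identity is one way to confirm that the selected profile resolves to valid credentials.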
If you use AWS ECR as your local private registry, see the Registry Mirror Tools page for more information.
To deploy a cluster with a custom image in a region where CAPI images are not provided, you need to use Konvoy Image Builder to create your own image for the region.
For multi-tenancy, place every tenant in a separate AWS account so that tenants are fully isolated from one another and security boundaries can be enforced.
GPU Prerequisites
Before you begin, you must:
Ensure nodes provide an NVIDIA GPU.
If you are using a public cloud service such as AWS, create an AMI with KIB using the instructions on the KIB for GPU page.
If you are deploying in a pre-provisioned environment, ensure that you have created the appropriate secret for your GPU nodepool and have uploaded the appropriate artifacts to each node. See the GPU only steps section on the Pre-provisioned Prerequisites Air-gapped page for additional information.
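The first prerequisite above, that nodes provide an NVIDIA GPU, can be spot-checked on a node by scanning its PCI device listing. This is a rough sketch under stated assumptions: has_nvidia_gpu is a hypothetical helper name, the node is assumed to have lspci (from pciutils) installed, and a match only shows that an NVIDIA device is visible on the PCI bus, not that drivers are installed.

```shell
# Hypothetical helper: reads a PCI device listing on stdin and succeeds
# if any line mentions NVIDIA (case-insensitive).
has_nvidia_gpu() {
  grep -qi 'nvidia'
}

# Example usage on a node, assuming lspci is available:
#   lspci | has_nvidia_gpu && echo "NVIDIA device found"
```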
The steps for enabling nvidia-gpu-operator differ depending on whether you want to deploy the app on a Management cluster or on an Attached or Managed cluster.
To enable the NVIDIA platform application on a Management cluster, follow the instructions in the NVIDIA Platform Application Management Cluster section.
To enable the NVIDIA platform application on Attached or Managed clusters, follow the instructions in the NVIDIA Platform Application Attached or Managed Cluster section.
After nvidia-gpu-operator has been enabled for your cluster type, proceed to the Select the correct Toolkit version for your NVIDIA GPU Operator section on the corresponding page.