- Overview
- Requirements
- Installation
- Post-installation
- Cluster administration
- Managing products
- Managing the cluster in ArgoCD
- Setting up the external NFS server
- Automated: Enabling the Backup on the Cluster
- Automated: Disabling the Backup on the Cluster
- Automated, Online: Restoring the Cluster
- Automated, Offline: Restoring the Cluster
- Manual: Enabling the Backup on the Cluster
- Manual: Disabling the Backup on the Cluster
- Manual, Online: Restoring the Cluster
- Manual, Offline: Restoring the Cluster
- Additional configuration
- Migrating objectstore from persistent volume to raw disks
- Monitoring and alerting
- Migration and upgrade
- Migration options
- Step 1: Moving the Identity organization data from standalone to Automation Suite
- Step 2: Restoring the standalone product database
- Step 3: Backing up the platform database in Automation Suite
- Step 4: Merging organizations in Automation Suite
- Step 5: Updating the migrated product connection strings
- Step 6: Migrating standalone Insights
- Step 7: Deleting the default tenant
- B) Single tenant migration
- Product-specific configuration
- Best practices and maintenance
- Troubleshooting
- How to Troubleshoot Services During Installation
- How to Uninstall the Cluster
- How to clean up offline artifacts to improve disk space
- How to clear Redis data
- How to enable Istio logging
- How to manually clean up logs
- How to clean up old logs stored in the sf-logs bucket
- How to disable streaming logs for AI Center
- How to debug failed Automation Suite installations
- How to delete images from the old installer after upgrade
- How to automatically clean up Longhorn snapshots
- How to disable TX checksum offloading
- How to address weak ciphers in TLS 1.2
- Unable to run an offline installation on RHEL 8.4 OS
- Error in Downloading the Bundle
- Offline installation fails because of missing binary
- Certificate issue in offline installation
- First installation fails during Longhorn setup
- SQL connection string validation error
- Prerequisite check for selinux iscsid module fails
- Azure disk not marked as SSD
- Failure After Certificate Update
- Automation Suite not working after OS upgrade
- Automation Suite Requires Backlog_wait_time to Be Set 1
- Volume unable to mount due to not being ready for workloads
- RKE2 fails during installation and upgrade
- Failure to upload or download data in objectstore
- PVC resize does not heal Ceph
- Failure to Resize Objectstore PVC
- Rook Ceph or Looker pod stuck in Init state
- StatefulSet volume attachment error
- Failure to create persistent volumes
- Storage reclamation patch
- Backup failed due to TooManySnapshots error
- All Longhorn replicas are faulted
- Setting a timeout interval for the management portals
- Update the underlying directory connections
- Cannot Log in After Migration
- Kinit: Cannot Find KDC for Realm <AD Domain> While Getting Initial Credentials
- Kinit: Keytab Contains No Suitable Keys for *** While Getting Initial Credentials
- GSSAPI Operation Failed With Error: An Invalid Status Code Was Supplied (Client's Credentials Have Been Revoked).
- Alarm Received for Failed Kerberos-tgt-update Job
- SSPI Provider: Server Not Found in Kerberos Database
- Login Failed for User <ADDOMAIN><aduser>. Reason: The Account Is Disabled.
- ArgoCD login failed
- Failure to get the sandbox image
- Pods not showing in ArgoCD UI
- Redis Probe Failure
- RKE2 Server Fails to Start
- Secret Not Found in UiPath Namespace
- After the Initial Install, ArgoCD App Went Into Progressing State
- MongoDB pods in CrashLoopBackOff or pending PVC provisioning after deletion
- Unexpected Inconsistency; Run Fsck Manually
- Degraded MongoDB or Business Applications After Cluster Restore
- Missing Self-heal-operator and Sf-k8-utils Repo
- Unhealthy Services After Cluster Restore or Rollback
- RabbitMQ pod stuck in CrashLoopBackOff
- Prometheus in CrashloopBackoff state with out-of-memory (OOM) error
- Missing Ceph-rook metrics from monitoring dashboards
- Pods cannot communicate with FQDN in a proxy environment
- Using the Automation Suite Diagnostics Tool
- Using the Automation Suite support bundle
- Exploring Logs
Step 1.2: Configuring the VM
- To connect to the machine using SSH, follow the Azure instructions.
-
Alternatively, you can connect to the machine on your terminal using SSH:
# If you set a password the command is: ssh <user>@<dns_of_vm> # If you used an ssh key: ssh -i <.\Path\To\myKey1.pem> <user>@<dns_of_vm>
# If you set a password the command is: ssh <user>@<dns_of_vm> # If you used an ssh key: ssh -i <.\Path\To\myKey1.pem> <user>@<dns_of_vm>
Log in to the machine via SSH using the following commands:
-
If you set a password:
ssh <user>@<dns_of_vm>
ssh <user>@<dns_of_vm> -
If you used an SSH key:
ssh -i <.\Path\To\myKey1.pem> <user>@<dns_of_vm>
ssh -i <.\Path\To\myKey1.pem> <user>@<dns_of_vm>
The disk device name is different from the disk name. You will need the disk device name when configuring the disk.
To configure the disk for installation, see the following:
These additional inbound ports are needed only for multi-node HA-ready production installations. Add them to all VMs.
Port |
Protocol |
Source |
Destination |
Purpose |
---|---|---|---|---|
443
|
TCP |
Any |
Any |
https traffic |
2379
|
TCP |
VirtualNetwork |
VirtualNetwork |
etcd client port |
2380
|
TCP |
VirtualNetwork |
VirtualNetwork |
etcd peer port |
6443
|
TCP |
Any |
Any |
Kubernetes API |
8472
|
UDP |
VirtualNetwork |
VirtualNetwork |
Flannel |
9345
|
TCP |
Any |
Any |
Kubernetes API |
10250
|
TCP |
VirtualNetwork |
VirtualNetwork |
kubelet |
30071
|
TCP |
VirtualNetwork |
VirtualNetwork |
NodePort |
Opening TCP ports on an Azure VM for multi-node installations
Create new inbound networking rules for the ports needed over TCP protocol.