Mastering Kubernetes Resiliency with Dynatrace: Avoiding Pitfalls, Optimizing and Auto-scaling Workloads
This repository contains all the files used during the demo of the Observability clinic: Mastering Kubernetes Resiliency with Dynatrace
This repository showcase the usage of several solutions with Dynatrace:
- OpenCost
- Keptn lifecylce Toolkit
- HPA
The following tools need to be install on your machine :
- jq
- kubectl
- git
- gcloud ( if you are using GKE)
- Helm
PROJECT_ID="<your-project-id>"
gcloud services enable container.googleapis.com --project ${PROJECT_ID}
gcloud services enable monitoring.googleapis.com \
cloudtrace.googleapis.com \
clouddebugger.googleapis.com \
cloudprofiler.googleapis.com \
--project ${PROJECT_ID}
ZONE=europe-west3-a
NAME=observabilitclinic-masteringk8s
gcloud container clusters create ${NAME} --zone=${ZONE} --machine-type=e2-standard-8 --num-nodes=2
git clone https://github.com/dynatrace-perfclinics/observabilitclinic-masteringk8s
cd observabilitclinic-masteringk8s
If you don't have any Dynatrace tenant , then i suggest to create a trial using the following link : Dynatrace Trial
Once you have your Tenant save the Dynatrace (including https) tenant URL in the variable DT_TENANT_URL
(for example : https://dedededfrf.live.dynatrace.com)
DT_TENANT_URL=<YOUR TENANT URL>
The dynatrace operator will require to have several tokens:
- Token to deploy and configure the various components
- Token to ingest metrics and Traces
One for the operator having the following scope:
- Create ActiveGate tokens
- Read entities
- Read Settings
- Write Settings
- Access problem and event feed, metrics and topology
- Read configuration
- Write configuration
- Paas integration - installer downloader
Save the value of the token . We will use it later to store in a k8S secret
API_TOKEN=<YOUR TOKEN VALUE>
Create a Dynatrace token with the following scope:
- Ingest metrics (metrics.ingest)
- Ingest logs (logs.ingest)
- Ingest events (events.ingest)
- Ingest OpenTelemetry
- Read metrics
DATA_INGEST_TOKEN=<YOUR TOKEN VALUE>
cd ..
chmod 777 deployment.sh
./deployment.sh --clustername "${NAME}" --dturl "${DT_TENANT_URL}" --dtingesttoken "${DATA_INGEST_TOKEN}" --dtoperatortoken "${API_TOKEN}"
To let Dynatrace ingest the OpenCost metrics in dynatrace, we need to add the dynatrace annotations on the openCost servic/
kubectl edit svc opencost -n opencost
OpenCost expose the Prometheus metrics on the port 9003, so let's add the following annotations :
apiVersion: v1
kind: Service
metadata:
annotations:
metrics.dynatrace.com/path: /metrics
metrics.dynatrace.com/port: "9003"
metrics.dynatrace.com/scrape: "true"
Save the changes .
First let's start with the cluster efficiency Dashboard :
curl -X 'POST' \
'https://bix24852.dev.dynatracelabs.com/api/config/v1/dashboards' \
-H 'accept: application/json; charset=utf-8' \
-H 'Content-Type: application/json; charset=utf-8' \
-H 'Authorization: Api-Token ${API_TOKEN}'\
-d @dynatrace/Cluster efficiency.json'
then the K6 dashboard:
curl -X 'POST' \
'https://bix24852.dev.dynatracelabs.com/api/config/v1/dashboards' \
-H 'accept: application/json; charset=utf-8' \
-H 'Content-Type: application/json; charset=utf-8' \
-H 'Authorization: Api-Token ${API_TOKEN}'\
-d @dynatrace/K6 load test.json'
We can see that the deployed workload is not efficient and the namespace hipster-shop is the most expensive. We can reduce the cost of the cluster by modifying the request & limits.
The repository has another version of the hipster-shop deployment file having lower value for the ressource :
- requests
- limit Let's apply the update version of the hipster-shop :
kubectl apply -f hipstershop/k8s-manifest.yaml -n hipster-shop
kubectl apply -f k6/loadtest_job.yaml -n hipster-shop
To handle the load properly let's deploy some HPA rules on the following deployments :
- frontend
- productcalalogservice
- cartservice
- checkoutservice
- recommendationservice
kubectl apply -f hpa/hpa_cpu.yaml-n hipster-shop
kubectl apply -f k6/loadtest_job.yaml -n hipster-shop
By looking at Dynatrace, we can see that :
- the cost of the cluster has increased
- we have pending workload
- we still have performance issues
kubectl apply -f keptn/metricProvider.yaml -n hipster-shop
In Dynatrace, let's create a metric expression to measure :
- the number of request coming in the frontend service
- the % of CPU throttling
kubectl apply keptn/keptnmetric.yaml -n hipster-shop
kubectl apply keptn/hpa.yaml -n hipster-shop
kubectl apply -f k6/loadtest_job.yaml -n hipster-shop