What should you do first?

You support a popular mobile game application deployed on Google Kubernetes Engine (GKE) across several Google Cloud regions. Each region has multiple Kubernetes clusters. You receive a report that none of the users in a specific region can connect to the application. You want to resolve the incident while following Site Reliability Engineering practices .

What should you do first?
A . Reroute the user traffic from the affected region to other regions that don’t report issues.
B . Use Stackdriver Monitoring to check for a spike in CPU or memory usage for the affected region.
C . Add an extra node pool that consists of high memory and high CPU machine type instances to the cluster.
D . Use Stackdriver Logging to filter on the clusters in the affected region, and inspect error messages in the logs.

Answer: D

Subscribe
Notify of
guest
0 Comments
Inline Feedbacks
View all comments