Udemy
    •  
    •  
    •  
    •  
    •  
    •  
    •  
    •  
Turn what you know into an opportunity and reach millions around the world.
Learn More
Your cart is empty.
Keep shopping
Troubleshooting Kubernetes
Rating: 3.8 out of 5(40 ratings)
420 students

Troubleshooting Kubernetes

Practical solutions to common issues faced while deploying, managing, and operating your Kubernetes clusters
Last updated 1/2020
English

What you'll learn

  • Plan, operate and manage a Kubernetes cluster in production
  • Troubleshoot cluster issues
  • Select the correct resources in Kubernetes to serve your infrastructure goals
  • Manage application deployments on Kubernetes clusters
  • Configure cluster-wide logging
  • Monitor and set up alerts for cluster health and determine if any action is required
  • Best practices for working with Kubernetes

Course content

6 sections41 lectures2h 55m total length
  • The Course Overview2:52

    This video will give you an overview about the course.

  • Planning and Installing2:48

    Introduction to planning and installation of a Kubernetes cluster.

       •  Explain architectural decisions

       •  Select the install options

       •  Links for refences

  • Resource Management3:29

    Introduction to resource management in Kubernetes.

       •  Learn what a resource is and its types

       •  Learn about Pod

       •  Troubleshoot resource management issues

  • Scenario – Insufficient Cluster Capacity3:12

    Pod fails to get scheduled due to insufficient cluster capacity.

       •  Look at Pod related events, for an indication on resource shortage

       •  Look at node details, to find out allocated and available capacity

       •  Reduce Pod resource requirement or add more capacity

  • Pod Security Policy7:02

    Introduction and troubleshooting of issues with PSP.

       •  Check Pod related events for indication of PSP errors

       •  Check policy, role, and role bindings

       •  Create and modify the role and role bindings

  • Troubleshooting Networking Issues9:10

    Understand Kubernetes network model and basic network info gathering.

       •  Get an overview of the Kubernetes network model

       •  Understand the troubleshooting steps

       •  Perform Calico specific debugging

  • Scenario – Deploy the Debug Tool2:32

    Introduction to network debug tool and it’s usage in troubleshooting.

       •  Use sample YAML

       •  Use ‘dig’ command

  • DNS or Pod-to-Service Communication Issues6:01

    Pods unable to resolve network service.

       •  Deploy network debug tool in Pod network. and run basic network debug commands

       •  Look at network settings on the host

       •  Modify Kubernetes network settings or host network settings as required

  • Scenario – Kubernetes Dashboard Loading Is Slow11:18

    Kubernetes dashboard loading is slow.

       •  Look at dashboard Pods and logs for any errors

       •  Look at the node running dashboard Pods for resource starvation, stats, and errors

       •  Modify Kubernetes network settings or host network settings as required

  • Multi-Platform Kubernetes Cluster4:46

    Introduction to multi-platform cluster, scheduling, node selectors, and node affinity.

       •  Understand what Kubernetes multiplatform clusters are

       •  Learn about Kubernetes scheduling primitives

       •  Explore node selectors and node affinity

  • Taints and Tolerations4:55

    Introduction to taints and tolerations.

       •  Introduction to both

       •  Look at Pod details and related events

       •  Look at further Pod details and related events

  • Common Issues with Multi-Platform Clusters2:38

    Explore the Pod restarting and in CrashLoopBackOff state post deployment, in a multi-platform cluster.

       •  Look at Pod related events

       •  Look at Pod logs

       •  Modify Pod specs for targeted deployment

  • Scenario – Pending State Post-Deployment4:17

    Pod status remains in pending state.

       •  Look at Pod details and related events

       •  Look at cluster node details

       •  Modify Pod specs or update cluster as required

  • Test your knowledge

Requirements

  • Working knowledge of Kubernetes is assumed; this course explores it in more detail and examines the practical aspects of managing your clusters.

Description

Containers are changing the way applications are developed, deployed, and managed. Consequently, container orchestration has become a critical piece of infrastructure for enterprises on their digital transformation journey. Kubernetes is an open-source container orchestration platform for automating the deployment, scaling, and management of containerized applications. It has become the de-facto standard for container orchestration.

This course provides a tour of how to debug issues with the Kubernetes cluster infrastructure and presents a working conceptual model of how clusters operate. You will be addressing common problems faced by developers in cluster management, monitoring, and logging. Subsequently, we examine common issues when creating and deploying apps on Kubernetes.

By the end of the course, you'll have gained full control of your Kubernetes infrastructure and will be able to manage it with ease.

Please note a working knowledge of Kubernetes is required for this course.

About the Author

Pradipta Kumar Banerjee

IBM, Senior Technical Staff Member, Bangalore, India

Pradipta is a Senior Technical Staff Member for the IBM Cloud Private platform and leads cloud-native platform initiatives for IBM Power Microprocessor. He comes with an extensive infrastructure and cloud background and has worked on many first-of-a-kind projects for large enterprise customers across various industries, focusing primarily on virtualization, cloud, and container technologies. Pradipta is an IBM Master Inventor with many patents and received the Best of IBM and Outstanding Technical Achievement awards.

Who this course is for:

  • This course is for sysadmins and developers who would like to implement easy and practical solutions to common issues encountered while using Kubernetes for large-scale container application management.