What We Learned: 7 Field-Tested Best Practices for Kubernetes

Published: September 18th, 2019

Our Kubernetes journey at Algolia began with one question: How can our development team deploy new services with more flexibility?

Two years ago, we were a big user of bare metal machines. This changed when we assessed Kubernetes for automating hardware allocation and managing the life cycle of workloads and services. Today, most of our products are deployed on Kubernetes.

Below are seven “best practices” for using Kubernetes — a combination of tips, explanations and lessons learned from the field.

1. Do not use the “root” user in containers

Kubernetes runs applications in containers, or logical partitions of the underlying host’s resources. Contrary to full virtualization, applications running in containers share the host operating system’s kernel. This greatly reduces resource consumption and startup time, but also loosens security isolation. By default, the root user in a container has the same user ID as root on the host, so an application that breaks out of its container, for instance through a kernel or runtime vulnerability, has full access to the entire host.

There are several ways to avoid this. When running on stock Kubernetes, you can simply modify the setup of the container for your application (typically described in a Dockerfile, the text file from which a Docker image is built) to create specific users with limited rights, and run the application under these identities.
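Another of those ways is to have Kubernetes itself refuse to run a container as root, through the container's "securityContext". The sketch below builds such a pod manifest as a plain Python dictionary and prints it as JSON, which "kubectl apply -f" accepts as well as YAML. The pod name, image name and user ID are placeholders chosen for illustration.

```python
import json


def non_root_pod(name: str, image: str, uid: int = 10001) -> dict:
    """Build a Pod manifest whose single container must run as a non-root user."""
    return {
        "apiVersion": "v1",
        "kind": "Pod",
        "metadata": {"name": name},
        "spec": {
            "containers": [{
                "name": name,
                "image": image,
                "securityContext": {
                    # The container is not started if it would run as UID 0.
                    "runAsNonRoot": True,
                    # Hypothetical unprivileged UID; it should match a user
                    # that the image is able to run as.
                    "runAsUser": uid,
                    # Prevent the process from gaining more privileges later.
                    "allowPrivilegeEscalation": False,
                },
            }],
        },
    }


if __name__ == "__main__":
    # Placeholder names, for illustration only.
    pod = non_root_pod("my-service", "registry.example.com/my-service:1.0.0")
    print(json.dumps(pod, indent=2))
```

Setting this in the manifest complements, rather than replaces, creating a dedicated user in the image: the image still has to work when run under that user.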

2. Configure resource requests and limits

Kubernetes is a container orchestrator. Ask Kubernetes to run a Docker image, and it will select “nodes” (machines), create “pods” (entities to manage containers), and run the image. For Kubernetes to achieve good hardware allocation, it is recommended to declare the typical hardware resource consumption of a pod for a given image.

With the “requests” property, you can tell Kubernetes what resources a pod absolutely requires for running a given container. Resist the urge to systematically ask for big machines — it would make you consume more than needed, which will cost you in the long run.

With the “limits” property, you can tell Kubernetes the amount of resources a pod is not supposed to exceed for a given container. See it as the last safeguard against CPU loops and memory leaks: Kubernetes throttles a container that reaches its CPU limit, and kills one that exceeds its memory limit.
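As a rough illustration, here is how requests and limits appear in a container specification, written as a Python dictionary that can be dumped to JSON. The figures are arbitrary placeholders, not recommendations; sensible values come from measuring your own application.

```python
import json


def container_with_resources(name: str, image: str) -> dict:
    """Build a container spec that declares resource requests and limits."""
    return {
        "name": name,
        "image": image,
        "resources": {
            # What the scheduler reserves on a node for this container.
            "requests": {"cpu": "250m", "memory": "256Mi"},
            # The ceiling: CPU is throttled at this value, and the container
            # is killed if its memory use goes beyond it.
            "limits": {"cpu": "500m", "memory": "512Mi"},
        },
    }


if __name__ == "__main__":
    # Placeholder names, for illustration only.
    spec = container_with_resources("my-service", "registry.example.com/my-service:1.0.0")
    print(json.dumps(spec, indent=2))
```

Here "250m" means a quarter of a CPU core and "256Mi" means 256 mebibytes.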

3. Specify pod anti-affinity

Assigning a pod to a node is not limited to finding a machine matching specific hardware requirements. Consider a critical service having high availability requirements: when deploying several instances of a server — several “replicas” of the same “pod” in Kubernetes parlance — running each on different machines is generally desirable.

This can be accomplished by specifying “anti-affinity,” or rules preventing a given pod from being scheduled on some nodes: in this instance, nodes already running a replica of that pod.
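A minimal sketch of such a rule follows, as the "affinity" section of a pod template expressed as a Python dictionary. It assumes the replicas carry a hypothetical "app" label; the "kubernetes.io/hostname" topology key makes "one node" the unit that replicas must not share.

```python
import json


def spread_across_nodes(app_label: str) -> dict:
    """Build an affinity section that keeps replicas with the same label on different nodes."""
    return {
        "podAntiAffinity": {
            # Hard rule: a pod is not scheduled on a node that already
            # runs a pod matching the selector below.
            "requiredDuringSchedulingIgnoredDuringExecution": [{
                "labelSelector": {"matchLabels": {"app": app_label}},
                "topologyKey": "kubernetes.io/hostname",
            }],
        },
    }


if __name__ == "__main__":
    # "my-service" is a placeholder label value.
    print(json.dumps({"affinity": spread_across_nodes("my-service")}, indent=2))
```

With a hard rule like this one, a replica stays pending when no eligible node is left; the "preferredDuringSchedulingIgnoredDuringExecution" variant expresses the same wish as a preference instead.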

4. Configure the liveness and readiness probes

Liveness and readiness are ways for applications to communicate their health. Configuring both helps Kubernetes manage pods’ lifecycles correctly.

The liveness probe assesses whether the applications running in a pod are answering in an acceptable amount of time. When an application enters a faulty state, its liveness signal should reflect it, so that Kubernetes can decide the best course of action, such as restarting the faulty container.

The readiness probe tells whether a pod can receive traffic. Like the liveness probe, it is checked continuously; a pod that fails it stops receiving traffic from its services until it passes again, which makes it a great way to temporarily disconnect pods from traffic.
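For illustration, the two probes could be declared on a container as follows, again as a Python dictionary. The sketch assumes the application exposes two hypothetical HTTP endpoints, "/healthz" and "/ready", on port 8080; the timings are placeholders.

```python
import json


def http_probes(port: int = 8080) -> dict:
    """Build liveness and readiness probe settings for a container spec."""
    return {
        "livenessProbe": {
            # Hypothetical endpoint that answers 200 while the process is healthy.
            "httpGet": {"path": "/healthz", "port": port},
            "initialDelaySeconds": 10,  # leave the application time to start
            "periodSeconds": 10,
            "failureThreshold": 3,      # restart after three failures in a row
        },
        "readinessProbe": {
            # Hypothetical endpoint that answers 200 only when traffic can be served.
            "httpGet": {"path": "/ready", "port": port},
            "periodSeconds": 5,
            "failureThreshold": 1,      # stop routing traffic on the first failure
        },
    }


if __name__ == "__main__":
    # Merge the probes into a placeholder container spec.
    container = {"name": "my-service", "image": "registry.example.com/my-service:1.0.0"}
    print(json.dumps({**container, **http_probes()}, indent=2))
```

Keeping the two endpoints separate lets an application say "I am alive but busy" without being restarted.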

5. Specify a Pod Disruption Budget

Pods can be terminated at any time. This is called a “disruption.” Involuntary disruptions are triggered by exceptional events, like hardware failures. Voluntary disruptions are initiated deliberately, by a person or by automation, for example when a node is drained for maintenance or an upgrade.

Defining a “Pod Disruption Budget” tells Kubernetes how many pods of a service must stay available (or how many may be unavailable) when a voluntary disruption happens. This is crucial for improving the availability of your system, as it prevents Kubernetes from voluntarily terminating too many instances of your services at once.
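A budget of this kind might look like the sketch below, built as a Python dictionary. The "app" label and the object name are hypothetical. The "policy/v1" API version assumes Kubernetes 1.21 or later; older clusters served this resource as "policy/v1beta1".

```python
import json


def pod_disruption_budget(app_label: str, min_available: int) -> dict:
    """Build a PodDisruptionBudget keeping at least `min_available` pods running."""
    return {
        "apiVersion": "policy/v1",
        "kind": "PodDisruptionBudget",
        "metadata": {"name": f"{app_label}-pdb"},
        "spec": {
            # Voluntary evictions are refused if they would leave fewer
            # matching pods than this.
            "minAvailable": min_available,
            "selector": {"matchLabels": {"app": app_label}},
        },
    }


if __name__ == "__main__":
    # Placeholder values: keep at least 2 replicas of "my-service" running.
    print(json.dumps(pod_disruption_budget("my-service", 2), indent=2))
```

The alternative field "maxUnavailable" expresses the same budget from the other side; only one of the two can be set on a given budget.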

6. Handle the “SIGTERM” signal

When a shutdown is decided, Kubernetes notifies the pod that it is about to be terminated.

This is notably done by sending the “SIGTERM” signal to the main process of each container in the pod. Make sure your applications react appropriately to it (e.g. by closing connections or saving their state) and stop gracefully before the termination grace period (30 seconds by default) expires, after which the remaining processes are killed.
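One possible shape for this in a Python application is sketched below. The handler only records that a shutdown was requested; the main loop notices the flag, stops taking new work, and runs the cleanup. The class name, the "run" function and the "work" and "cleanup" callables are hypothetical stand-ins for your own application code.

```python
import signal
import time


class GracefulShutdown:
    """Record that SIGTERM was received so the main loop can stop cleanly."""

    def __init__(self):
        self.requested = False
        # Must be called from the main thread of the process.
        signal.signal(signal.SIGTERM, self._on_sigterm)

    def _on_sigterm(self, signum, frame):
        # Keep the handler tiny: only set a flag, do the real work elsewhere.
        self.requested = True


def run(shutdown, work, cleanup, poll_interval=0.1):
    """Call `work` repeatedly until shutdown is requested, then call `cleanup`."""
    while not shutdown.requested:
        work()
        time.sleep(poll_interval)
    # Close connections, flush buffers, save state.
    cleanup()


if __name__ == "__main__":
    shutdown = GracefulShutdown()
    run(
        shutdown,
        work=lambda: print("working..."),
        cleanup=lambda: print("SIGTERM received, shutting down gracefully"),
        poll_interval=1.0,
    )
```

Note that the signal only reaches this code if the Python process is the one Kubernetes signals; when the container starts it through a wrapper shell script, the script has to hand over with "exec" or forward the signal itself.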

7. Prefer declarative manifests

There are two ways to interact with Kubernetes: the imperative way (asking Kubernetes to create, update or delete entities) or the declarative way (sending Kubernetes a manifest describing a target state).

The declarative way makes the description of your setup independent from the state Kubernetes is currently in. This is key for rolling back painlessly. Note that for this to work correctly, manifests themselves must reference external resources such as container images through stable identifiers like digests (content hashes), and not through mutable tags such as “latest.”
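To make the last point concrete, here is a small sketch of a check that could run before a manifest is applied: it lists the container images of a Deployment-style manifest that are not pinned by a sha256 digest. The helper names and the example image references are made up for illustration.

```python
import re

# An image reference pinned by digest ends with "@sha256:" and 64 hex digits.
_DIGEST = re.compile(r"@sha256:[0-9a-f]{64}$")


def is_pinned(image: str) -> bool:
    """Return True if the image reference ends with a sha256 digest."""
    return _DIGEST.search(image) is not None


def unpinned_images(manifest: dict) -> list:
    """List the container images of a Deployment-style manifest that lack a digest."""
    containers = manifest["spec"]["template"]["spec"]["containers"]
    return [c["image"] for c in containers if not is_pinned(c["image"])]


if __name__ == "__main__":
    # Made-up manifest with one pinned and one unpinned image.
    manifest = {
        "spec": {"template": {"spec": {"containers": [
            {"name": "api", "image": "registry.example.com/api@sha256:" + "a" * 64},
            {"name": "sidecar", "image": "registry.example.com/sidecar:latest"},
        ]}}},
    }
    for image in unpinned_images(manifest):
        print(f"not pinned by digest: {image}")
```

A tag like "latest" can point to a different image tomorrow, so re-applying yesterday's manifest would not necessarily bring back yesterday's system; a digest always designates the same image content.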

While we see these practices as good recommendations for working with Kubernetes, they are by no means absolute. Your team will have its own interpretation, and will come up with its own set of tips. When this happens, log them and share them, so that we can all benefit from our discoveries!

About Benoit Perrot

Benoit Perrot is director of engineering at Algolia.
