So you wanna write Kubernetes controllers?

(ahmet.im)

244 points · gokhan · 1y ago · 119 comments

58 comments · 7 top-level

Vampiero1y ago· 31 in thread

Why do devops keep piling abstractions on top of abstractions?

There's the machine. Then the VM. Then the container. Then the orchestrator. Then the controller. And it's all so complex that you need even more tools to generate the configuration files for the former tools.

I don't want to write a Kubernetes controller. I don't even know why it should exist.

stouset1y ago

Right now I’m typing on a glass screen that pretends to have a keyboard on it that is running a web browser developed with a UI toolkit in a programming language that compiles down to an intermediate bytecode that’s compiled to machine code that’s actually interpreted as microcode on the processor, half of it is farmed out to accelerators and coprocessors of various kinds, all assembled out of a gajillion transistors that neatly hide the fact that we’ve somehow made it possible to make sand think.

The number of layers of abstraction you’re already relying on just to post this comment is nigh uncountable. Abstraction is literally the only way we’ve continued to make progress in any technological endeavor.

zug_zug1y ago

I think the point is that there are abstractions that require you to know almost nothing (e.g. that my laptop has an SSD with blocks that are constantly dying is abstracted to a filesystem that looks like a basic tree structure).

Then there are abstractions that may actually increase cognitive load: "What if instead of thinking about chairs, we philosophically think about ALL standing furniture types, stools, tables, etc. They may have 4 legs, 3, 6? What about car seats too?"

AFAICT writing a Kubernetes controller is probably an overkill, challenge-yourself-level exercise (e.g. a quine in BF), because odds are that for any resource you've ever needed to manage, somebody else has already built an automated way to do it.

Would love to hear other perspectives though if anybody has great examples of when you really couldn't succeed without writing your own kubernetes controller.

zenethian1y ago

Seemingly endlessly layered abstraction is also why phones and computers get faster and faster yet nothing seems to actually run better. Nobody wants to write native software anymore because there are too many variations of hardware and operating systems but everyone wants their apps to run on everything. Thus, we are stuck in abstraction hell.

I'd argue the exact opposite has happened. We have made very little progress because everything is continually abstracted out to the least common denominator, leaving accessibility high but features low. Very few actual groundbreaking leaps have been accomplished with all of this abstraction; we've just made it easier to put dumb software on more devices.

petercooper1y ago

Then all of that data is turned into HTTP requests which turn into TCP packets distributed over IP over wifi over Ethernet over PPPoE over DSL and probably turned into light sent over fiber optics at various stages... :-)

ok1234561y ago

The problem isn't abstractions. The problem is leaky abstractions that make it harder to reason about a system and add lots of hidden states and configurations of that state.

What could have been a static binary running a system service has become a Frankenstein mess of opaque nested environments operated by action at a distance.

danielklnstn1y ago

CRDs and their controllers are perhaps the reason Kubernetes is as ubiquitous as it is today - the ability to extend clusters effortlessly is amazing and opens up the door for so many powerful capabilities.

> I don't want to write a Kubernetes controller. I don't even know why it should exist.

You can take a look at Crossplane for a good example of the capabilities that controllers allow for. They're usually encapsulated in Kubernetes add-ons and plugins, so, much as you might never have to write an operating system driver yourself, you might never have to write a Kubernetes controller yourself.

raffraffraff1y ago

One of the first really pleasant surprises I got while learning was that the kubectl command itself was extended (along with tab completion) by CRDs. So install external secrets operator and you get tab complete on those resources and actions.

dijit1y ago

> Why do devops keep piling abstractions on top of abstractions?

Mostly, because developers keep trying to replace sysadmins with higher levels of abstraction. Then when they realise that they require (some new word for) sysadmins still, they pile on more abstractions again and claim they don't need them.

The abstraction du jour is not Kubernetes at the moment, it's FaaS. At some point managing those FaaS will require operators again and another abstraction on top of FaaS will exist, some kind of FaaS orchestrator, and the cycle will continue.

robertlagrant1y ago

I think it's clear that Kubernetes et al aren't trying to replace sysadmins. They're trying to massively increase the ratio of sysadmin:machine.

GiorgioG1y ago

I don’t want Kubernetes period. Best decision we’ve made at work is to migrate away from k8s and onto AWS ECS. I just want to deploy containers! DevOps went from something you did when standing up or deploying an application, to an industry-wide jobs program. It’s the TSA of the software world.

frazbin1y ago

If I may ask, just to educate myself

where do you keep the ECS service/task specs and how do you mutate them across your stacks?

How long does it take to stand up/decomm a new instance of your software stack?

How do you handle application lifecycle concerns like database backup/restore, migrations/upgrades?

How have you supported developer stories like "I want to test a commit against our infrastructure without interfering with other development"?

I recognize these can all be solved for ECS but I'm curious about the details and how it's going.

I have found Kubernetes most useful when maintaining lots of isolated tenants within limited (cheap) infrastructure, esp when velocity of software and deployments is high and has many stakeholders (customer needs their demo!)

nijave1y ago

ECS is very very similar to Kubernetes and duplicates pretty much all of the functionality except AWS names and manages each piece as a separate service/offering.

ECS+Route53+ALB/ELB+EFS+Parameter Store+Secrets Manager+CloudWatch (Metrics, Logs, Events)+VPC+IAM/STS and you're pretty close in functionality.

Spivak1y ago

I'm so confused about the jobs program thing. I'm an infra engineer who has had the title devops for parts of my career. I feel like I've always been desperately needed by teams of software devs that don't want to concern themselves with the gritty reality of actually running software in production. The job kinda sucks but for some reason jibes with my brain. I take a huge amount of work and responsibility off the plates of my devs and my work scales well to multiple teams and multiple products.

I've never seen an infra/devops/platform team not swamped with work and just spinning their tires on random unnecessary projects. We're more expensive on average than devs, harder to hire, and two degrees separated from revenue. We're not a typically overstaffed role.

k8sToGo1y ago

It is always this holier-than-thou attitude of software engineers towards DevOps that is annoying. Especially if it comes from ignorance.

These days DevOps is often done by former software engineers rather than "old fashioned" sysadmins.

Just because you are ignorant of how to use AKS efficiently doesn't mean your alternative is better.

blazing2341y ago

Why don't you just deploy to cloud run on gcp and call it a day

mugsie1y ago

That's great if that works for you, and for a lot of people and teams. You have just shifted the complexity of networking, storage, firewalling, IP management, and L7 proxying to AWS, but hey, you do have click ops there.

> DevOps went from something you did when standing up or deploying an application, to an industry-wide jobs program. It’s the TSA of the software world.

DevOps was never a job title, or process, it was a way of working, that went beyond yeeting to prod, and ignoring it.

From that one line, you never did devops - you did dev, with some deployment tools (that someone else wrote?)

bshacklett1y ago

K8s really isn't about piling up abstractions. The orchestrator sits beside containers (which can be run on bare metal, btw) and handles tasks which already need to be done. Orchestration of any system is always necessary. You can do it with K8s (or a related platform), or you can cobble together custom shell scripts, or even perform the tasks manually.

One of these gives you a way to democratize the knowledge and enable self-service across your workforce. The others result in tribal knowledge being split into silos all across an organization. If you're just running a couple of web servers and rarely have to make changes, maybe the manual way is OK for you. For organizations with many different systems that have complex interactions with each other, the time it takes to get a change through a system and the number of potential errors that manual tasks add are just infeasible.

Controllers are just one way to bring some level of sanity to all of the different tasks which might be required to maintain any given system. Maybe you don't need your own custom controllers, as there are a huge number which have already been created to solve the most common requirements. Knowing how to write them allows one to codify business rules, reduce human error, and get more certainty over the behavior of complex systems.

globular-toast1y ago

Because, like it or not, that's how we build big things.

A bridge connects two otherwise separate geographical regions. To a government it's an abstract piece of infrastructure that will have economic and social impacts. To users it's a convenience that will change the way they plan journeys. To traffic planners it's another edge in a graph. To cartographers it's another line on a map. To road builders it's another surface to tarmac. To geologists it sits on a (hopefully) stable foundation that isn't expected to move or subside for at least a few hundred years. To cement people it's made of a particular blend that's the product of a specialised industry and expected to last for a hundred years. To metal workers it's reinforced with steel with particular strengths and weaknesses.

Nobody understands it all. Abstraction is not the source of complexity, abstraction is how we deal with complexity. The complexity is just there whether you want it or not. You think it's easy because you're the guy walking across the bridge.

solatic1y ago

Current example from work: an extreme single-tenant architecture, deployed for a large number N of tenants, which need both logical and physical isolation; the cost of the cloud provider's managed databases is considered Too Expensive to create one per tenant, so an open-source Kubernetes controller for the database is used instead.

Not all systems are small-N modern multi-tenant architectures deployed at small scale.

bg241y ago

This is the point. Right tool for the job. Kubernetes was incubated at Google and designed for deployments at scale. Lots of teams are happily using it. But it is definitely not for startups or solo devs, unless you are an expert user already.

ryandv1y ago

You have some computing resource that needs to be provisioned according to the specifications laid out in a Kubernetes manifest (YAML). Something needs to go out and actually "physically" create or retrieve that resource, with all the side-effects that involves, bring its state into accordance with whatever the manifest specifies, and continuously make adjustments when the resource's state diverges from the manifest throughout the lifetime of the resource.

One example is a controller responsible for fulfilling ACME challenges to obtain x509 certificates. Something needs to actually publish the challenge responses somewhere on the internet, retrieve the x509 certificate, and then persist it onto the cluster so that it may be used by other applications. Something needs to handle certificate renewal on an ongoing basis. That something is the controller.
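The loop described above can be sketched in a few lines. This is only an illustration, assuming an in-memory `FakeCluster` in place of a real API server and a made-up 90-day issuance; the class, the function name and the numbers are hypothetical and are not how any real ACME controller is implemented.

```python
from dataclasses import dataclass, field


@dataclass
class FakeCluster:
    """Hypothetical in-memory stand-in for the real state a controller acts on."""
    certificates: dict[str, int] = field(default_factory=dict)  # name -> days until expiry


def reconcile(desired: dict[str, int], cluster: FakeCluster) -> list[str]:
    """Bring the cluster's certificates in line with the declared spec.

    `desired` maps a certificate name to the minimum remaining validity in days.
    Returns the list of actions taken so the caller can log or test them.
    """
    actions = []
    for name, min_days in desired.items():
        remaining = cluster.certificates.get(name)
        if remaining is None:
            cluster.certificates[name] = 90  # pretend a fresh 90-day cert was obtained
            actions.append(f"issue {name}")
        elif remaining < min_days:
            cluster.certificates[name] = 90  # pretend the cert was renewed
            actions.append(f"renew {name}")
    # Anything that exists but is no longer declared gets removed.
    for name in list(cluster.certificates):
        if name not in desired:
            del cluster.certificates[name]
            actions.append(f"delete {name}")
    return actions


cluster = FakeCluster(certificates={"old.example.com": 60, "api.example.com": 10})
desired = {"api.example.com": 30, "www.example.com": 30}

first_pass = reconcile(desired, cluster)
second_pass = reconcile(desired, cluster)  # already converged, so nothing to do
```

The point of the shape is that `reconcile` can be called again at any time: once actual state matches desired state it does nothing, which is what lets a controller run it continuously.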

MathMonkeyMan1y ago

> I don't want to write a Kubernetes controller. I don't even know why it should exist.

I don't want to write one either. Given the choice, I won't even touch one.

I think I know why they exist, though. Kubernetes is a system of actors (resources) and events (state transitions). If you want to derive new state from existing state, and to maintain that new state, then you need something that observes "lower" state transitions and takes action on the system to achieve its desired "higher" state.

Whether we invent terminology for these things or not, controllers exist in all such systems.

mugsie1y ago

Yeah, for a lot of companies, this is way overkill. That's fine, don't use it! In the places I have seen use it when it is actually needed, the controller makes a lot of work for teams disappear. It exists because that's how K8s itself works: it's how it translates from a deployment -> replica set -> pod -> container.

Abstractions are useful to stop 100,000s of lines of boilerplate code. Same reason we have Terraform providers, Ansible modules, and well, the same concepts in programming ...

ianburrell1y ago

How do you run multiple copies of an application? How do you start a new copy when one fails? How do you deploy changes to the system? That is the orchestrator.

What do you do when the site gets really popular and needs new copies? What happens when you fill the VMs? If you want to automate it, that is a controller.

Also, if you are running on-premise, you don't need a VM; you can use the whole machine for Kubernetes and containers for isolation. If you need more isolation, you can run VM containers; being able to switch is an advantage of Kubernetes.
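As a rough illustration of the "needs new copies" decision, here is the proportional scaling rule that the Kubernetes Horizontal Pod Autoscaler documentation describes, reduced to a pure function. The function name, the clamping bounds and the sample numbers are assumptions made for this sketch.

```python
import math


def desired_replicas(current_replicas: int, current_metric: float,
                     target_metric: float, min_replicas: int = 1,
                     max_replicas: int = 10) -> int:
    """Scale the replica count in proportion to how far the metric is from target.

    The result is clamped to [min_replicas, max_replicas] so one noisy reading
    cannot scale to zero or fill every machine.
    """
    raw = math.ceil(current_replicas * current_metric / target_metric)
    return max(min_replicas, min(max_replicas, raw))


# 4 copies at 90% CPU with a 60% target: scale out.
scale_out = desired_replicas(4, 90, 60)
# 3 copies at 20% CPU with a 60% target and a floor of 2: scale in, but not below 2.
scale_in = desired_replicas(3, 20, 60, min_replicas=2)
# A traffic spike is capped by max_replicas.
capped = desired_replicas(4, 300, 60, max_replicas=10)
```

A controller would evaluate something like this periodically and write the result back as the new desired replica count; the orchestrator then does the actual starting and stopping.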

chrismarlow91y ago

Because most places never needed kubernetes but used it to put their technical debt on a credit line. So what do you do when they try to collect? Well you just take out another loan to pay off the first one.

nejsjsjsbsb1y ago

Because the works on my machine meme, plus the cattle not pets lore.

Why do this for relational databases? Why do I need to write a pg extension and SQL and an ORM when I can just write to disk?

antonvs1y ago

If you're implementing a distributed system that needs to manage many custom resources (of whatever kind, not Kubernetes-specific), implementing a Kubernetes controller for it can save a great deal of development time and give you a better system in the end, with standard built-in observability, manageability, deployment automation, and a whole lot else.

It's certainly true that some use of Kubernetes is overkill. But if you actually need what it offers, it can be a game-changer. That's a big reason why it caught on so fast in big enterprises.

Don't fall into the trap of thinking that because you don't understand the need for something, that the need doesn't exist.

nijave1y ago

I'm always surprised when people say Kubernetes is overkill in the context of distributed systems. You'll end up running all the same stuff yourself but have to manage the integration yourself as well (traffic/L7, config, storage, app instances, network/L1-4)

javcasas1y ago

Why do developers keep piling abstractions on top of abstractions?

There is machine code. Then the assembler. Then the compiler, that targets the JVM. Then the programming language. Then classes and objects. And then modules. And then design patterns. And then architectural patterns.

Why should all of this exist?

...

Well, because each level is intended to provide something the previous levels cannot provide.

My last "operator" (not really an operator, but conceptually similar) is Airflow. Because Kubernetes doesn't have a way to chain job executions, as in "run this job after these two jobs finished".
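To make "run this job after these two jobs finished" concrete, here is a small sketch of dependency-ordered scheduling using Python's standard `graphlib`. It is not how Airflow works internally; the job names are hypothetical and the "run" step is simulated by recording batches.

```python
from graphlib import TopologicalSorter

# Hypothetical jobs; each key maps to the set of jobs that must finish first.
dependencies = {
    "extract_a": set(),
    "extract_b": set(),
    "merge": {"extract_a", "extract_b"},  # runs after both extracts
    "report": {"merge"},
}


def plan_batches(deps: dict[str, set[str]]) -> list[list[str]]:
    """Group jobs into batches; every job in a batch has all its prerequisites done."""
    sorter = TopologicalSorter(deps)
    sorter.prepare()  # raises graphlib.CycleError if the graph has a cycle
    batches = []
    while sorter.is_active():
        ready = sorted(sorter.get_ready())  # sorted only to make the output stable
        batches.append(ready)
        sorter.done(*ready)  # a real scheduler would mark jobs done as they finish
    return batches


batches = plan_batches(dependencies)
```

Jobs inside one batch have no ordering constraint between them, so a scheduler is free to run them in parallel.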

hnarayanan1y ago

Job security

sofixa1y ago

> There's the machine. Then the VM. Then the container. Then the orchestrator

If you're running your orchestrator on top of VMs, you're doing it wrong (or you're at a very small scale or just getting started).

clx751y ago· 9 in thread

At work we are using Metacontroller to implement our "operators". Quoted because these are not real operators but rather Metacontroller plugins, written in Python. All the watch and update logic - plus the resource caching - is outsourced to Metacontroller (which is written in Go). We define - via its CompositeController or DecoratorController CRDs - what kind of resources it should watch and which web service it should call into when it detects a change. The web service speaks plain HTTP (or HTTPS if you want).

In case of a CompositeController, the web service gets the created/updated/deleted parent resource and any already existing child resources (initially none). The web service then analyzes the parent and existing children, then responds with the list of child resources whose existence and state Metacontroller should ensure in the cluster. If something is left out from the response compared to a previous response, it is deleted.
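A minimal sketch of what such a sync handler could look like as a pure function. The `parent`/`children`/`status` field names follow my reading of the Metacontroller docs and should be checked against them; the Project shape (`spec.volumes` entries with `name` and `size`), the child naming scheme and the PVC settings are invented for illustration, and the HTTP server around the function is left out.

```python
def sync(request: dict) -> dict:
    """Map a parent Project resource to the full list of children that should exist.

    The function is pure: same parent in, same children out. Anything omitted
    from the returned list is what the comment above describes as being deleted.
    """
    parent = request["parent"]
    name = parent["metadata"]["name"]
    volumes = parent.get("spec", {}).get("volumes", [])

    children = [
        {"apiVersion": "v1", "kind": "Namespace", "metadata": {"name": name}},
        {"apiVersion": "v1", "kind": "ServiceAccount",
         "metadata": {"name": f"{name}-sa", "namespace": name}},
    ]
    for vol in volumes:  # one PVC per declared project volume (hypothetical schema)
        children.append({
            "apiVersion": "v1", "kind": "PersistentVolumeClaim",
            "metadata": {"name": f"{name}-{vol['name']}", "namespace": name},
            "spec": {"accessModes": ["ReadWriteMany"],
                     "resources": {"requests": {"storage": vol["size"]}}},
        })
    return {"status": {"childCount": len(children)}, "children": children}


request = {
    "parent": {"metadata": {"name": "apollo"},
               "spec": {"volumes": [{"name": "data", "size": "10Gi"},
                                    {"name": "scratch", "size": "1Gi"}]}},
    "children": {},
}
response = sync(request)

# Dropping a volume from the spec simply drops its PVC from the next response.
request["parent"]["spec"]["volumes"].pop()
smaller = sync(request)
```

Because the handler never talks to the cluster itself, it can be unit-tested with plain dictionaries like the ones above.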

Things we implemented using this pattern:

- Project: declarative description of a company project, child resources include a namespace, service account, IAM role, SMB/S3/FSX PVs and PVCs generated for project volumes (defined under spec.volumes in the Project CR), ingresses for a set of standard apps

- Job: high-level description of a DAG of containers, the web service works as a compiler which translates this high-level description into an Argo Workflow (this will be the child)

- Container: defines a dev container, expands into a pod running an sshd and a Contour HTTPProxy (TCP proxy) which forwards TLS-wrapped SSH traffic to the sshd service

- KeycloakClient: here the web service is not pure - it talks to the Keycloak Admin REST API and creates/updates a client in Keycloak whose parameters are given by the CRD spec

So far this works pretty well and makes writing controllers a breeze - at least compared to the standard kubebuilder approach.

https://metacontroller.github.io/metacontroller/intro.html

JeffMcCune1y ago

As other sibling comments suggest these use cases are better solved with a generator.

The rendered manifest pattern is a simpler alternative. Holos [1] is an implementation of the pattern using well typed CUE to wrap Helm and Kustomize in one unified solution.

It too supports Projects, they’re completely defined by the end user and result in the underlying resource configurations being fully rendered and version controlled. This allows for nice diffs for example, something difficult to achieve with plain ArgoCD and Helm.

[1]: https://holos.run/docs/overview/

Kinrany1y ago

The rendered manifests pattern is a great read by itself: https://akuity.io/blog/the-rendered-manifests-pattern

ec1096851y ago

Curious why you're using a controller for these aspects versus generating the K8s objects as part of your deployment pipeline and just applying them? The latter gives you versioned artifacts you can roll forward and back and independent deployment of these supporting pieces with each app.

Is there runtime dynamism that you need the control loop to handle beyond what the built-in primitives can handle?

clx751y ago

Some of the resources are short-lived, including jobs and dev containers. The corresponding CRs are created/updated/deleted directly in the cluster by the project users through a REST API. For these, expansion of the CR into child resources must happen dynamically.

Other CRs are realized through imperative commands executed against a REST API. Prime example is KeycloakRealm and KeycloakClient which translate into API calls to Keycloak, or FSXFileSystem which needs Boto3 to talk to AWS (at least for now, until FSXFileSystem is also implemented in ACK).

For long-lived resources up-front (compile time?) expansion would be possible, we just don't know where to put the expansion code. Currently long-lived resource CRs are stored in Git, deployment is handled with Flux. When projects want an extra resource, we just commit it to Git under their project-resources folder. I guess we could somehow add an extra step here - running a script? - which would do the expansion and store the children in Git before merging desired state into the nonprod/prod branches, I'm just not clear on how to do this in a way that feels nice.

Currently the entire stack can be run on a developer's laptop, thanks to the magic of Tilt. In local dev it comes in really handy that you can just change a CR and the children are synced immediately.

Drawbacks we identified so far:

- If we change the expansion logic, child resources of existing parents are (eventually) regenerated using the new logic. This can be a bad thing - for example jobs (which expand into Argo Workflows) should not change while they are running. Currently the only idea we have to mitigate this problem is storing the initial expansion into a ConfigMap and returning the original expansion from this "expansion cache" if it exists at later syncs.

- Sometimes the Metacontroller plugin cannot be a pure function and executing the side effects introduces latency into the sync. This didn't cause any problems so far but maybe will as it goes against the Metacontroller design expressed in the docs.

- Python is a memory hog, our biggest controllers can take ~200M.
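The "expansion cache" idea from the first drawback could look roughly like this. It is a sketch only: a plain dict stands in for the ConfigMap, the parent's `metadata.uid` is assumed as the cache key, and `expand_v1`/`expand_v2` are hypothetical old and new versions of the expansion logic.

```python
from typing import Callable

Expansion = list[dict]


def expand_once(parent: dict, cache: dict[str, Expansion],
                expand: Callable[[dict], Expansion]) -> Expansion:
    """Return the stored expansion for this parent, computing it only on first sync."""
    key = parent["metadata"]["uid"]
    if key not in cache:
        cache[key] = expand(parent)
    return cache[key]


def expand_v1(parent: dict) -> Expansion:
    return [{"kind": "Workflow", "steps": parent["spec"]["steps"]}]


def expand_v2(parent: dict) -> Expansion:
    # A later release of the expansion logic that produces a different child.
    return [{"kind": "Workflow", "steps": parent["spec"]["steps"], "retries": 3}]


cache: dict[str, Expansion] = {}
job = {"metadata": {"uid": "job-1"}, "spec": {"steps": ["build", "test"]}}

first = expand_once(job, cache, expand_v1)
later = expand_once(job, cache, expand_v2)  # logic changed, running job is untouched

new_job = {"metadata": {"uid": "job-2"}, "spec": {"steps": ["build"]}}
fresh = expand_once(new_job, cache, expand_v2)  # new parents get the new logic
```

The trade-off is that a cached parent also ignores later edits to its own spec, so this only suits resources that are meant to be immutable once started.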

remram1y ago

The choice is always between a controller and a generator.

The advantage of a controller is that it can react to external conditions, for example nodes/pods failing, etc. This is great for e.g. a database where you need to fail over and update EndpointSlices. The advantage of a generator is that it can be tested more easily, it can be dry-run, and it is much simpler.

All of your examples seem to me like use cases that would be better implemented with a generator (e.g. Helm, or any custom script outputting YAML) than a controller. Any reason you wrote these as controllers anyway?
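For comparison, a generator in this sense can be as small as a function that returns a manifest. The sketch below is an assumption-laden example, not a recommendation of any tool: the function name, app name and image are hypothetical, and it emits JSON, which `kubectl` accepts in place of YAML.

```python
import json


def render_deployment(name: str, image: str, replicas: int = 1) -> dict:
    """Build a Deployment manifest as plain data; no cluster access is needed."""
    labels = {"app": name}
    return {
        "apiVersion": "apps/v1",
        "kind": "Deployment",
        "metadata": {"name": name, "labels": labels},
        "spec": {
            "replicas": replicas,
            "selector": {"matchLabels": labels},
            "template": {
                "metadata": {"labels": labels},
                "spec": {"containers": [{"name": name, "image": image}]},
            },
        },
    }


manifest = render_deployment("web", "registry.example.com/web:1.0", replicas=3)
rendered = json.dumps(manifest, indent=2, sort_keys=True)  # stable output diffs cleanly
```

Since the output is just text, it can be diffed, committed, and reviewed before anything touches the cluster, which is the testability and dry-run advantage described above.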

KGunnerud1y ago

I've seen different approaches to controllers. Sometimes it should have been a generator instead, but the problem with generators is that they don't allow (in the same sense) for abstractions at the same level as controllers.

E.g. at one company I worked at, they made a manifest to deploy apps that, in v1, was very close to Deployment. It felt like overkill. As they iterated, suddenly you got ACLs that changed NetworkPolicy in Calico (yes, can be done with a generator), then they added Istio manifests, then they added app authorizations for EntraID, which again provisioned an EntraID client and injected a certificate into pods. All I did was add: this app, in this namespace, can talk to me, and I got all this for "free". They code in the open so some of the documentation is here: https://docs.nais.io/explanations/nais/

One day, they decided to change from Istio to LinkerD. We users changed nothing. The point is, the controller was 2 things: 1: for us users to have a golden path and 2: for the platform team themselves to have an abstraction over some features of kube. Although I do see that it might be easy to make poor abstractions as well, e.g. just because you don't create a Deployment (it's done for you), you still have to own that Deployment and all other kube constructs.

I'm currently in an org that does not have this and I keep missing it every, every day.

Kinrany1y ago

Even if a controller is necessary, wouldn't you still want to have a generator for the easy stuff?

Kinda like "functional core, imperative shell"?

fsniper1y ago

At work we are using nolar/kopf for writing controllers that provision/manage our Kubernetes clusters. This also includes managing any infrastructure-related apps that we deploy on them.

We were using whitebox controller at the start, which, like Metacontroller, runs your scripts on Kubernetes events. That was easy to write. However, not having full control over the lifecycle of the controller code gets in the way from time to time.

Considering you are also writing Python, did you review kopf before deciding on Metacontroller?

clx751y ago

Yes, we started with Kopf.

As we understood it, Kopf lets you build an entire operator in Python, with the watch/update/cache/expansion logic all implemented in Python. But the first operator we wrote in it just didn't feel right. We had to talk to the K8S API from Python to do all the expansions. It was too complex. We also had aesthetic issues with the Kopf API.

Metacontroller gave us a small Go binary which takes care of all the complex parts (watch/update/cache). Having to write only the expansion part in Python felt like a great simplification - especially now that we have Pydantic.

liampulles1y ago· 6 in thread

I used to be fascinated by the automation power of Kubernetes custom components. The declarative approach and reconciliation loop offers so many possibilities for creating higher level descriptions of domain specific infrastructure.

On reflection though, I think this stuff can lead to a lot of complexity layers which don't benefit the product relative to the time investment. You are probably not Google.

fragmede1y ago

The funny thing about that is that Google doesn't use Kubernetes internally because it doesn't scale to their level. Borg is more advanced than Kubernetes in the ways that Google needs, so really Kubernetes is dumbed down for everyone else, and everyone else isn't Google scale (except for those that are, e.g. Meta has Twine). So yeah, you're probably not Google, but people out there are Tinder or Reddit or Pinterest, and they all shouldn't have to reinvent the wheel.

osigurdson1y ago

Not Google, but we leverage a lot of compute at work and use Kubernetes for that. However, I use it even on small side projects as well because I am on the other side of the learning curve. The control plane is free in some cloud providers or can run locally. It brings a lot of consistency between on-premise and various cloud providers and is easy to use once you get the hang of it.

znpy1y ago

There was an article by one of the original creators of Kubernetes on the mistakes they made when developing the thing for release to the public.

I can’t find it again and I suspect the original author has deleted it.

One of the points, for example, was to go with IPv6 from the start, and another was about storage.

If anybody has a link, please paste it :)

But yes, the “public” Kubernetes is a dumbed-down version of Borg. But frankly I think it’s mostly two things:

1. They probably had to redesign the things that in borg were too specific to the custom google infrastructure

2. They didn’t want to give away the _really_ good things, the competitive advantage.

sofixa1y ago

> The declarative approach and reconciliation loop offers so many possibilities for creating higher level descriptions of domain specific infrastructure.

Terraform running on a schedule gets you 3/4 of the way there for 5% of the complexity though.

kmac_1y ago

Terraform's not an orchestrator, it's for something totally different.

osigurdson1y ago

Helm gets you 99% of the way there with less complexity than running Terraform in a loop. Terraform is great for bootstrapping Kubernetes of course.

branislav1y ago· 2 in thread

Controllers are a complex topic, but as the linked talk describes, it all comes down to some basic control theory concepts. I wrote about them in my Desired state systems post https://branislavjenco.github.io/desired-state-systems/ if somebody wants a high-level overview of how to think about them.

Basically, declarative state implies value semantics which makes it easier to reason about. Underlying complexity is high though, and you need to judge how necessary it is.

Kinrany1y ago

I always thought that React and Kubernetes indeed have a lot in common. Thank you for the post!

gokhanOP1y ago

Very good read, thank you.

never_inline1y ago· 2 in thread

I'd ask people to please not write operators unless absolutely necessary.

I used a certain tool which had its own config format, and its "cloudnative" operator implemented CRDs, of which multiple can exist, and they would update the config file in some mounted volume. Such a thing is hell to debug. Why can't we just store the config file in a configmap/secret and listen to changes?

(If we had a better templating solution than helm, I think quite a few operators wouldn't need to exist.)

dilyevsky1y ago

Operators that just render a config are a waste of resources. Operators that manage state of other operators (e.g. Deployment) or external resources (e.g. Crossplane) are a really convenient, developer-friendly way of solving infra automation challenges.

pas1y ago

> can't we just store the config file in configmap/ secret and listen to changes?

isn't that what an operator also does basically? could you explain the problem with operators in more detail? thanks!

neuroelectron1y ago· 1 in thread

No not really

antirez1y ago

Came here looking for this comment.

Havoc1y ago

Low barrier to entry was not a phrase I was expecting in that article.

Either way I’m going to try my hardest to avoid this. K8s is hard enough to get right as is

j / k navigate · click thread line to collapse

119 comments

58 comments · 7 top-level

Vampiero1y ago· 31 in thread

Why do devops keep piling abstractions on top of abstractions?

I don't want to write a Kubernetes controller. I don't even know why it should exist.

stouset1y ago

zug_zug1y ago

Would love to hear other perspectives though if anybody has great examples of when you really couldn't succeed without writing your own kubernetes controller.

3 more replies

zenethian1y ago

4 more replies

petercooper1y ago

ok1234561y ago

The problem isn't abstractions. The problem is leaky abstractions that make it harder to reason about a system and add lots of hidden states and configurations of that state.

What could have been a static binary running a system service has become a Frankenstein mess of opaque nested environments operated by action at a distance.

danielklnstn1y ago

> I don't want to write a Kubernetes controller. I don't even know why it should exist.

raffraffraff1y ago

dijit1y ago

> Why do devops keep piling abstractions on top of abstractions?

robertlagrant1y ago

I think it's clear that Kubernetes et al aren't trying to replace sysadmins. They're trying to massively increase the ratio of sysadmin:machine.

2 more replies

GiorgioG1y ago

frazbin1y ago

If I may ask, just to educate myself

where do you keep the ECS service/task specs and how do you mutate them across your stacks?

How long does it take to stand up/decomm a new instance of your software stack?

How do you handle application lifecycle concerns like database backup/restore, migrations/upgrades?

How have you supported developer stories like "I want to test a commit against our infrastructure without interfering with other development"?

I recognize these can all be solved for ECS but I'm curious about the details and how it's going.

2 more replies

nijave1y ago

ECS is very very similar to Kubernetes and duplicates pretty much all of the functionality except AWS names and manages each piece as a separate service/offering.

ECS+Route53+ALB/ELB+EFS+Parameter Store+Secrets Manager+CloudWatch (Metrics, Logs, Events)+VPC+IAM/STS and you're pretty close in functionality.

Spivak1y ago

k8sToGo1y ago

It is always this holier-than-thou attitude of software engineers towards DevOps that is annoying. Especially if it comes from ignorance.

These days DevOps is often done by former software engineers rather than "old-fashioned" sysadmins.

Just because you are ignorant of how to use AKS efficiently doesn't mean your alternative is better.

blazing2341y ago

Why don't you just deploy to Cloud Run on GCP and call it a day?

mugsie1y ago

> DevOps went from something you did when standing up or deploying an application, to an industry-wide jobs program. It’s the TSA of the software world.

DevOps was never a job title or a process; it was a way of working that went beyond yeeting to prod and ignoring it.

From that one line, you never did DevOps. You did dev, with some deployment tools (that someone else wrote?).

bshacklett1y ago

globular-toast1y ago

Because, like it or not, that's how we build big things.

solatic1y ago

Not all systems are small-N modern multi-tenant architectures deployed at small scale.

bg241y ago

ryandv1y ago

MathMonkeyMan1y ago

> I don't want to write a Kubernetes controller. I don't even know why it should exist.

I don't want to write one either. Given the choice, I won't even touch one.

Whether we invent terminology for these things or not, controllers exist in all such systems.

mugsie1y ago

Abstractions are useful for avoiding hundreds of thousands of lines of boilerplate code. It's the same reason we have Terraform providers, Ansible modules, and, well, the same concepts in programming ...

ianburrell1y ago

How do you run multiple copies of an application? How do you start a new copy when one fails? How do you deploy changes to the system? That is the orchestrator.

What do you do when the site gets really popular and needs new copies? What happens when you fill the VMs? If you want to automate it, that is a controller.
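To make the "start a new copy when one fails" part concrete, here is a minimal sketch of a reconcile step in Python. It is not Kubernetes code: the `Action` type, the `reconcile` function, and the `web-N` naming scheme are all made up for illustration. The idea is just to compare the desired state with what is observed and emit whatever actions close the gap.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Action:
    """A hypothetical instruction for the orchestrator."""
    kind: str  # "start" or "stop"
    name: str


def reconcile(desired_replicas: int, running: list[str], app: str = "web") -> list[Action]:
    """Compare desired state with observed state and return the actions that close the gap."""
    actions: list[Action] = []
    if len(running) < desired_replicas:
        # Too few copies: start replacements under names that are not already taken.
        taken = set(running)
        index = 0
        while len(running) + len(actions) < desired_replicas:
            name = f"{app}-{index}"
            if name not in taken:
                actions.append(Action("start", name))
                taken.add(name)
            index += 1
    elif len(running) > desired_replicas:
        # Too many copies: stop the surplus, in sorted order.
        for name in sorted(running)[desired_replicas:]:
            actions.append(Action("stop", name))
    return actions


# One copy ("web-1") has died, so the loop asks for it to be started again.
print(reconcile(3, ["web-0", "web-2"]))
```

A real controller would run this step repeatedly (on a timer or on change events) and would have to cope with actions that fail halfway, which this sketch ignores.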

chrismarlow91y ago

nejsjsjsbsb1y ago

Because of the "works on my machine" meme, plus the "cattle, not pets" lore.

Why do this for relational databases? Why do I need to write a pg extension and SQL and an ORM when I can just write to disk?

antonvs1y ago

It's certainly true that some use of Kubernetes is overkill. But if you actually need what it offers, it can be a game-changer. That's a big reason why it caught on so fast in big enterprises.

Don't fall into the trap of thinking that because you don't understand the need for something, the need doesn't exist.

nijave1y ago

javcasas1y ago

Why do developers keep piling abstractions on top of abstractions?

Why should all of this exist?

...

Well, because each level is intended to provide something the previous levels cannot provide.

My last "operator" (not really an operator, but conceptually similar) is Airflow. Because Kubernetes doesn't have a way to chain job executions, as in "run this job after these two jobs finished".
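The "run this job after these two jobs finished" requirement boils down to walking a dependency graph in topological order. A minimal sketch using Python's standard `graphlib` module follows; it is not how Airflow is implemented, and the job names and the `run_in_dependency_order` helper are made up for illustration.

```python
from graphlib import TopologicalSorter

# Hypothetical job names; each key maps to the set of jobs it must wait for.
dependencies = {
    "extract_a": set(),
    "extract_b": set(),
    "merge": {"extract_a", "extract_b"},
}


def run_in_dependency_order(deps, run):
    """Call run(job) for every job, never before all of its dependencies have finished."""
    sorter = TopologicalSorter(deps)
    sorter.prepare()  # raises graphlib.CycleError if the graph has a cycle
    finished = []
    while sorter.is_active():
        # get_ready() returns the jobs whose dependencies are all marked done.
        for job in sorted(sorter.get_ready()):
            run(job)
            sorter.done(job)
            finished.append(job)
    return finished


order = run_in_dependency_order(dependencies, lambda job: print("running", job))
```

Here the jobs run sequentially in one process. A scheduler would instead launch each ready job as its own container and call `done` only when that container exits successfully.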

hnarayanan1y ago

Job security

sofixa1y ago

> There's the machine. Then the VM. Then the container. Then the orchestrator

If you're running your orchestrator on top of VMs, you're doing it wrong (or you're at a very small scale or just getting started).

clx751y ago· 9 in thread

Things we implemented using this pattern:

- Job: high-level description of a DAG of containers, the web service works as a compiler which translates this high-level description into an Argo Workflow (this will be the child)

- Container: defines a dev container, expands into a pod running an sshd and a Contour HTTPProxy (TCP proxy) which forwards TLS-wrapped SSH traffic to the sshd service

- KeycloakClient: here the web service is not pure - it talks to the Keycloak Admin REST API and creates/updates a client in Keycloak whose parameters are given by the CRD spec

So far this works pretty well and makes writing controllers a breeze - at least compared to the standard kubebuilder approach.

https://metacontroller.github.io/metacontroller/intro.html
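For readers who have not seen the pattern: as far as I understand Metacontroller's CompositeController, it POSTs a JSON body containing the parent object to a sync webhook, and the webhook replies with the desired children plus a status. The core of such a service can therefore be a pure function. The sketch below is not the commenter's code; the `image` spec field, the `-dev` name suffix, and the default image are assumptions for illustration.

```python
def sync(request: dict) -> dict:
    """Translate a parent custom resource into the child objects it should own.

    Pure function: same parent in, same children out, no calls to any API.
    """
    parent = request["parent"]
    name = parent["metadata"]["name"]
    spec = parent.get("spec", {})

    # Hypothetical child: a single pod running an sshd container.
    pod = {
        "apiVersion": "v1",
        "kind": "Pod",
        "metadata": {"name": f"{name}-dev", "labels": {"app": name}},
        "spec": {
            "containers": [
                {
                    "name": "sshd",
                    # "image" is an assumed field of the hypothetical CRD spec.
                    "image": spec.get("image", "example/sshd:latest"),
                    "ports": [{"containerPort": 22}],
                }
            ]
        },
    }
    # The reply carries the status to set on the parent and the desired children.
    return {"status": {"children": 1}, "children": [pod]}


reply = sync({"parent": {"metadata": {"name": "alice"}, "spec": {"image": "img:1"}}, "children": {}})
```

Wrapping `sync` in any HTTP framework that parses the JSON request and serializes the returned dict is enough to serve it as a webhook, and the function itself can be unit-tested without a cluster.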

JeffMcCune1y ago

As other sibling comments suggest these use cases are better solved with a generator.

The rendered manifests pattern is a simpler alternative. Holos [1] is an implementation of the pattern using well-typed CUE to wrap Helm and Kustomize in one unified solution.

[1]: https://holos.run/docs/overview/

Kinrany1y ago

The rendered manifests pattern is a great read by itself: https://akuity.io/blog/the-rendered-manifests-pattern

ec1096851y ago

Is there runtime dynamism that you need the control loop to handle beyond what the built-in primitives can handle?

clx751y ago

Currently the entire stack can be run on a developer's laptop, thanks to the magic of Tilt. In local dev it comes in really handy that you can just change a CR and the children are synced immediately.

Drawbacks we identified so far:

Python is a memory hog, our biggest controllers can take ~200M.

remram1y ago

The choice is always between a controller and a generator.

KGunnerud1y ago

I'm currently in an org that does not have this, and I miss it every single day.

Kinrany1y ago

Even if a controller is necessary, wouldn't you still want to have a generator for the easy stuff?

Kinda like "functional core, imperative shell"?

fsniper1y ago

At work we are using nolar/kopf for writing controllers that provision and manage our Kubernetes clusters. This also includes managing any infrastructure-related apps that we deploy on them.

Considering you are also writing Python, did you review kopf before deciding on metacontroller?

clx751y ago

Yes, we started with Kopf.

liampulles1y ago· 6 in thread

On reflection though, I think this stuff can lead to a lot of complexity layers which don't benefit the product relative to the time investment. You are probably not Google.

fragmede1y ago

osigurdson1y ago

znpy1y ago

There was an article by one of the original creators of Kubernetes about the mistakes they made when developing the thing for public release.

I can’t find it again, and I suspect the original author has deleted it.

One of the points, for example, was to go with IPv6 from the start, and another was about storage.

If anybody has a link, please paste it :)

But yes, the “public” Kubernetes is a dumbed-down version of Borg. But frankly I think it’s mostly two things:

1. They probably had to redesign the things that in Borg were too specific to the custom Google infrastructure

2. They didn’t want to give away the _really_ good things, the competitive advantage.

sofixa1y ago

> The declarative approach and reconciliation loop offers so many possibilities for creating higher level descriptions of domain specific infrastructure.

Terraform running on a schedule gets you 3/4 of the way there for 5% of the complexity though.

kmac_1y ago

Terraform's not an orchestrator, it's for something totally different.

osigurdson1y ago

Helm gets you 99% of the way there with less complexity than running Terraform in a loop. Terraform is great for bootstrapping Kubernetes, of course.

branislav1y ago· 2 in thread

Basically, declarative state implies value semantics which makes it easier to reason about. Underlying complexity is high though, and you need to judge how necessary it is.

Kinrany1y ago

I always thought that React and Kubernetes indeed have a lot in common. Thank you for the post!

gokhanOP1y ago

Very good read, thank you.

never_inline1y ago· 2 in thread

I'd ask people to please not write operators unless absolutely necessary.

(If we had a better templating solution than Helm, I think quite a few operators wouldn't need to exist.)

dilyevsky1y ago

pas1y ago

> can't we just store the config file in configmap/ secret and listen to changes?

Isn't that basically what an operator does too? Could you explain the problem with operators in more detail? Thanks!

neuroelectron1y ago· 1 in thread

No, not really.

antirez1y ago

Came here looking for this comment.

Havoc1y ago

Low barrier to entry was not a phrase I was expecting in that article.

Either way, I’m going to try my hardest to avoid this. K8s is hard enough to get right as is.
