That will become one of the biggest blockers because of the amount of automation that already exists.
By leveraging Terraform, you will also have the added benefit of getting all the other pieces of AWS/GCP/azure components for free - and is rock stable and production tested.
The declaration of deployment state is a very BIG and hard problem that has had millions of collective man hours spent over decades. I urge you not to think of it as a simple configuration.
In fact it is so hard that AWS has to build a new language on top of typescript ..versus cloudformation templates that it already had.
https://docs.aws.amazon.com/cdk/latest/guide/home.html
What you are building makes sense - I would drop cloudformation and surface Terraform right till the top.
So the way to use your tool is to install and use a new Terraform "provider".
Seriously, Every single fucking stupid infrastructure-deployment-tool/"platform" whatever has it's own, dumb in-house language that winds up basically re-implementing the programming language the tool is written in badly.
- Puppet: Has a stupid ad-hoc config language.
- Terraform: Has a stupid ad-hoc config language.
- SaltStack: Has a stupid ad-hoc config language.
- Ansible: Has a stupid ad-hoc config language.
If you're even considering implementing a tool like this, use a goddamn existing language for your configuration files.You don't need to use the entire language, but at least use the language's lexer/parser (cf. json/javascript). That way, all existing tooling for the language will work for the config files (ask me about how saltstack happily breaks their API because you're not "supposed" to use it, despite the fact that they have public docs for it). Additionally, people won't need to figure out all the stupid corner cases in your terrible piecemeal configuration language.
Additionally, by making your configuration language an actual language, you also simplify a lot of the system design, because the configuration can act directly against your API. This means using your tool from other tools becomes much more straightforward, because the only interface you actually need is the API.
Pulumi = Terraform in Typescript. That's good as well - but i was not sure if the OP is familiar with Pulumi
- Deployments are defined with declarative configuration and no custom Docker images are required (although you can use your own if you want)
- You have full access to the instances, autoscaling groups, security groups, etc
- Less tied to AWS (GCP support is in the works)
- We are working on higher level features like prediction monitoring, alerting, and model retraining
- It's open source and free vs SageMaker's ~40% markup
I bet you could get Cortex running on Kubeflow pretty easily since it's all K8s anyway.