Dhall: A Non-Repetitive Alternative to YAML

(dhall-lang.org)

193 points · ff_ · 6y ago · 177 comments

101 comments · 24 top-level

markandrewj6y ago· 8 in thread

My comment isn't specifically about Dhall, but about the note on Turing completeness. I often read comments about how YAML/JSON is not Turing complete. These comments normally frame the lack of Turing completeness as being a shortcoming of the format(s). I find this interesting because one of the reasons that the industry moved away from XML was to have cleaner separation between data and logic. I generally tend to think that it is cleaner to separate logic and data, instead of creating a tight coupling. I don't read many people making comments from this perspective though. I am not trying to say we can't do better than YAML/JSON, I am just trying to offer some food for thought. I tend to view JSON/YAML as a data exchange format, and not a programming language, so I am not bothered by the lack of Turing completeness.

nine_k6y ago

Not being Turing-complete is a feature.

A Turing-complete language allows you to write programs that never terminate. This is not what a config file should be capable of.

dharmab6y ago

Previously in my career I've abused Jinja templating to build "scripts" out of Ansible and SaltStack YAML. It solved business problems effectively but I'm sure when I left that role I passed on a big plate of spaghetti to my successor with minimal automated tests.

If it depends on conditional logic or iteration, it probably belongs in a proper programming language with a linter, type checkers, debugger and unit test framework.

comex6y ago

Of all the bugs you've ever dealt with in programs in general, what fraction of them were infinite loops?

For me I'd guess maybe... 0.1%? It's definitely under 1%.

Given that, it makes no sense to me that I'd want to make myself jump through hoops to express some basic coding patterns [1], just to rule out that single class of bugs. It seems like a solution in search of a problem.

[1] https://github.com/dhall-lang/dhall-lang/wiki/How-to-transla...

tannhaeuser6y ago

> one [of] the reasons that the industry moved away from XML was to have cleaner separation between data and logic

How is XML coupling data and logic? The only kind of "processing" it does by itself I can think of is composing documents from pieces and "processing instructions" as a generic extension mechanism. That is, features to support its original use case of authoring and capturing structured text. Now SGML has more processing features (tag inference, stylesheets/link processes, notations), but is still far away from Turing-completeness.

markandrewj6y ago

I am not sure if this completely answers your question, but I am talking about XML in a SOAP based architecture.

javier26y ago

I 100% agree. It should be considered an important feature that my configuration files (and transfer data) don't suffer from the halting problem.

dragonwriter6y ago

> I find this interesting because one of the reasons that the industry moved away from XML was to have cleaner separation between data and logic.

That's not really true or even sensible, since XML doesn't combine data and logic. Sure, there were XML-based logic languages (most notably XSLT) as well as XML-based data languages, but while all were applications of XML they were separate languages.

XML lost ground to JSON, etc., as the fashion pendulum swung away from heavyweight tooling and detailed specs for most things (though it's swinging back again), and to some closer-to-memory-layout binary formats as efficiency became a concern in some of the places where rigid specs remained important.

markandrewj6y ago

Hello, just to clarify I am specifically talking about how XML was used in the 90's for web development, i.e. SOAP.

zeliard6y ago· 7 in thread

Couple of contenders:

- Jsonnet (https://jsonnet.org/) - simpler syntax and fewer concepts to learn, just an extension of JSON. But no type checking. An open source offspring of Google's internal config language (GCL/BCL)

- Cue (https://github.com/cuelang/cue) - a more ambitious attempt to fix GCL/BCL by replacing inheritance as the fundamental compositional primitive with constraint unification.

Great thread comparing them against each other by the authors of both: https://github.com/cuelang/cue/issues/33

Cue seems kind of similar to Dhall at first sight, but I haven't used either enough for an informed opinion yet.

lihaoyi6y ago

We make heavy use of Jsonnet at work (https://databricks.com/blog/2017/06/26/declarative-infrastru...). It's worked great. Having a hermetic, pure templating system whose only output is a set of JSON/YAML files means that you can refactor fearlessly: as long as the materialized JSON/YAML doesn't change, you are 100% sure your refactor is safe. Bazel's StarLark dialect of Python (https://github.com/bazelbuild/starlark) has similar benefits.
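To make the "refactor fearlessly" point concrete, here is a minimal Python sketch of the same workflow: render the config before and after a refactor and compare the materialized output. The two generator functions and the service fields are hypothetical stand-ins, not anything from the linked Databricks setup.

```python
import json


def render_before(replicas):
    # Hypothetical original generator: every service is spelled out by hand.
    return {
        "web": {"image": "app:1.0", "replicas": replicas, "port": 8080},
        "worker": {"image": "app:1.0", "replicas": replicas, "port": 8080},
    }


def render_after(replicas):
    # Hypothetical refactor: the shared fields live in one helper.
    def service():
        return {"image": "app:1.0", "replicas": replicas, "port": 8080}

    return {name: service() for name in ("web", "worker")}


def materialize(config):
    # Canonical serialization, so key order cannot cause a spurious diff.
    return json.dumps(config, sort_keys=True, indent=2)


def refactor_is_safe(replicas=3):
    # The refactor is accepted only if the rendered output is byte-identical.
    return materialize(render_before(replicas)) == materialize(render_after(replicas))


print(refactor_is_safe())
```

This only works because the generators are pure: the same inputs always produce the same output, so an unchanged rendering really does mean an unchanged deployment.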

The language is simple and remarkably well specified, enough that we implemented our own intellij plugin (https://plugins.jetbrains.com/plugin/10852-jsonnet) and even our own faster compiler (https://github.com/databricks/sjsonnet) without much effort at all.

There are odd corners in the language, but not something that most people will end up bumping into in typical usage. The templates certainly get messy in large configurations, but no more messy than any other code, and the hermeticity/purity greatly helps in managing the messiness. It's certainly less messy/odd than the copy-paste configs or bespoke JSON/YAML templating systems that inevitably appear in messy deployment environments!

The last thing of note is the lack of static types: this definitely affects usability to some extent, and especially hinders IDE support from being as useful as it is in e.g. Java. But having a useful/ergonomic type system that fits this specific problem space is probably still an unsolved research question.

dharmab6y ago

We tried to introduce Jsonnet at our org. It failed miserably because ops kept mistaking the name for JSON which they hated. (International multilingual team).

It was a real shame because ops then implemented some features of Jsonnet via scripts to parse and merge YAML. What was 0 LOC in Jsonnet is now about 300 LOC plus custom CI checkers, all because of a marketing problem.

Fnoord6y ago

If I go to the mentioned Jsonnet homepage it says "A simple extension of JSON". The graphic explains its relation to JSON. The example looks awfully similar to JSON.

What I don't understand is the following: config files are read by text editors, and in the end, by human beings. Because of the latter they should have certain traits. We must agree on the importance of these traits before we can settle on a standard.

For me, important features are that they must be readable, and easily editable. They must be readable with a certain text editor (vi) for backwards compatibility. So that means it shouldn't require syntax highlighting or schema. Well, these 2 simple requirements of mine rule out anything remotely resembling JSON.

It just appears to me that JSON is for JavaScript developers, YAML for Python developers, and Dhall for ML (the whole family I suppose, not just Haskell) developers.

Well then if we're going that route then perhaps all we need is some kind of glue between text config and binary config (which reminds me of Systemd...), i.e. something that accepts multiple config file formats.

cppforlife6y ago

i'll throw in one of my projects as a contender: ytt - YAML templating tool - https://get-ytt.io (check out live playground!).

it works with yaml structures (hence avoids text templating problems) and uses familiar python-like language, starlark, making it quite easy to get started. it makes use of yaml comments to assign metadata/templating directives to yaml nodes, so it looks something like this:

  #@ load("@ytt:data", "data")

  #@ def labels():
  app: echo
  org: test
  #@ end

  kind: Pod
  apiVersion: v1
  metadata:
    name: echo-app
    labels: #@ labels()
  spec:
    containers:
    #@ for/end echo in data.values.echos:
    - name: #@ echo.name
      image: hashicorp/http-echo
      args:
      - #@ "-text=" + echo.text

it doesn't include type checking, however, it does have a system to "overlay" structures on top of each other via overlay feature -- https://github.com/k14s/ytt/blob/master/docs/lang-ref-ytt-ov.... merge/replace/remove operations expect to find one node by default so map key typos or wrong structural nesting problems are caught easily in common cases.

jbergstroem6y ago

My favorite is still the nginx config-like libucl: https://github.com/vstakhov/libucl

Ericson23146y ago

Cue is a bit too cute trying to combine the subtyping and inhabitance relations into one.

skybrian6y ago

What problems do you see?

isoprophlex6y ago· 7 in thread

To me, worrying about config files seems like the ultimate exercise in bikeshedding.

You either need a simple list of items (eg. dependencies) or key/value pairs. Use a text file or yml or json or whatever.

Or you need templating, the use of functions, etc, like dhall provides. But then, why not use the language you're already using for the rest of your project, or a bash script to export some variables?

Might sound like I'm throwing sourness around, but I just don't see the niche for this, except inventing a new thing for the joy of it?

zeliard6y ago

>But then, why not use the language you're already using for the rest of your project, or a bash script to export some variables?

Well, for the same reason that, say, people would use a javascript framework to build a webapp over vanilla js. Both could do the job, and for simple cases there's little reason to go with a framework or a specialized config language.

But as your app/config gets larger and more complex, using a framework or a config language would tend to get the job done more efficiently by providing you with structure and a toolbox of solutions to common pain points.

Config generators themselves tend to be a rather heavyweight all-or-nothing solution which leads people to compromise on some adhoc middle-ground solutions like YAML with jinja templates with unclear evaluation semantics. A good config language designed from the ground up can be so much better than this unholy yaml/jinja mess!

Finally, one of the key selling points of specifically Dhall is type checking. Implementing that in config generators in a generic untyped scripting language would be a nontrivial amount of boilerplate, and boilerplate elimination is what config languages are all about.

leg1006y ago

> You either need a simple list of items (eg. dependencies) or key/value pairs. Use a text file or yml or json or whatever.

You clearly haven't wrestled with kubernetes configuration files.

Reams and reams of YAML, heavily indented to represent umpteen nested objects.

Templating, bash scripts, using your favourite language to roll your own config generator - these are all well trodden approaches that fail to scale.

Official shitshows like Helm have come along with nothing more innovative to offer than templated YAML. The next version uses lua to generate config, but I remain skeptical given previous design choices.

We can question whether kubernetes has pushed the dial too far towards necessitating mountains of config, but for now it is most definitely a problem for us users.

mbrock6y ago

My favorite is when people write scripts that concatenate YAML files in such a way that you have to be really careful about how this one file always needs to be indented by 12 spaces otherwise everything breaks in horrible ways.

afiori6y ago

One reason is that "configuration language" is an extremely wide topic. Some configs use yaml to encode bash scripts for example.

Overall a reason enough is that human-friendly languages have different priorities than parser-friendly ones.

Honestly it is the reason I like TOML: with the exception of the date data type it cleanly maps to json (which everyone agrees is a good enough serialization format) and it is specifically focused on human friendliness (except uniform lists) and readability.

As an underappreciated feature, the ability to have scoped key/values lets you define nested tables with flat statements.
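A rough Python sketch of what "nested tables from flat statements" means, assuming the comment refers to TOML's dotted keys and `[a.b]` table headers. This is not a TOML parser; `nest` is a hypothetical helper that only mimics how dotted paths map onto nested tables.

```python
def nest(flat):
    """Expand {"a.b.c": value} style keys into nested dictionaries."""
    root = {}
    for dotted_key, value in flat.items():
        *parents, leaf = dotted_key.split(".")
        table = root
        for name in parents:
            # Create intermediate tables on demand, as a dotted path implies.
            table = table.setdefault(name, {})
        table[leaf] = value
    return root


# Flat statements, one per line, each carrying its full path.
flat = {
    "server.host": "example.org",
    "server.tls.enabled": True,
    "server.tls.port": 443,
}

nested = nest(flat)
print(nested)
```

Each flat line can be read, grepped, or commented out on its own, while the result is the same tree that a deeply indented JSON or YAML document would describe.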

isoprophlex6y ago

> Overall a reason enough is that human-friendly languages have different priorities than parser-friendly ones.

Okay thanks, that's a good one. Readability might be a big deal.

GordonS6y ago

I kind of agree, to a point (e.g. Erlang config is, usually, a complete shitshow).

I wonder if JSON had allowed for comments if we'd see such a proliferation of config systems? At least IME, that seems to be the biggest pain point with JSON.

guggle6y ago

My thoughts, exactly. Especially with dynamic, scripting languages with no need for compilation and simple enough syntax (ie. php, python, ruby, etc). Why would I want to add another layer of complexity to an application with an unknown language while I already have a capable one ? Why add more dependencies on your project ? Why add more cognitive load on your brain ?

tobr6y ago· 6 in thread

So far, only comments complaining about syntax. You can do better, HN!

nikolay6y ago

Yeah, because that's what most configuration languages differ in.

kccqzy6y ago

And yet, in focusing on the syntax, you missed the biggest difference between Dhall and other configuration languages: safe, termination-guaranteed non-Turing-complete computation.

marcosdumay6y ago

And yet, HN is commenting about a language people use to describe entire networks of computers on a few non-redundant centralized files... And all the comments are about syntax.

lelf6y ago

https://wiki.haskell.org/Wadler's_Law + https://en.wikipedia.org/wiki/Law_of_triviality

Vosporos6y ago

I'm afraid years of n-gate have shown that HN cannot do better.

mitchtbaum6y ago

n-gategate strikes again

cryptonector6y ago· 5 in thread

If you have functions that can call functions, you'd better not have recursion if you want to not be Turing-complete.

Non-Turing-completeness is certainly very important in many cases (e.g., in DTrace and eBPF), but I'm not sure that it's so important for configuration. Assuming for a moment that I don't need non-Turing-completeness for configuration, my choice of DSL would be jq[0]! Using jq for configuration means that I can use JSON, TOML-style, and other ways of expressing complex data, including combinations of them, all with "interpolation" (not quite) and complex computation being available.

  [0] https://stedolan.github.io/jq/

Quekid56y ago

One valuable point of Dhall is that it is programmable (yet not TC) in such a way that you can e.g. describe a whole system entirely in Dhall and then (in Dhall!) derive whatever further configurations (plural!) you need from that. This is much more feasible than in e.g. YAML because Dhall is strongly typed.

So you could describe e.g. a cluster of machines entirely in Dhall and derive Ansible YAML scripts (with all their boilerplate), derive DNS config files, etc. etc. all from a single strongly typed description.

Boulth6y ago

It goes even further: because Dhall supports functions one can write a function that will migrate old configs to new format. Migration function can also be type checked (it must accept old config format and emit new one). Of course all of this without TC.
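As a loose analogy in Python (not Dhall), a migration is just a function from the old config type to the new one. The `ConfigV1` and `ConfigV2` shapes below are invented for illustration, and the type hints are only enforced if you run an external checker such as mypy, whereas Dhall checks the types itself.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class ConfigV1:
    # Hypothetical old format: a single host and port.
    host: str
    port: int


@dataclass(frozen=True)
class ConfigV2:
    # Hypothetical new format: a list of endpoints plus a TLS flag.
    endpoints: List[str]
    tls: bool


def migrate(old: ConfigV1) -> ConfigV2:
    # The signature documents the contract: old format in, new format out.
    return ConfigV2(endpoints=[f"{old.host}:{old.port}"], tls=False)


print(migrate(ConfigV1(host="db.internal", port=5432)))
```

If the new format later gains a required field, every migration that forgets to supply it fails to construct `ConfigV2`, which is the kind of error the comment wants caught before deployment.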

cryptonector6y ago

I mean, jq is a powerful programming language. Did you look at the link I posted?

codebje6y ago

Recursion is fine, as long as an argument gets smaller at every iteration, since that guarantees termination.

ignaloidas6y ago

Oh, like Ackermann function[0]? Because I wouldn't want my server try to evaluate that.

[0] https://en.wikipedia.org/wiki/Ackermann_function
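For reference, a small Python version of the two-argument Ackermann function. Every recursive call makes the pair (m, n) smaller in lexicographic order, so it always terminates, yet the values explode: A(4, 2) is already 2^65536 - 3. The memoization and the raised recursion limit are conveniences of this sketch, not part of the definition.

```python
import sys
from functools import lru_cache

# The recursion is deep even for tiny inputs, so raise Python's default limit.
sys.setrecursionlimit(10_000)


@lru_cache(maxsize=None)
def ackermann(m, n):
    if m == 0:
        return n + 1
    if n == 0:
        # m decreases, so the pair (m, n) shrinks lexicographically.
        return ackermann(m - 1, 1)
    # Inner call: same m, smaller n. Outer call: smaller m.
    return ackermann(m - 1, ackermann(m, n - 1))


print(ackermann(2, 3))  # 9
print(ackermann(3, 3))  # 61
```

So "guaranteed to terminate" and "finishes in a reasonable time" are separate properties; only small arguments are safe to try here.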

nikolay6y ago· 5 in thread

Dhall keeps popping up on HN. Here's what I don't like about it:

- Why use '=' instead of ':' for attributes? If you used ':', then '=' could be variable assignment and eliminate the need for 'let'.

- Why is there a need for commas?

- Why quote via ticks?! Gee!

- What's with the '{-' and '-}' for comments?! It's like its author decided to differ at any price!

In general, good ideas, but it's too weird and unnecessarily deviates from common syntax.

duijf6y ago

Dhall heavily borrows both ideas and syntax from the ML family of languages. E.g. Haskell, OCaml, Elm, Purescript

Colons are used for type signatures.

Commas are presumably required because you can have multi line and nested records. (don't quote me on this, not a parser expert)

The comment syntax is from Haskell.

Not saying this syntax is familiar to everyone, but it is familiar to some. The lineage of the syntax might help you understand where the language is coming from

dymk6y ago

Weird that it's billed as an alternative to YAML, but effectively has zero roots or influence from YAML. Looks more like an alternative to... whatever configuration language is popular in ML language projects?

nikolay6y ago

Yeah, but how is this relevant? YAML, for example, makes JSON a subset, which lets it consume existing JSON without any need for conversion. It's easy to learn, and readable. Leading commas are eyesores. Plus, not everybody is a fan of ML! These are not mass market languages.

Thorrez6y ago

None of the examples show quoting via ticks, that seems to be a pretty obscure feature.

nikolay6y ago

Actually, at least one does and for the wrong reason (an attribute named True, for example, needs to be quoted):

  { -- Unlike YAML, Dhall does not accept YES|NO|ON|OFF
    validDhallBools = [ True, False ]
      , someNumbers = [ 1
    ,
  -- Dhall is not indentation-sensitive
  2, 3 ]
    -- Field names that conflict with reserved identifiers must be quoted
  , `True` = True
  , version = "9.3"  {- Strings must be quoted

                        All Dhall literals have unambiguous types -}
  }

andybak6y ago· 5 in thread

Immediate response?

I hate commas at the start of lines and I would prefer not to have curly braces in a human editable/readable format.

Neither reason is terribly rational but my first impressions weren't great.

piotrkubisa6y ago

It looks like Dhall was inspired by the Elm language [0] and its formatter.

[0]: https://guide.elm-lang.org/

bvaldivielso6y ago

It's a common practice in the Haskell community. Knowing where the creator of dhall comes from I would say that that's the source of inspiration

GordonS6y ago

I get that this makes diffs a tiny bit nicer when adding new lines, if you don't use trailing commas, but christ it's ugly!

Jeff_Brown6y ago

> commas at the start of lines

are not a requirement.

anentropic6y ago

OTOH I would guess it doesn't allow a trailing comma (same problem as JSON...) so you end up with weird ugly formatting conventions

oalessandr6y ago· 4 in thread

I'm trying to use it for Kubernetes since it can both work like helm (parameterizing functions) and kustomize (using the merge // operator). Moreover it has (safe) imports which make defining constants quite easy.
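The two styles can be sketched in Python as a function that takes parameters (the helm-like part) and a right-biased shallow merge (roughly what Dhall's `//` does on records). The `deployment` function and its fields are hypothetical, not taken from dhall-kubernetes.

```python
def deployment(name, image, replicas=1):
    # Helm-like: a template is just a function of its parameters.
    return {"name": name, "image": image, "replicas": replicas}


def merge(base, override):
    # Kustomize-like: fields on the right win; the merge is shallow.
    return {**base, **override}


base = deployment("echo", "hashicorp/http-echo")
production = merge(base, {"replicas": 3})

print(base)
print(production)
```

Because `merge` builds a new dictionary, the base definition stays untouched and can be reused for other environments.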

There are already kubernetes bindings available https://github.com/dhall-lang/dhall-kubernetes .

The syntax in the examples looks a bit more verbose and less readable than yaml but I think building sensible abstractions on top of it will alleviate the pain (abstractions here are innocuous since you can 'normalize' the code and they disappear)

I'm not too happy with the default formatting though. I think if the formatter indented nested values similar to yaml that would look better to the human eye.

amluto6y ago

> Moreover it has (safe) imports which make defining constants quite easy.

I read about dhall’s imports, and I don’t think I like it. If I add a text configuration mechanism to software, I do not want it accessing the network by default, full stop. To me, a “safe” configuration language means that parsing terminates, does not have side effects, does not touch the network, and that parsing the same file twice gives the same output unless I explicitly change an input. Pulling a prelude off of github misses several of these requirements.

(Having your config file fail to parse if your network is down is bad, bad news if that config is needed to bring your network up. It’s also bad news if a parsing failure due to a transient network issue leaves your system in a state where it won’t quickly recover if the network comes back.)

jose_zap6y ago

You can still do that, though. In Dhall you may import things remotely as you develop and then tell Dhall to pre-fetch the result, you can commit that and it will not access anything.

You may also just download any imports yourself and source them locally.

Additionally Dhall supports import fallbacks, for example you may try first a remote import, and if it fails it will look for another place, which could be remote or local. This is a good strategy for developing locally and then committing imports for production use.
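The fallback idea, sketched in Python rather than Dhall (where the `?` operator plays this role): try each source in order and use the first one that loads. The loader functions here are hypothetical placeholders for "fetch remotely" and "read the committed copy".

```python
def first_available(loaders):
    """Return the result of the first loader that succeeds."""
    errors = []
    for load in loaders:
        try:
            return load()
        except OSError as error:
            # Remember the failure and fall through to the next source.
            errors.append(error)
    raise OSError(f"all import sources failed: {errors}")


def remote_prelude():
    # Stand-in for a network fetch that fails while offline.
    raise ConnectionError("network down")


def local_prelude():
    # Stand-in for a copy committed next to the config.
    return "local copy of the prelude"


print(first_available([remote_prelude, local_prelude]))
```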

You can also, of course, host the files in your local network.

oalessandr6y ago

I get your point, but you don't need to run imports over the network (local imports are fine).

Also, if you were to import over the network, by running `dhall freeze` a semantic hash of the content is computed so you are 100% sure that what you are importing is not going to change. Moreover, files that have a hash value will be cached by dhall.
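A simplified Python sketch of what a pinned import buys you. Note the assumption: this hashes the raw bytes, whereas the comment above describes Dhall hashing the semantics of the expression, so treat it as an illustration of the integrity check only. `verify_import` is a made-up name.

```python
import hashlib


def verify_import(content: bytes, expected_sha256: str) -> bytes:
    """Return the content only if it matches the pinned hash."""
    actual = hashlib.sha256(content).hexdigest()
    if actual != expected_sha256:
        raise ValueError(
            f"integrity check failed: expected {expected_sha256}, got {actual}"
        )
    return content


# Pin the hash once, at the time the import is reviewed.
reviewed = b"let x = 1 in x"
pinned = hashlib.sha256(reviewed).hexdigest()

# Later fetches are accepted only if nothing changed upstream.
print(verify_import(reviewed, pinned))
```

With a pin in place, a changed upstream file becomes a loud failure instead of a silent change in behaviour, and a verified copy is safe to cache.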

If you don't want to bother with copying over Prelude and you don't trust the cache, you can also normalize the code before pushing it to the network. This will flatten all your imports and reduce your file to normal form.

You might be interested in what they say about imports here: https://github.com/dhall-lang/dhall-lang/blob/master/standar...

singpolyma36y ago

Yeah, the formatting the `dhall` CLI tool uses isn't my favourite. Though luckily I don't usually have to look at it much :)

hjk056y ago· 4 in thread

This isn’t an alternative to yaml. It’s a yaml generator. To me it’s not competing with yaml it’s competing with python or Haskell, and i’d argue that putting yet another language in your stack just for generating config files is added unneeded complexity. And sure while both python and Haskell are Turing complete, how often do we actually run into issues when generating flat config files? I mean I’ve never had that issue, and I’ve never caught myself thinking “if only there was a nice way to limit myself to a non Turing complete subset of python/Haskell”...

guggle6y ago

> i’d argue that putting yet another language in your stack just for generating config files is added unneeded complexity.

Well... I'd argue that when using Python I don't feel the need for a config file language in the first place... it's human friendly enough and I don't have to learn another syntax, use another parser, etc. I had to work on a Symfony project recently and I wish it wasn't sprinkled with all those yaml files.

> I’ve never caught myself thinking “if only there was a nice way to limit myself to a non Turing complete subset of python/Haskell”

Ditto... seems like bloat to me. There may be some use cases I don't know about but these config file languages tend to repel me.

eridius6y ago

That's kind of like saying C isn't an alternative to assembly, it's an assembly generator.

HelloNurse6y ago

On a practical level, C is an alternative to assembly because the appropriate black-box tools and combinations of tools allow the user to easily turn source code in either language or a combination of both into an executable.

Generating assembly from C is an implementation detail, and many C compilers don't do that.

On the other hand, Dhall really is a YAML generator: the available tools allow only one-way conversion (in particular, there is no interpreter/library to ingest Dhall from the configured application itself).

singpolyma36y ago

There is a tool to generate YAML from Dhall, but there are also language bindings for Ruby, JVM, and Haskell with more on the way. I don't think generating YAML will be a main use case for long.

NuSkooler6y ago· 4 in thread

Still much prefer HJSON (http://hjson.org/) for stuff that people might need to touch.

If it's truly for end-users (read: non-admin/dev types), you probably shouldn't have them touching configuration files _at all_.

epage6y ago

Don't fully remember why I prefer json5 to hjson but at a quick glance, bare values is one. Bare values are ripe for someone entering in a string and accidentally getting a bool or number instead.
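A toy Python illustration of the bare-value hazard. The inference rules below are made up for the example and are not the actual rules of HJSON or any other format; the point is only that unquoted text has to be guessed at.

```python
def infer_bare_value(text):
    """Guess the type of an unquoted value (hypothetical rules)."""
    lowered = text.lower()
    if lowered in ("true", "false"):
        return lowered == "true"
    try:
        return int(text)
    except ValueError:
        pass
    try:
        return float(text)
    except ValueError:
        # Anything that is not a bool or a number stays a string.
        return text


print(repr(infer_bare_value("hello")))  # 'hello'
print(repr(infer_bare_value("true")))   # True
print(repr(infer_bare_value("1.10")))   # 1.1
```

Someone who meant the version string "1.10" ends up with the number 1.1, and nothing flags it until the value is used.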

tobr6y ago

How is this at all related to Dhall? It looks like a completely different thing with a completely different purpose.

NuSkooler6y ago

They are both text based configuration file formats made to be easier for humans to interact with, so I'm not sure what you're confused about?

ape46y ago

Also Relaxed JSON. http://www.relaxedjson.org/

_j7tr6y ago· 4 in thread

What's with the commas at the start of lines?

jbaum986y ago

This is a common convention in some languages, most often functional languages in my experience. I associate it most with OCaml.

So it's not so surprising to see it here, seeing as Dhall is written in Haskell.

mitchtbaum6y ago

Better Syntax for Lists, Records (and Unions) #66

https://github.com/dhall-lang/dhall-lang/issues/66

edoceo6y ago

Makes it easier when commenting out. Use this trick for SQL and JS too (to prevent trailing comma issue)

dymk6y ago

I don't get why you'd build a language in 2019 which disallows a trailing comma in lists.

jrudolph6y ago· 3 in thread

Dhall is an awesome tool to have in your DevOps tool belt - we're heavy dhall users at meshcloud [0] and couldn't be happier about it. We picked it after evaluating a long list of contenders (yaml madness with anchors, jsonnet, ksonnet, j2/jinja, a hacked ejs compiler [1] and some more I forgot). It's so good we're looking into how we can give back/donate to the project.

Dhall elegantly solves a major challenge: configuration management at scale. We build a multi-cloud management platform, which serves DevOps teams, IT Governance, Controlling and IT Management in large enterprises. That means we're an integration solution for a lot of things, so we need to be highly configurable. Because we also manage private clouds (a la OpenStack, Cloud Foundry, OpenShift etc.), we often run on-premises and operate our software as a managed service. Using dhall allows us to _compile and type check_ all our configuration for all our customers before rolling things out. We use dhall to compile everything from terraform/ansible, kubernetes templates, spring config, to concourse ci pipelines and customer-specific reference data to load into our product. Since adopting dhall earlier this year, we measurably reduced our deployment defect rate and re-gained the ability to safely refactor configuration.

It takes a little time to get used to, but we appreciate that it's highly opinionated around formatting and "how to do things" - somewhat in the same way as golang is. It has certainly helped that we had a member with haskell experience on the team, as dhall is built in haskell and the syntax feels familiar.

Plug: if you're looking for a job working with dhall, reach out :-)

- 0: https://meshcloud.io
- 1: https://github.com/Meshcloud/ejs-compiler

solatic6y ago

We're also heavy Dhall users in production. Functional, strongly typed configuration is such a powerful concept that I struggle to understand how the language isn't more popular yet.

Common example: let's say I want to set up a PostgreSQL database for a service running in Kubernetes in AWS. How best to get it done?

Well, it turns out there's a number of different options: you can set up a DB through RDS, and a service in Kubernetes which directs to it through an externalName, which is probably what you want in production; you can set up Postgres as a StatefulSet, which is probably what you want in an ephemeral testing environment; or maybe you have a customer with a full-time DBA who will create the database for you and give you a connection string.

With Dhall, you set up a union type with each of these scenarios as options, and then you have a Dhall function for your Terraform and Kubernetes configurations. In your Terraform configuration, you have an RDS module where the count is set to 1 for the RDS/production scenario and 0 otherwise. In your Kubernetes configurations, you set up a service with an external name appropriately when you need to, set up a StatefulSet when that's relevant, etc.

Because they all use the same type in their function's parameter, they're guaranteed to stay consistent. You're guaranteed to never have an RDS instance setup alongside a Postgres StatefulSet. If you need to make changes (add options, change options, etc.) then you will get type errors in each and every place which forces you to address them, including in places you forgot about.
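A rough Python analogue of this pattern, with all names invented for illustration. One union type describes the database scenario, and both the Terraform-side and the Kubernetes-side functions take it as their parameter. Unlike Dhall, plain Python will not force every function to handle a newly added variant unless a static checker is used.

```python
from dataclasses import dataclass
from typing import List, Union


@dataclass(frozen=True)
class ManagedRDS:
    instance_class: str


@dataclass(frozen=True)
class InClusterStatefulSet:
    storage_gb: int


@dataclass(frozen=True)
class ExternalDatabase:
    connection_string: str


# The single source of truth for "which database scenario is this?"
Database = Union[ManagedRDS, InClusterStatefulSet, ExternalDatabase]


def terraform_rds_count(db: Database) -> int:
    # The RDS module is instantiated only in the managed scenario.
    return 1 if isinstance(db, ManagedRDS) else 0


def kubernetes_resource_kinds(db: Database) -> List[str]:
    if isinstance(db, ManagedRDS):
        return ["Service(ExternalName)"]
    if isinstance(db, InClusterStatefulSet):
        return ["StatefulSet", "Service"]
    if isinstance(db, ExternalDatabase):
        return ["Secret"]
    raise TypeError(f"unhandled database scenario: {db!r}")


prod = ManagedRDS(instance_class="db.m5.large")
print(terraform_rds_count(prod), kubernetes_resource_kinds(prod))
```

Because both outputs are derived from the same value, an RDS instance and a Postgres StatefulSet cannot be requested at the same time.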

We started to adopt Dhall more than half a year ago now and we've barely scratched the surface of what the language makes possible. Purity in infrastructure and operations is a powerful drug.

tedmiston6y ago

Heads up - Your naked subdomain redirect to www doesn't seem to be working. If I go to www directly, I don't get the timeout.

jrudolph6y ago

thanks! turns out it wasn't working for https, should be fixed now.

adev_6y ago· 3 in thread

For a pragmatic, really readable configuration file format, TOML never disappointed me ( https://github.com/toml-lang/toml#user-content-local-date ).

- This is human readable contrary to the JSON family and its {} abuses.

- It is not space/indent based, contrary to YAML, which very quickly becomes a mess to write and a mess to parse.

epage6y ago

As a fan of TOML, I want to be clear on the downsides.

TOML is good for data laid out with TOML. Representing arbitrary nested arrays and tables gets messy.

Also, the constraint on homogeneous shallow types has impacted me in some cases. Originally, I was all on board. Arrays should be homogeneous. The problem is logically homogeneous vs syntactically homogeneous.

Cargo uses tables to declare dependencies. The values are logically homogeneous, they are declarations. Syntactically, some values are strings while the rest are sub-tables. The string is just shorthand for a table though.

This feature can't be implemented in arrays like it can with tables.
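The "logically homogeneous" point can be sketched in Python: once the string shorthand is expanded, every dependency has the same shape. `normalize_dependency` is a hypothetical helper written for this example, not Cargo's implementation, and the crate names are placeholders.

```python
def normalize_dependency(spec):
    """Expand the string shorthand into the equivalent table form."""
    if isinstance(spec, str):
        # A bare string is shorthand for a table with only a version.
        return {"version": spec}
    if isinstance(spec, dict):
        return dict(spec)
    raise TypeError(f"unsupported dependency spec: {spec!r}")


dependencies = {
    "foo": "1.0",
    "bar": {"version": "0.3", "features": ["derive"]},
}

normalized = {name: normalize_dependency(spec) for name, spec in dependencies.items()}
print(normalized)
```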

nikolay6y ago

TOML is almost perfect. The only things I don't like are the need for commas and the double brackets.

smitty1e6y ago

Perfection is a bugaboo. Give me 95%, minor inconveniences, and declare vict'ry, say I.

choeger6y ago· 3 in thread

Hmm...

So the authors claim that their language is guaranteed to terminate for all well-typed programs. That is actually a nice spot for configuration languages. Yet, I wonder how

a) they guarantee it, as I have seen no obvious link to the language's semantics

b) useful this is in practice.

Nevertheless, very nice approach, indeed.

yunyu6y ago

There is no support for recursion and the usual workarounds don't apply, so the language is not Turing complete: https://github.com/dhall-lang/dhall-lang/wiki/Safety-guarant...

codebje6y ago

Build systems tend to get very complex Turing complete scripts, such as gradle for Java or Make for C. Having something almost as powerful but reducible to a normal form is very helpful for CASE tools.

skybrian6y ago

Banning Turing completeness doesn't give you the property you want, though. Knowing that reducing to a normal form eventually terminates if you wait a million years may be something mathematicians care about, but isn't of practical use.

What matters is that you can analyze the code quickly. To find that out, one way is to try it and kill the process if it takes too long.

Or perhaps better would be to come up with a portable definition of what "takes too long" means that you can put in a presubmit check. Something like "running out of gas" in Ethereum.
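The "running out of gas" idea can be sketched in a few lines of Python. This is a toy evaluator for a made-up expression format (nested tuples), not Dhall or Ethereum; the names `evaluate` and `OutOfGas` are hypothetical. The point is that a step budget is deterministic and portable, whereas a wall-clock timeout depends on the machine.

```python
class OutOfGas(Exception):
    """Raised when evaluation exceeds its step budget."""


def evaluate(expr, gas):
    """Evaluate an int or a nested ("add" | "mul", left, right) tuple.

    Returns (value, remaining_gas). Every node visited costs one unit.
    """
    if gas <= 0:
        raise OutOfGas("step budget exhausted")
    gas -= 1
    if isinstance(expr, int):
        return expr, gas
    op, left, right = expr
    lhs, gas = evaluate(left, gas)
    rhs, gas = evaluate(right, gas)
    if op == "add":
        return lhs + rhs, gas
    if op == "mul":
        return lhs * rhs, gas
    raise ValueError(f"unknown operator: {op!r}")


expr = ("add", 1, ("mul", 2, 3))  # five nodes in total
value, left_over = evaluate(expr, gas=10)
print(value)  # 7
```

A presubmit check could then reject any config whose evaluation raises `OutOfGas` under an agreed budget, independent of how fast the CI machine happens to be.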

KirinDave6y ago· 2 in thread

Dhall is fantastic and I try to encourage everyone in tech I meet to try it.

GordonS6y ago

OK, why?

KirinDave6y ago

Because it is a good mix of features, syntax, execution speed and correctness.

Of course. Didn't you read the article?

1 more reply

voidmain6y ago· 2 in thread

How far does "non turing completeness" really get you in this context? It looks easy to write a program in this language that will take longer than the age of the universe to evaluate and whose result can't be represented explicitly without collapsing the galaxy into a black hole. How much comfort can you take in the fact that you know it doesn't diverge?

mbrock6y ago

How would you write such a program?

comex6y ago

    let replicate = http://prelude.dhall-lang.org/List/replicate
    in replicate 999999999999 Natural 1

(add additional nines if necessary)

mitchtbaum6y ago· 2 in thread

This looks very useful.

mitchtbaum6y ago

looking further, it seems that aside from repetitiveness, safety is the main focus:

https://github.com/dhall-lang/dhall-lang/wiki/Safety-guarant...

in Rust, we're solving this via SANE and SCL:

https://gitlab.com/bloom42/sane-rs

https://github.com/keats/scl

I'm not sure how much need there is for an additional programming layer, especially within config (the part of a program with the simplest syntactic requirements).

for my projects where "ahead-of-time validation" is needed, we're currently using SCL's parser for safety guarantees:

https://github.com/foundpatterns/contentdb

https://github.com/foundpatterns/lighttouch/blob/d7ada4576a6...

https://github.com/foundpatterns/torchbear/blob/4dd2b9ea76ba...

ff_OP6y ago

From a cursory look at SANE and SCL, it looks like Dhall still offers some more:

- functions

- a powerful type system

- remote (HTTP) imports with sha256 checksums
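For readers unfamiliar with checksum-pinned imports, the general idea looks roughly like the Python sketch below. It hashes raw bytes for simplicity; as I understand it, Dhall hashes a normalized encoding of the imported expression instead, so treat this as an illustration of the principle only. The `verify_pinned` helper and the sample content are made up.

```python
import hashlib


def verify_pinned(content: bytes, expected_sha256: str) -> bytes:
    """Return content only if its SHA-256 digest matches the pinned value."""
    actual = hashlib.sha256(content).hexdigest()
    if actual != expected_sha256:
        raise ValueError(f"integrity check failed: got {actual}")
    return content


# Stand-in for bytes fetched from a remote URL.
fetched = b"{ replicas = 3 }"
# The digest recorded when the import was first reviewed and trusted.
pinned = hashlib.sha256(fetched).hexdigest()

verify_pinned(fetched, pinned)  # passes silently
```

If the remote file changes after the hash was recorded, `verify_pinned` raises instead of silently picking up the new content.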

desc6y ago· 2 in thread

Programmable configuration is always and without exception a monumentally stupid idea.

Programmatic generation of static configuration files can be very useful.

Sufficiently complex examples of the latter might as well be the former as far as maintenance is concerned.

If you need to write a program to configure your program, you're probably doing it wrong.

nine_k6y ago

Configs allow adding flexibility past compile time, often dynamically at runtime.

desc6y ago

Yes, that's the problem. I'd like to be able to look at a config file, on disk, loaded at startup, which defines the initial state of the server without having to think through how it was evaluated.

Generating the config during deployment, eh... often necessary. Best done with transforms and templates because they're simple.

Executable config, run during startup or, worse, on each request? NO.

[edit] I think that's the main disconnect here: 'past compile time'. The whole point of testing, strong type systems, etc is to lock down the set of states the system can be in. If your configuration is so 'dynamic' you are essentially abandoning all those benefits and saying 'yeah, do what you like to our live servers'.

In short, configuration which is that powerful is indistinguishable from running untested code in production.

1 more reply

javier26y ago· 1 in thread

I love this!

How small is a static binary to run this in my containers?

What are some ways to integrate the typed config in a language?

Gabriel4396y ago

The static binaries for the various interpreters and conversion utilities (i.e. `dhall`/`dhall-to-yaml`/`yaml-to-dhall`) are all roughly 10 MB each

The following languages natively bind to Dhall:

* Haskell
* Clojure
* Ruby

... and the following language bindings are in progress:

* Rust
* Go
* Python
* PureScript

In the absence of a native language binding, you can convert Dhall to YAML or JSON and read that in.
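As a rough illustration of the "convert and read that in" route from Python (one of the languages listed without a finished binding), the sketch below pipes Dhall source through `dhall-to-json` on standard input and parses the result. The `dhall_to_python` helper is hypothetical, and it assumes the `dhall-to-json` executable is installed and on `PATH`.

```python
import json
import subprocess


def dhall_to_python(dhall_source, command=("dhall-to-json",)):
    """Pipe Dhall source through a converter and parse the JSON it prints.

    `command` is the converter invocation; the default assumes that
    dhall-to-json is installed and reads the expression from stdin.
    """
    result = subprocess.run(
        list(command),
        input=dhall_source,
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if the converter fails
    )
    return json.loads(result.stdout)
```

With the tool installed, `dhall_to_python('{ replicas = 3 }')` should hand back an ordinary dict; type errors in the Dhall source surface as a `CalledProcessError` before the application sees any config.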

dang6y ago

Thread from 2018: https://news.ycombinator.com/item?id=17523623

2017: https://news.ycombinator.com/item?id=15185015

2016: https://news.ycombinator.com/item?id=13109672

mitchtbaum6y ago

Rust bindings tracking issue

https://github.com/Nadrieril/dhall-rust/issues/77

ilaksh6y ago

I'm sure people will be happy to crucify me for throwing this out there but I don't see a big risk in just using JavaScript in most cases if you want something like that. You could use template literals to replicate the example.

arkh6y ago

   let input =
      { relative = "daughter"
      , movies   = [ "Boss Baby", "Frozen", "Moana" ]
      }

We don't frequent the same kind of "non-technical users" I guess.

amingilani6y ago

I thought the typo in the challenge was that the keys were in the root of the user's home directory, instead of the `.ssh` directory. So, I added `.ssh/` between the key and user home directory.


markandrewj6y ago· 8 in thread

nine_k6y ago

Not being Turing-complete is a feature.

A Turing-complete language allows you to write programs that never terminate. This is not what a config file should be capable of.

dharmab6y ago

If it depends on conditional logic or iteration, it probably belongs in a proper programming language with a linter, type checkers, debugger and unit test framework.

1 more reply

comex6y ago

Of all the bugs you've ever dealt with in programs in general, what fraction of them were infinite loops?

For me I'd guess maybe... 0.1%? It's definitely under 1%.

[1] https://github.com/dhall-lang/dhall-lang/wiki/How-to-transla...

1 more reply

tannhaeuser6y ago

> one [of] the reasons that the industry moved away from XML was to have cleaner separation between data and logic

markandrewj6y ago

I am not sure if this completely answers your question, but I am talking about XML in a SOAP based architecture.

javier26y ago

I 100% agree. It should be considered an important feature that my configuration files (and transfer data) don't suffer from the halting problem.

dragonwriter6y ago

> I find this interesting because one of the reasons that the industry moved away from XML was to have cleaner separation between data and logic.

markandrewj6y ago

Hello, just to clarify I am specifically talking about how XML was used in the 90's for web development, i.e. SOAP.

zeliard6y ago· 7 in thread

Couple of contenders:

- Jsonnet (https://jsonnet.org/) - simpler syntax and fewer concepts to learn, just an extension of JSON. But no type checking. An open source offspring of Google's internal config language (GCL/BCL)

- Cue (https://github.com/cuelang/cue) - a more ambitious attempt to fix GCL/BCL by replacing inheritance as the fundamental compositional primitive with constraint unification.

Great thread comparing them against each other by the authors of both: https://github.com/cuelang/cue/issues/33

Cue seems kind of similar to Dhall at first sight, but I haven't used either enough for an informed opinion yet.

lihaoyi6y ago

dharmab6y ago

We tried to introduce Jsonnet at our org. It failed miserably because ops kept mistaking the name for JSON which they hated. (International multilingual team).

Fnoord6y ago

If I go to the mentioned Jsonnet homepage it says "A simple extension of JSON". The graphic explains its relation to JSON. The example looks awfully similar to JSON.

It just appears to me that JSON is for JavaScript developers, YAML for Python developers, and Dhall for ML (the whole family I suppose, not just Haskell) developers.

1 more reply

cppforlife6y ago

i'll throw in one of my projects as a contender: ytt - YAML templating tool - https://get-ytt.io (check out live playground!).

  #@ load("@ytt:data", "data")

  #@ def labels():
  app: echo
  org: test
  #@ end

  kind: Pod
  apiVersion: v1
  metadata:
    name: echo-app
    labels: #@ labels()
  spec:
    containers:
    #@ for/end echo in data.values.echos:
    - name: #@ echo.name
      image: hashicorp/http-echo
      args:
      - #@ "-text=" + echo.text

jbergstroem6y ago

My favorite is still the nginx config-like libucl: https://github.com/vstakhov/libucl

Ericson23146y ago

Cue is a bit too cute trying to combine the subtyping and inhabitance relations into one.

skybrian6y ago

What problems do you see?

isoprophlex6y ago· 7 in thread

To me, worrying about config files seems like the ultimate exercise in bikeshedding.

You either need a simple list of items (eg. dependencies) or key/value pairs. Use a text file or yml or json or whatever.

Might sound like I'm throwing sourness around, but I just don't see the niche for this, except inventing a new thing for the joy of it?

zeliard6y ago

>But then, why not use the language you're already using for the rest of your project, or a bash script to export some variables?

leg1006y ago

> You either need a simple list of items (eg. dependencies) or key/value pairs. Use a text file or yml or json or whatever.

You clearly haven't wrestled with kubernetes configuration files.

Reams and reams of YAML, heavily indented to represent umpteen nested objects.

Templating, bash scripts, using your favourite language to roll your own config generator - these are all well trodden approaches that fail to scale.

We can question whether kubernetes has pushed the dial too far towards necessitating mountains of config, but for now it is most definitely a problem for us users.

mbrock6y ago

afiori6y ago

One reason is that "configuration language" is an extremely wide topic. Some configs use yaml to encode bash scripts for example.

Overall a reason enough is that human-friendly languages have different priority than parser-friendly ones.

As an underappreciated feature, the ability to have scoped key/values allows defining nested tables with flat statements.
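A small Python sketch of what "nested tables from flat statements" means in practice: dotted keys are expanded into nested dicts. The `nest` helper and the sample keys are made up for illustration, and the sketch ignores conflicts such as a key being used both as a value and as a table.

```python
def nest(flat):
    """Expand {"a.b.c": value} style keys into nested dicts."""
    root = {}
    for dotted_key, value in flat.items():
        *parents, leaf = dotted_key.split(".")
        node = root
        for part in parents:
            # Create intermediate tables on demand.
            node = node.setdefault(part, {})
        node[leaf] = value
    return root


flat = {
    "server.host": "localhost",
    "server.tls.enabled": True,
    "server.tls.port": 8443,
}
print(nest(flat))
# {'server': {'host': 'localhost', 'tls': {'enabled': True, 'port': 8443}}}
```

Each flat line can be read, diffed, and commented out on its own, while the program still receives a nested structure.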

isoprophlex6y ago

> Overall a reason enough is that human-friendly languages have different priority than parser-friendly ones.

Okay thanks, that's a good one. Readability might be a big deal.

GordonS6y ago

I kind of agree, to a point (e.g. Erlang config is, usually, a complete shitshow).

I wonder if JSON had allowed for comments if we'd see such a proliferation of config systems? At least IME, that seems to be the biggest pain point with JSON.

guggle6y ago

tobr6y ago· 6 in thread

So far, only comments complaining about syntax. You can do better, HN!

nikolay6y ago

Yeah, because that's what most configuration languages differ in.

kccqzy6y ago

And yet, in focusing on the syntax, you missed the biggest difference between Dhall and other configuration languages: safe, termination-guaranteed non-Turing-complete computation.

1 more reply

marcosdumay6y ago

And yet, HN is commenting about a language people use to describe entire networks of computers on a few non-redundant centralized files... And all the comments are about syntax.

lelf6y ago

https://wiki.haskell.org/Wadler's_Law + https://en.wikipedia.org/wiki/Law_of_triviality

Vosporos6y ago

I'm afraid years of n-gate have shown that HN cannot do better.

mitchtbaum6y ago

n-gategate strikes again

cryptonector6y ago· 5 in thread

If you have functions that can call functions, you'd better not have recursion if you want to not be Turing-complete.

  [0] https://stedolan.github.io/jq/

Quekid56y ago

Boulth6y ago

cryptonector6y ago

I mean, jq is a powerful programming language. Did you look at the link I posted?

2 more replies

codebje6y ago

Recursion is fine, as long as an argument gets smaller at every iteration, since that guarantees termination.

ignaloidas6y ago

Oh, like Ackermann function[0]? Because I wouldn't want my server try to evaluate that.

[0] https://en.wikipedia.org/wiki/Ackermann_function
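For reference, the function under discussion is short enough to write out. In the Python sketch below, every recursive call either decreases `m`, or keeps `m` and decreases `n`, so the pair `(m, n)` shrinks lexicographically and the recursion always terminates. It is only practical to run for very small arguments.

```python
def ackermann(m, n):
    """Two-argument Ackermann function on non-negative integers."""
    if m == 0:
        return n + 1
    if n == 0:
        # m decreases.
        return ackermann(m - 1, 1)
    # Inner call: same m, smaller n. Outer call: smaller m.
    return ackermann(m - 1, ackermann(m, n - 1))


print(ackermann(2, 3))  # 9
print(ackermann(3, 3))  # 61
```

Small inputs are harmless, but A(4, 2) is already 2^65536 - 3, which is the sense in which "guaranteed to terminate" says little about cost.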

2 more replies

nikolay6y ago· 5 in thread

Dhall keeps popping up on HN. Here's what I don't like about it:

- Why use '=' instead of ':' for attributes? If you used ':', then '=' could be variable assignment and eliminate the need for 'let'.

- Why is there a need for commas?

- Why quote via ticks?! Gee!

- What's with the '{-' and '-}' for comments?! It's like its author decided to differ at any price!

In general, good ideas, but it's too weird and unnecessarily deviates from common syntax.

duijf6y ago

Dhall heavily borrows both ideas and syntax from the ML family of languages, e.g. Haskell, OCaml, Elm, PureScript.

Colons are used for type signatures.

Commas are presumably required because you can have multi line and nested records. (don't quote me on this, not a parser expert)

The comment syntax is from Haskell.

Not saying this syntax is familiar to everyone, but it is familiar to some. The lineage of the syntax might help you understand where the language is coming from

dymk6y ago

4 more replies

nikolay6y ago

2 more replies

Thorrez6y ago

None of the examples show quoting via ticks, that seems to be a pretty obscure feature.

nikolay6y ago

Actually, at least one does and for the wrong reason (an attribute named True, for example, needs to be quoted):

  { -- Unlike YAML, Dhall does not accept YES|NO|ON|OFF
    validDhallBools = [ True, False ]
      , someNumbers = [ 1
    ,
  -- Dhall is not indentation-sensitive
  2, 3 ]
    -- Field names that conflict with reserved identifiers must be quoted
  , `True` = True
  , version = "9.3"  {- Strings must be quoted

                        All Dhall literals have unambiguous types -}
  }

1 more reply

andybak6y ago· 5 in thread

Immediate response?

I hate commas at the start of lines and I would prefer not to have curly braces in a human editable/readable format.

Neither reason is terribly rational but my first impressions weren't great.

piotrkubisa6y ago

It looks like Dhall was inspired by the Elm language [0] and its formatter.

[0]: https://guide.elm-lang.org/

bvaldivielso6y ago

It's a common practice in the Haskell community. Knowing where the creator of dhall comes from I would say that that's the source of inspiration

1 more reply

GordonS6y ago

I get that this makes diffs a tiny bit nicer when adding new lines, if you don't use trailing commas, but christ it's ugly!

Jeff_Brown6y ago

> commas at the start of lines

are not a requirement.

anentropic6y ago

OTOH I would guess it doesn't allow a trailing comma (same problem as JSON...) so you end up with weird ugly formatting conventions

2 more replies

oalessandr6y ago· 4 in thread

There are already kubernetes bindings available https://github.com/dhall-lang/dhall-kubernetes .

I'm not too happy with the default formatting though. I think if the formatter indented nested values similar to yaml that would look better to the human eye.

amluto6y ago

> Moreover it has (safe) imports which make defining constants quite easy.

jose_zap6y ago

You can still do that, though. In Dhall you may import things remotely as you develop and then tell Dhall to pre-fetch the result, you can commit that and it will not access anything.

You may also just download any imports yourself and source them locally.

You can also, of course, host the files in your local network.

oalessandr6y ago

I get your point, but you don't need to run imports over the network (local imports are fine).

You might be interested in what they say about imports here: https://github.com/dhall-lang/dhall-lang/blob/master/standar...

singpolyma36y ago

Yeah, the formatting the `dhall` CLI tool uses isn't my favourite. Though luckily I don't usually have to look at it much :)

hjk056y ago· 4 in thread

guggle6y ago

> i’d argue that putting yet another language in your stack just for generating config files is added unneeded complexity.

> I’ve never caught myself thinking “if only there was a nice way to limit myself to a non Turing complete subset of python/Haskell”

Ditto... seems like bloat to me. There may be some use cases I don't know about but these config file languages tend to repel me.

eridius6y ago

That's kind of like saying C isn't an alternative to assembly, it's an assembly generator.

HelloNurse6y ago

Generating assembly from C is an implementation detail, and many C compilers don't do that.

2 more replies

singpolyma36y ago

There is a tool to generate YAML from Dhall, but there are also language bindings for Ruby, JVM, and Haskell with more on the way. I don't think generating YAML will be a main use case for long.

NuSkooler6y ago· 4 in thread

Still much prefer HJSON (http://hjson.org/) for stuff that people might need to touch.

If it's truly for end-users (read: non-admin/dev types), you probably shouldn't have them touching configuration files _at all_.

epage6y ago

Don't fully remember why I prefer json5 to hjson but at a quick glance, bare values is one. Bare values are ripe for someone entering in a string and accidentally getting a bool or number instead.
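The hazard with bare values can be shown with a toy reader. The Python sketch below is a hypothetical type-inferring parser for unquoted values, written only to illustrate the failure mode; it does not reproduce the actual grammar of HJSON or any other format.

```python
def read_bare(text):
    """Guess the type of an unquoted value the way a lenient format might."""
    token = text.strip()
    if token == "true":
        return True
    if token == "false":
        return False
    if token == "null":
        return None
    try:
        return int(token)
    except ValueError:
        pass
    try:
        return float(token)
    except ValueError:
        # Anything unrecognized falls back to a plain string.
        return token


print(repr(read_bare("hello")))  # 'hello'
print(repr(read_bare("true")))   # True
print(repr(read_bare("9.3")))    # 9.3
```

Someone who meant the strings "true" or "9.3" (say, a label or a version) gets a bool or a float instead, with no error to signal the change.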

tobr6y ago

How is this at all related to Dhall? It looks like a completely different thing with a completely different purpose.

NuSkooler6y ago

They are both text based configuration file formats made to be easier for humans to interact with, so I'm not sure what you're confused about?

1 more reply

ape46y ago

Also Relaxed JSON. http://www.relaxedjson.org/

_j7tr6y ago· 4 in thread

What's with the commas at the start of lines?

jbaum986y ago

This is a common convention in some languages, most often functional languages in my experience. I associate it most with OCaml.

So it's not so surprising to see it here, seeing as Dhall is written in Haskell.

mitchtbaum6y ago

Better Syntax for Lists, Records (and Unions) #66

https://github.com/dhall-lang/dhall-lang/issues/66

edoceo6y ago

Makes it easier when commenting out. Use this trick for SQL and JS too (to prevent trailing comma issue)

dymk6y ago

I don't get why you'd build a language in 2019 which disallows a trailing comma in lists.

5 more replies

jrudolph6y ago· 3 in thread

Plug: if you're looking for a job working with dhall, reach out :-)

- 0: https://meshcloud.io
- 1: https://github.com/Meshcloud/ejs-compiler

solatic6y ago

We're also heavy Dhall users in production. Functional, strongly typed configuration is such a powerful concept that I struggle to understand how the language isn't more popular yet.

Common example: let's say I want to set up a PostgreSQL database for a service running in Kubernetes in AWS. How best to get it done?

We started to adopt Dhall more than half a year ago now and we've barely scratched the surface of what the language makes possible. Purity in infrastructure and operations is a powerful drug.

tedmiston6y ago

Heads up - Your naked subdomain redirect to www doesn't seem to be working. If I go to www directly, I don't get the timeout.

jrudolph6y ago

thanks! turns out it wasn't working for https, should be fixed now.

1 more reply

adev_6y ago· 3 in thread

For a pragmatic, really readable configuration file format, TOML never disappointed me ( https://github.com/toml-lang/toml#user-content-local-date ).

- This is human readable contrary to the JSON family and its {} abuses.

- It is not space/indent-based, contrary to YAML, which very quickly becomes a mess to write and a mess to parse.
