Douglas: The first time I saw JavaScript when it was first announced in 1995, I thought it was the stupidest thing I’d ever seen. And partly why I thought that was because they were lying about what it was.
A bigger, more interesting thing, though, is how his company failed, in part, because they used hand-rolled JSON for messaging. Douglas: And some of our customers were confused and said, “Well, where’s the enormous tool stack that you need in order to manage all of that?”
“There isn’t one, because it’s not necessary”, and they just could not understand that. They assumed there wasn’t one because we hadn’t gotten around to writing it. They couldn’t accept that it wasn’t necessary.
Adam: It’s like you had an electric car and they were like, “Well, where do we put the gas in?”
Douglas: It was very much like that, very much like that. There were some people who said, “Oh, we just committed to XML, sorry, we can’t do anything that isn’t XML.”
I started my career during the peak XML craze, and while I liked parts of it at the time, the number of things it was used for was quite insane. I had to maintain a system once where a major part of it was XSLT, when it could have just been a simple imperative algorithm with some config settings. Anyhow, hope you like the episode!
Every time the topic comes up I feel the need to say that I loved XSLT. It was so nice. XML frankly was kind of simple, too. It had elements and attributes and that was it. And it had XPath, which offered, among other things, a parent axis, so you could walk the node tree upwards.
In JSON you can't get to the parent from the child. And walking down a tree is unintuitive, because nodes can be of different types, and if you want to maintain the order, or use successive instances of the same things (that would have the same name) you need to use arrays, and arrays of arrays of arrays look bad. Schemas are an afterthought.
JavaScript is cool -- it has mostly eaten the world anyway. But JSON is not so good IMHO.
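For what it's worth, the missing parent axis can be recovered in code on either side. Here is a stdlib Python sketch (the element names and documents are invented for illustration):

```python
import json
import xml.etree.ElementTree as ET

doc = ET.fromstring("<order><items><item>widget</item></items></order>")

# ElementTree has no parent pointers either, but a one-line parent map
# restores an XPath-style parent axis:
parent = {child: p for p in doc.iter() for child in p}
item = doc.find(".//item")
assert parent[item].tag == "items"  # walk upward from the child

# With JSON you must carry the path along yourself while descending:
data = json.loads('{"order": {"items": ["widget"]}}')

def walk(node, path=()):
    """Yield (path, leaf) pairs; the path is the only way 'back up'."""
    if isinstance(node, dict):
        for key, value in node.items():
            yield from walk(value, path + (key,))
    elif isinstance(node, list):
        for i, value in enumerate(node):
            yield from walk(value, path + (i,))
    else:
        yield path, node

assert list(walk(data)) == [(("order", "items", 0), "widget")]
```

Neither workaround is hard, but in XPath the parent axis comes for free, which is the point being made above.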
XSLT was (and still is) great for transforming documents. Want that recipe collection as HTML? Easy.
- If you are describing hierarchical data, JSON is great
- If you are describing text with markup, especially extensible markup, for machine generation and consumption, XML is great.
- If you are describing a graph, neither has a broadly accepted standard, so you are kinda on your own.
Depending on your requirements, a recipe collection might be better in XML or in a flavor of markdown. A comprehensive data schema and software support for recipes could be challenging/limiting, compared to marked-up text.
You can still do XSLT in the browser. You can serve arbitrary XML and transform it. As an example, Atom feeds on my website (such as <https://chrismorgan.info/blog/tags/meta/feed.xml>) render just fine in all mainstream browsers, thanks to this processing instruction at the start of the file:
<?xml-stylesheet type="text/xsl" href="/atom.xsl"?>
But working with it is not particularly fun, because XML support in browsers has been only minimally maintained for the last twenty or so years. Error handling is atrocious (e.g. largely not giving you any stack trace or equivalent, or emitting errors only to stdout), documentation is lousy, some features you’d have expected from what the specs say are simply unsupported (and not consistently across engines), and there are behavioural bugs all over the place. E.g. in Firefox, loading any of my feeds that also fetch resources from other origins will occasionally just hang, and you’ll have to reload the page to get it to render; and if you reload the page, you’ll have to close and reopen the dev tools for them to continue working.

JSON only competes with XML. XSLT, XPath, and XSD are just as much an afterthought in that they are completely separate from XML and are entirely optional. The engines written around those are where the powers to walk the tree and validate come from, not XML itself. There's a wide range of tools to get the same benefits for JSON sources, and they usually handle XML and other data sources too, because it shouldn't matter. The reason the X* tools have fallen out of favor is that they're unnecessarily tied to a single type of source data.
Same here. XML was going to save the world! Remember XML data islands with data embedded in page source and displayed via XSLT?
The craziest thing I had to build was a tool to manage the dozens to hundreds of XML configuration files that powered our product. The tool allowed editing and deploying the files, complete with validation and even input suggestion based on associated XSD for each XML file.
I was sad to hear that Crockford is not aiming to be the author of "the next language" anymore, but I wonder how sincere that really is. His thoughts on actor-based languages are interesting.
Crockford's thoughts on actors are really interesting. I tried to pull them apart but I didn't get very far and ended up not including them in the episode.
What he is envisioning is not exactly like Erlang but not exactly like Scheme. He said that Carl Hewitt had a lot of ideas and they were hard to unpack.
If you're interested though, I would reach out to him. He is very approachable and excited to talk to people with ideas for new ways of making things simple.
The closest thing we have right now I think is Spritely Goblins, though that is Scheme. (Not coincidentally, one of the other Electric Communities co-founders is also a Spritely Institute co-founder: https://spritely.institute/about/)
More innocent times.
Apparently Philip Wadler was the person who told them they needed it, because the future was XML.
(Wadler is a big Haskell/PL person.)
Around that time it was pretty nice passing around XML, as I was forced to work with VB.Net which also had an XML literal syntax on the backend and Flash/AS3 on the UI.
I had built a POC with E4X that was VERY similar to React/Redux over a decade before React, but the other browser vendors didn't have it... At the time IE and Chrome were shifting towards JSON.
Not so with XML: all the parsers were insanely complex, with namespacing and whatnot feature support and possible external URLs and everything else... and as a result no XML library was ever adequate to interface with anything. On multiple occasions the best way to build XML for something was to take a working copy and then glue text together so you would exactly replicate whatever that specific application wanted, rather than trying to use anyone's library for it.
> Like with the original J2EE spec, which sought to complicate the basic mechanics of connecting databases via HTML to the internet, this new avalanche of specifications under the WS-* umbrella sought to complicate the basic mechanics of making applications talk to each other over the internet. With such riveting names as WS-SecurityPolicy, WS-Trust, WS-Federation, WS-SecureConversation, and on and on ad nauseam, this monstrosity of complexity mushroomed into a cloud of impenetrable specifications in no time. All seemingly written by and for the same holders of those advanced degrees in enterprisey gibberish.
https://world.hey.com/dhh/they-re-rebuilding-the-death-star-...
It sounds a bit like someone paved a garden path for you by that point. One of the reasons for the "enormous tool stack" wasn't just depth of tools needed ("tool X feeds tool Y which needs tool Z to process namespace A, but tool B to process namespace C, …"), but also the breadth. I recall there were at least six types of parsers to choose from with all sorts of trade-offs in memory utilization, speed, programming API: a complicated spectrum from forward-only parsers that read a node at a time very quickly but had the memory of a goldfish through to HTML DOM-like parsers that would slowly read an entire XML document all at once and take up a huge amount of memory for their XML DOM but you could query through the DOM beautifully and succinctly. (ETA: Plus or minus if you needed XSD validation at parsing time, and if you wanted the type hints from XSD to build type-safe DOMs, etc.)
A lot of XML history was standards proliferation in the xkcd 927 way: https://xkcd.com/927/
XPath tried to unify a lot of mini-DSLs defined for different DOM-style XML parsers.
XSLT tried to unify a bunch of XML transformation/ETL DSLs.
The things XPath and XSLT were designed to replace lingered for a while after those standards were accepted.
Eventually quite a few garden paths were paved from best practices and accepted "best recommended" standards, and greenfield projects started to look easy: a small number of well-coordinated tools. But do enough legacy Enterprise work and you can find all sorts of wild, brownfield gardens full of multiple competing XML parsers using all sorts of slightly different navigation and transformation tools.
However I think by now we've seen that a lot of that "unnecessary" XML complexity was not, in fact, entirely unnecessary. These days we use JSON for everything, but now we've got JSON Schema, Swagger/OpenAPI, Zod, etc etc. It's not really simpler and there's a lot of manual work - we might as well be using XML, XSD & SOAP/WSDL.
It wasn't until about a decade later when I finally got to use XML "for real". At my academic publishing job. One of my first real projects was having a set of academics analyze documents in a web application I built. Prior to that they were analyzing them by hand, were converted to SGML somewhere in Korea, and we would use omnimark to move them to XML and eventually a library application.
The XML community, the ones who haven't retired or passed on, have been more welcoming of the competition too. They went from XML is everywhere, to being able to return JSON from an XSLT. I am in a small shop, and so I wear many hats. But I am always satisfied when I get to work with XML, or craft an xsl/xq script that does exactly what I need. Additionally, the community as a whole is very helpful, and a bit more grey. Meaning, they are less likely to fall for trends and bullshit.
A bit disjointed, but, in short, XML is awesome. Now if only they would move Balisage back to Montreal. I'm no fan of DC or virtual conferences.
Such a document is essentially as simple as the equivalent JSON.
Writing a conformant XML parser is a HUGE undertaking in comparison.
I could get most places to give me the time to write a JSON parser in whatever language if it didn’t have one. I couldn’t do that with XML.
Because of this, every common language (and most uncommon ones) has a JSON parser while XML parsers are less common (and fully conformant ones are even more rare).
As a human in a REPL, I appreciate JSON's balance of readability between XML, which uses a larger set of syntactical characters, and YAML, which uses fewer.
I also appreciate JSON's ontological simplicity over XML. This primarily boils down to the lack of attribute nodes and explicit difference between objects (lists of key-values) and arrays (lists of values).
Very well put. And we could lower the baseline substantially towards simplicity, even from JSON.
It's pretty clear that a lot of people think this way. Some even seriously try to figure out what such a baseline of simplicity would look like.
There are lots of simple indentation-based designs (similar to YAML) such as NestedText[0], Tree Notation[1], StrictYAML[2], or even @Kuyawa's Dixy[3] linked in this thread.
There seem to be fewer new ideas based around nested brackets, the way S-expressions are. Over the years, I have developed a few in this space, most notably Jevko[4]. If there ever will be another lowering of the simplicity baseline, I believe something like Jevko is the most sensible next step.
[0] https://nestedtext.org/en/stable/ [1] https://treenotation.org/ [2] https://hitchdev.com/strictyaml/ [3] https://news.ycombinator.com/item?id=35469643 [4] https://jevko.org/
All the optional complexity that can go on top, though, is probably better specified for XML. Transformation is well defined for XML (XSLT) but not at all for JSON (I guess, you write your own code to manipulate native objects).
Schemas are basically a native feature for XML. Not so much for JSON.
All sorts of specialised vocabularies are defined for XML. A few are defined for JSON, too.
At first, XML namespacing sounds simple. Each tag and attribute will have an optional URI attached to it; no big deal, right?
From reading through the specification, one could be forgiven for assuming that the prefixes are just arbitrary mappings that a processor can ignore, or automatically remap to alternate prefixes.
For example, it is true that <abc:a xmlns:abc="https://example.com/xyz" xmlns:def="https://example.com/xyz"><def:b>5</def:b></abc:a> (notice both namespaces are the same url) is equivalent to: <a xmlns="https://example.com/xyz"><b>5</b></a>.
Unfortunately, the data model also allows for content to reference the namespaces by prefix, and therefore every general xml processor that supports namespaces must keep around an application accessible mapping from the prefixes to namespaces, as the application may need to be able to access that information to interpret attributes or content. The only exception to this would be if the general XML processor insisted on having schema information for every namespace it might come across. In that scenario it would be able to tell if an attribute value of "abc:b" is really a string literal, or a reference to a namespace identifier (QNAME data type), where the namespace is whatever the current "abc" prefix is bound to, and the identifier is "b".
But obviously we don't want to add full schema support for a simple implementation, so we need to keep the mapping information around, just in case the application needs it. We also cannot easily offer nice features like changing a document to use preferred prefixes for certain namespaces, unless we also keep any prefixes that are used in values that could be interpreted as QNAMES, just in case they actually are, but our processor does not know, because it has omitted schema support for simplicity (or perhaps it included schema support, but does not have a schema available for some namespace).
And that is just the complexity that stems from one fairly small quirk in how XML works.
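The prefix loss is easy to see with Python's stdlib parser. A small sketch (the document and URIs are made up):

```python
import io
import xml.etree.ElementTree as ET

xml_src = ('<abc:a xmlns:abc="https://example.com/xyz">'
           '<abc:b>maybe-a:qname</abc:b></abc:a>')

# ElementTree throws prefixes away, expanding names to {uri}local form:
root = ET.fromstring(xml_src)
assert root.tag == "{https://example.com/xyz}a"

# So if text or attribute values might hold QNames, you have to capture
# the prefix -> uri mapping yourself via the "start-ns" parser events:
nsmap = {}
for event, (prefix, uri) in ET.iterparse(io.StringIO(xml_src),
                                         events=("start-ns",)):
    nsmap[prefix] = uri

assert nsmap == {"abc": "https://example.com/xyz"}
```

Without that second pass, the processor cannot tell whether `maybe-a:qname` in the element text is a plain string or a reference to whatever `maybe-a` was bound to, which is exactly the quirk described above.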
You also have no idea if an element's content needs to preserve whitespace or not if you don't know the schema and don't happen to have an xml:space attribute present. Thus, if you want to re-indent arbitrary XML for readability safely, you could end up with something like this:
<abc
><def
>5</def
></abc
>
I.e., the benefits of simplicity have a limit.
JSON is so much more ergonomic than XML as the lingua franca because I can actually read it. That being said I still have my share of problems with JSON.
Me? Schemas are a requirement in areas where you need to integrate over different technology / with different implementations. JSON Schema is in those contexts a bit of a kids toy compared to what XML can do.
We’re not using anything else from Prisma, but if we had to implement something else in JS to talk to a database, that would be a contender for our database interface layer (there are only a couple of others that are even remotely usable, having suffered through the disaster of a Sequelize implementation). We’re more likely to use Elixir and Ecto.
I don't know that I can lay the blame on either one of them directly, mind. But the industry definitely suffered from the bad faith cooperation of those companies.
First, it really depends what you're deserializing with. There is a lot of code out there that just does JSON.parse and then starts accessing the data and then you have an "undefined" get passed deep into the call stack where maybe it explodes or maybe the program just misbehaves. So if you're using a language like JavaScript or Python, then a JSON schema can be used to validate input right away. Think of it like enforcing a pre-condition.
It's also useful in cases where JSON is being used for configuration files. At my company we have quite a few places where JSON files checked-in to a git repo are our source-of-truth which then get POST'ed to an API. We can enforce the schema of those files using pre-commit hooks so no one even wastes time opening a PR that will fail to POST to the API. The same JSON schema is also used by the API to ensure the POST'ed data is correct.
I disagree, this example is just sloppy programming. Passing unvalidated data deep into a program is bad, I'm not arguing for that. What I'm saying is that you should be converting your unvalidated serialized data into a structured type right on the edge. Your data type/type system should __be__ your schema/validator.
> So if you're using a language like JavaScript or Python, then a JSON schema can be used to validate input right away. Think of it like enforcing a pre-condition.
This is what I do with python+pydantic:

    from pydantic.dataclasses import dataclass
    import json

    @dataclass
    class Foo:
        bar: int

    foo = Foo(**json.loads(json_buff))
I'm not the biggest fan of pydantic here because you'll have to handle an exception for invalid data instead of an Option or Result in a better type system. But w/e.

> It's also useful in cases where JSON is being used for configuration files. At my company we have quite a few places where JSON files checked-in to a git repo are our source-of-truth which then get POST'ed to an API. We can enforce the schema of those files using pre-commit hooks so no one even wastes time opening a PR that will fail to POST to the API. The same JSON schema is also used by the API to ensure the POST'ed data is correct.
You can easily do with serdes and a type library as well.
---
I guess schemas may be useful for crossing language boundaries, but you're going to need language specific types/objects at some point so why use schemas directly even then? (I think gRPC may have code gen tools for this purpose).
{
  "someSetting": true,
  "comment": "TODO change to false when ready"
}
Though really, text-based protobufs are better for config. In reality people insert those meta-processing instructions in other ways.
But you still should have the option to at least ignore them while reading. That would make JSON config files so much better to work with.
It is simpler than XML/XSD. Without the schema, you never know if a certain element should be treated as being part of a list or not. When interoperating with anything other than XML, that matters.
I can remember hardcoding and manipulating a bunch of non-sense legacy fields just to get a ticket created via their SOAP enterprise service bus. Not to mention all the operations that made no clear sense.
Consuming SOAP/WSDL from languages other than the one it's published in isn't fun. Man, some of the PHP implementations were beyond horrible... well-defined REST/RPC + JSON is generally much easier in the end.
I disagree. I think personal hygiene is very important for in-office coworking.
Well, I'm about to take a shower now, and shame on you.
- generic concepts like arrays and maps
- lack of opportunity to invent names
Every XML schema is a potential DSL that reinvents things you already know.

Other than that, it's true that the XML era was just addressing a lot of important stuff early. I guess it was only compatible with the big-corp mindset and not early web dynamic/fluid/small-scale apps (a bit like how PHP started to write PSRs to avoid dynamic code/effects in libs, formalization, etc.).
For this JSON:
{
"part_numbers": [1, 2, 3, 4, 5]
}
You have two main ways to represent these in XML:

<!-- repetition = array -->
<order>
<part_number>1</part_number>
<part_number>2</part_number>
<part_number>3</part_number>
<part_number>4</part_number>
<part_number>5</part_number>
</order>
<!-- wrapped repetition -->
<order>
<part_numbers>
<part_number>1</part_number>
<part_number>2</part_number>
<part_number>3</part_number>
<part_number>4</part_number>
<part_number>5</part_number>
</part_numbers>
</order>
Is this better than JSON? No, not particularly. But it’s no less clear than the JSON, and it compresses pretty well (it compresses better for larger documents, obviously).

The larger problem with XML is that the tooling is often lacking outside of Java and C#/.NET, and none of the tooling is well-built for the sort of streaming manipulation that `jq` does (it exists, but IMO one of the least usable ideas from the XML camp is XSLT), and JSON support is pretty universal everywhere, even if the advanced things like JSONPath and JSON Schema aren’t.
I also think that there’s a problem when you have to choose between SAX and DOM parsing early in your process. Most JSON usage is the equivalent of using a DOM parser because the objects are expected to be relatively small, but many XML systems are built for much larger documents, and therefore need to parse the stream because the memory use otherwise would be unacceptable. The use of a JSON streaming parser is much rarer, IME.
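For contrast, SAX-style streaming is available even in stdlib Python via `iterparse`; a small sketch with an invented document:

```python
import io
import xml.etree.ElementTree as ET

big = "<orders>" + "<order><id>1</id></order>" * 3 + "</orders>"

# Handle each <order> as soon as its end tag arrives, then discard it,
# so memory use stays flat no matter how large the document grows.
count = 0
for event, elem in ET.iterparse(io.StringIO(big), events=("end",)):
    if elem.tag == "order":
        count += 1
        elem.clear()  # drop the subtree we are finished with

assert count == 3
```

The JSON equivalent (an incremental parser fed chunk by chunk) exists in third-party libraries, but as noted above it is rarely reached for, because most JSON payloads fit comfortably in memory.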
The hate I have for XML is the high markup overhead. Anybody who has configured a turn-of-the-century product with XML config files knows what I mean; the screen is usually 2/3 XML tags, which means 1/3 closing tags, which add nothing semantically.
Uh... do we? I've never used any of those. Plain JSON has always worked fine for me.
You don't have to use any of those.
> The good thing about reinventing the wheel is that you can get a round one.
I mean... with charity I can see the context and get it. But. What!?
Overall fun read through history, even if definitely from Doug's perspective only. (As evidenced by JavaScript being an originator of lambdas...) I do find the idea that JSON was as novel as history says it was kind of odd. I remember inlining JavaScript objects years before "JSON" was a thing. Making it a subset of what JavaScript could already do seems straightforward and a good execution. Getting rid of comments feels asinine to me. (I'll also note that the plethora of behaviors you get from JSON parsers shows that it is effectively CSV. Sure, there may be a "standard" out there, but by and large it is a duck-typed one.)
I'm also a bit in the camp that XML is better than JSON. Being able to have better datatypes, for a start. Schemas that allow autocompletion. It is also easier to see as a markup language (per the name). That said, they clearly went too far with entities, and despite making sense for markup, attributes versus children are more than a touch awkward.
I also recall that what killed XML and WSDL files in general, was the complete shit show that was getting a single document to work with both MS and non-MS clients.
And I don't make any real defense of some of the darker corners of XML. In particular, I already criticized entities being a bit too much. Namespaces are also something that, while I can see the desire, the implementation is way too much for most of us.
JSON schema is going to be cursed for a long time. Just the odd treatment of it will be a problem. (In particular, that it is a subset of the numbers that javascript itself supports is... awkward.)
I also confess, though; that I'm not clear why I would want a null in the middle of a string? That feels like a gun loaded and aimed squarely at a foot.
>The best thing we can do today to JavaScript is to retire it. Twenty years ago, I was one of the few advocates for JavaScript. Its cobbling together of nested functions and dynamic objects was brilliant. I spent a decade trying to correct its flaws. I had a minor success with ES5. But since then, there has been strong interest in further bloating the language instead of making it better. So JavaScript, like the other dinosaur languages, has become a barrier to progress. We should be focused on the next language, which should look more like E than like JavaScript.
- https://evrone.com/douglas-crockford-interview
One of the traits that makes Douglas great is being willing to say the obvious even if it is politically unpopular.
E had some really cool ideas, it's sad that it doesn't seem to be that well known!
1. You've got to keep JS around for backwards compatibility for the billions of websites already using it.
2. You will need two engine teams, one to maintain JS and one for the new language.
3. Now you have a whole new vector for security issues. You've made the threat surface much broader. So, you will probably need to hire additional people.
4. You need to coordinate with all the other browser makers so everyone rolls out their new engines more or less concurrently. Other than experiments, nobody is going to start using it unless it works on all the major browsers and platforms.
If we went to a scheme dialect as originally intended, we could have just ONE language for all the things.
Legacy JS? Just compile it into Scheme and run it.
HTML? Use S-expressions and support legacy HTML syntax by compiling it into them. Now you get all the power people want from template languages, but baked right into the main language itself.
CSS? No more weirdness like adding sin() or calc() to make up for shortcomings. Once again, you get the power of the full Scheme language right there.
What makes XML so unergonomic to ingest is 1) attributes, which don't map cleanly to a basic data structure that you might find in a programming language, and 2) namespaces, which are extremely, extremely tedious to program against.
Programmers are going to use the format that's the easiest to ingest and manipulate. JSON wins in that regard, hands down. Every time I need to write logic to ingest a namespaced XML document I heave a deep sigh and brace myself for another long week of fighting with LXML. But with JSON it's as easy as `json_decode($str)` and move on with your life.
Abandoning XML was the web's biggest mistake.
Very unfortunately for everyone, XML came up at the same time as peak "Enterprise" moat building. No design pattern went unused; everything was built with mind-numbing "configuration". XML got used heavily in that space because it allowed massive "Enterprise Objects" (local branding varies) to be serialized in a way another system might have a chance to read.
Meanwhile the features you mention got thrown out with the bath water because everyone hated Enterprise style architectures. While I don't love, for instance, everything about XSLT it's built directly into browsers as native code. How many person hours, megabytes of JavaScript, and wasted CPU cycles have been spent reinventing client side templating using JSON? XSLT is already right there and will happily convert serialized data to your presentation format. You also get the ability to have comments in the data and a built in schema validation.
On my current project I'd much rather be emitting and consuming XML than JSON. But alas, everyone hated Enterprise XML, so we're stuck with JSON and the inability of some parsers to handle trailing commas and ambiguous definitions of numerics and not a comment to be found.
Have we though? Earlier, the article even has Douglas saying:
> It turns out it, well, it’s a multi paradigm language, but the important paradigm that it had was functional. We still haven’t, as an industry, caught up to functional programming yet. We’re slowly approaching it, but there is a lot of value there that we haven’t picked up yet.
I do love the very ending:
Adam: What do you think is the XML of today?
Douglas: I don’t know. It’s probably the JavaScript frameworks.
They have gotten so big and so weird. People seem to love them. I don’t understand why.
For a long time I was a big advocate of using some kind of JavaScript library, because the browsers were so unreliable, and the web interfaces were so incompetent, and make someone else do that work for you. But since then, the browsers have actually gotten pretty good. The web standards thing have finally worked, and the web API is stable pretty much. Some of it’s still pretty stupid, but it works and it’s reliable.
And so, when I’m writing interactive stuff in browsers now, I’m just using plain old JavaScript. I’m not using any kind of library, and it’s working for me.
And I think it could work for everybody.
------
Earlier in the interview, where they were talking about how people behind XML and SOAP wanted complexity and were upset by the simplicity of JSON, I was thinking that this resonated with me and how I feel about how complex web development has become with babel/webpack, transpiling, react/vue, etc. It feels like complexity for complexity's sake.
If only this were true.
https://medium.com/r3d-buck3t/insecure-deserialization-with-...
On the other hand, XML External Entities are part of the XML standard, so any standard-compliant XML implementation has to support them. This is why the XXE attack applies to many languages.
JSON is simpler and easier for many cases, but then you lose the interoperability. Go try to make an app right now dealing with Federal government systems or finance, you're going to end up translating JSON<->XML which isn't fun.
There's not going to be a silver bullet solution to this problem, it's not completely solvable.
Not fun? It's not even possible in the general sense.
If you have XML that looks like:
<meal type="breakfast">
<eggs count="3">
<topping>cheese</topping>
</eggs>
</meal>
How would you convert that to JSON without knowing how the JSON-consuming application expects it to be formatted? Where do you put the "breakfast" and "count" attributes?

You'd need to manually write a translator for each potential translation.
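To make the point concrete, here is a toy Python converter that picks one arbitrary convention; every choice in it is exactly the kind of decision a real translator has to hardcode, and a different consumer could reasonably expect any other convention:

```python
import xml.etree.ElementTree as ET

def to_json(elem):
    """One arbitrary convention: attributes -> '@'-prefixed keys,
    text -> '#text' (or a bare string for leaves),
    repeated children -> lists."""
    out = {"@" + k: v for k, v in elem.attrib.items()}
    for child in elem:
        value = to_json(child)
        if child.tag in out:  # second occurrence: promote to a list
            if not isinstance(out[child.tag], list):
                out[child.tag] = [out[child.tag]]
            out[child.tag].append(value)
        else:
            out[child.tag] = value
    text = (elem.text or "").strip()
    if text and out:
        out["#text"] = text
    elif text:
        return text  # leaf element: collapse to a plain string
    return out

doc = ET.fromstring('<meal type="breakfast"><eggs count="3">'
                    '<topping>cheese</topping></eggs></meal>')
assert to_json(doc) == {
    "@type": "breakfast",
    "eggs": {"@count": "3", "topping": "cheese"},
}
```

Note that even this sketch cannot decide whether a single `<topping>` should be a string or a one-element list without a schema, which is the list-ambiguity problem mentioned earlier in the thread.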
Yep, therein lies the “not fun”. You write a bunch of super complex, brittle code.
Unfortunately because XML is entrenched in certain domains, you have to decide between writing these converters or doing everything in XML which also sucks, especially if you’re trying to write a modern app with a modern stack.
I'm leaving it here because it will never be used for anything, but at least it may inspire somebody to design a better format with simplicity in mind.
Other problems to ponder: Is 0 different from 00? Is "1, 2, 3, 4" different from "1,2,3,4"? Is "a: b" different from "a : b" and "a:b"?
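For comparison, JSON's answers to those questions, checked against Python's stdlib parser:

```python
import json

# Whitespace around separators is insignificant:
assert json.loads("[1, 2, 3, 4]") == json.loads("[1,2,3,4]")
assert json.loads('{"a": "b"}') == json.loads('{"a":"b"}') == json.loads('{"a" : "b"}')

# Leading zeros are a syntax error, so "0" and "00" can never disagree:
try:
    json.loads("00")
except json.JSONDecodeError:
    print("00 rejected")
```

A new format has to nail down each of these somewhere, either in the grammar (as JSON does) or in prose, or every parser will answer differently.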
It's like the man never tried. Try a Java enabled browser: https://www.wikihow.com/Enable-Java-in-Firefox
Just as a reminder Minecraft (the most sold game in history) started out as an Applet.
Applets were not horrible because of the underlying technology, they were horrible because people made bad things with it, just like J2EE was a bad thing people made with J2SE.
But sometimes, rarely, people would make beautiful things with J2SE and J2ME and those are now removed from history forever under the banner of security like everything else that is good in life.
> Douglas: For me, the most difficult thing was raising money. You’re constantly going to Sand Hill and calling on people who don’t understand what you’re doing, and are looking to take advantage of you if they can, and they’re going to do that, but you have to go on your knees anyway.
> I found that stuff to be really hard, although some of them I really liked. And sometimes I’d be sitting in those meetings and I’d be thinking, “I wish I was rich enough to sit on the other side of the table, because what they’re doing right now looks like a lot more fun than what I’m doing right now.” And it was even more difficult raising money then, because at this point, the dot-com bubble had popped and all VCs had been hurt really badly by that. So they were only funding sure things at that time, in late 2001, early 2002.
> And I thought we were a fairly sure thing, because we had already implemented our technology. And by this point, Chip and I understood the problem really well. And we had a new server and JavaScript libraries done in just a few months. And we had demonstrations. We could show the actual stuff. So it wasn’t like we were raising money so that we could do a thing. We had already done the thing, we needed the money so that we could roll it out. And that wasn’t enough for them. They wanted to see that we were already successfully selling it. And I was like, “If we could do that, we wouldn’t need you.”
Only they hadn't. They had built a demo of what we would later call a Web 2.0 app. It wasn't even an application that solved a business problem or did anything specific; it was just showing the concept. That's not a product and that's not a business. The VCs' point was: show us proof that this idea has tangible benefits people will pay for.
The biggest misconception about VCs is that you raise money to "successfully sell" something you've built. You don't. You raise VC money to scale something that has value. So you need to communicate the business value, and ideally have proof points (either in the form of sales or data) that demonstrate it.
Of course Douglas found raising money difficult. But he doesn't seem to have the self-awareness to see that this was probably due to him, and not the rich suits on the other side of the table.
1. Parsing JSON doesn't require adding new firewall rules
2. There are no comments, so nobody will try to invent their own meta-format or annotations in comments; instead they will put data in the JSON as they should
3. (When compared to JS) someone finally had the balls to pick one type of quotes, which makes writing a parser so much simpler.
XML supports comments and I have not seen a single use of comment directives in it ever.
I have seen plenty of comment directives in programming languages, HDLs and so on. But they are usually used as hints, e.g. to linters or to control compiler warnings, and they work perfectly well and cause no problems at all in my experience.
You might say that Crockford didn't anticipate JSON being used for config files. Fair enough. But now that it is, it should support comments.
My recommendation is to use JSON5 since it has a distinct file extension and fixes some other things about JSON too (e.g. trailing commas, hex constants) without being full on YAML insane.
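For illustration, a hypothetical config file (say, `settings.json5`) using the features mentioned; comments, unquoted keys, hex constants, and trailing commas are all legal JSON5:

```json5
{
  // comments live in the config itself, no fake "comment" keys needed
  retries: 3,              // unquoted keys are allowed
  flags: 0x2f,             // hex constants
  hosts: [
    "a.example.com",
    "b.example.com",       // trailing comma is fine
  ],
}
```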
It also means it's a worse format for configs, where you sometimes need to annotate a few nodes with comments.
"comment": keys get littered across the JSON... or temporary changes are copied and the original property name is invalidated with a prefix. The simple structure is gone, replaced with ad hoc workarounds.
Similarly, when you want to use a type not supported by JSON, such as a datetime or binary data, you might end up with "type":"binary" and base64 or whatever in the value (shoehorning attributes in) - when it really needs a schema to follow during parse and stringify. Or OpenAPI, which is hardly lightweight and really doesn't match the simplicity of JSON.
Local schemas, not crazy remote schemas.
Or some sort of way to bless an "official" schema format.
Even C# just punts on this issue and won't emit valid XML if a string you serialize happens to have a null character in it.
A human won't be able to read it (Unless you're crazy and have learned to read Base64), but the application still can easily. You'll just have to add a Base64 translation step before/after serialization/deserialization.
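A minimal sketch of that translation step in Python; the "type"/"value" envelope here is just an illustrative convention, not a standard:

```python
import base64
import json

payload = b"\x00\x01\xffraw bytes"  # data a JSON string cannot carry directly

# Before serialization: wrap the bytes in a tagged base64 envelope
doc = json.dumps({"type": "binary", "value": base64.b64encode(payload).decode("ascii")})

# After deserialization: unwrap back to the original bytes
node = json.loads(doc)
restored = base64.b64decode(node["value"])
assert node["type"] == "binary" and restored == payload
```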
The other two premier XML use cases I can think of are
1. RSS: Last time I did this, ironically I built the payload with a JSON-API'd lib that deals with the XML drama for me. Worked fine.
2. Configs. Rarely are these done in XML anymore. Human readability matters for configs. But there are also better options than JSON for this.
Then I had to live through the whole SOAP-drama, and Java EE; and ended up promising myself to never touch it again.
It has too many degrees of freedom for its own good, the C++ of data formats.
JSON is in many ways the other end of the spectrum; simple but underspecified and painful to deal with in anything but JS.
I often dream of something in-between.
- This message brought to you by TOML gang
I’ll take edn over any of “em. https://github.com/edn-format/edn
Comments and time stamps allowed, arbitrary nesting of data structures, make your own tagged literals if you need them. And commas are whitespace, mostly unnecessary.
Come join the dark side where we enjoy the wonders of binary formats such as avro and protobuf.
Though for something where you want human readability it's hard to beat TOML in my opinion.
apiVersion = "v1"
current-context = ""
kind = "Config"
[[clusters]]
name = "my-cluster"
[clusters.cluster]
certificate-authority-data = "LS0tL..."
server = "https://example.com"
[[contexts]]
name = "context0"
[contexts.context]
cluster = "my-cluster"
user = "my-user"
[[contexts]]
name = "context1"
[contexts.context]
cluster = "my-cluster"
user = "my-user"
[[users]]
name = "my-user"
[users.user]
[users.user.exec]
apiVersion = "client.authentication.k8s.io/v1beta1"
args = ["eks", "get-token"]
command = "aws"
At least use a native TOML file as an example.
Also, if I were handwriting that, I would probably make more use of dotted property names implying dictionaries, like so; though it has a little more repetition in property names, it seems easier to read:
apiVersion = "v1"
current-context = ""
kind = "Config"
[[clusters]]
name = "my-cluster"
cluster.certificate-authority-data = "LS0tL..."
cluster.server = "https://example.com"
[[contexts]]
name = "context0"
context.cluster = "my-cluster"
context.user = "my-user"
[[contexts]]
name = "context1"
context.cluster = "my-cluster"
context.user = "my-user"
[[users]]
name = "my-user"
user.exec.apiVersion = "client.authentication.k8s.io/v1beta1"
user.exec.args = ["eks", "get-token"]
user.exec.command = "aws"
If k8s had been designed with TOML in mind, it probably would have been structured differently, such that "contexts", for example, might be just a dictionary mapping names to an object holding the values from the "context" property. (The existing pattern - an array of objects where each object has a name but stores most of its properties in a property whose name matches the object's type - is already weird, but doesn't look terrible in YAML.) Such a schema, redesigned to be more TOML-friendly, would then look like this:
apiVersion = "v1"
current-context = ""
kind = "Config"
[clusters.my-cluster]
certificate-authority-data = "LS0tL..."
server = "https://example.com"
[contexts.context0]
cluster = "my-cluster"
user = "my-user"
[contexts.context1]
cluster = "my-cluster"
user = "my-user"
[users.my-user.exec]
apiVersion = "client.authentication.k8s.io/v1beta1"
args = ["eks", "get-token"]
command = "aws"
Somebody should add a json entry to "the ascent of ward" [0]. Of course, it will be longer than all the previous versions combined, and the fields will appear in random order because dictionary.
Choose the right tool for the job at hand. Sometimes json is the right choice, sometimes xml is. Not everything is a webapp.
Are you saying you think JSON shouldn't exist and everyone should use XML for everything?
Tooling around XML was certainly more established, but man there was a lot of complexity built up around it.
I use both extensively, and for bigger objects and definitions, XML is a very clear winner.
I'm a big believer in a horses-for-courses approach, and my personal gripe is the push to replace one thing with another. These data formats can coexist and be used where they shine. XML can be read and written stupidly fast, so it's way better as an on-disk file format if people are going to touch that file.
YAML and JSON are not the best fit for configuration files. JSON is good as an on-disk serialization format if humans aren't going to touch it. XML is the best format for carrying complex and big data around. TOML is the best format for human-readable, human-editable config files.
It's still a text-bound serialization format; you still have to parse a tree for it.
Is it just particularly mature libraries?
What broke me were plain-string and empty-node handling.
Here is a fun quiz: which of these two documents is valid - one, both, or neither? With explanation, of course.
Yaml#1
:
Yaml#2 :
In 1996 I was at some of the initial XML meetings. The participants' anger at HTML for "corrupting" content with layout was intense. Some of the initial backers of XML were frustrated SGML folks who wanted a better, cleaner world in which data was pristinely separated from presentation. In short, they disliked one of the great success stories of software history, one that succeeded because of its limitations, not despite them. I very much doubt that an HTML that had initially shipped as a clean layered set of content (XML), layout rules (XSLT), and formatting (CSS) would have had anything like the explosive uptake.
https://adambosworth.net/2004/11/18/iscoc04-talk/
At this level they are both about equal in complexity: JSON has data types that XML doesn't, and XML has attributes and CDATA that JSON doesn't. JSON syntax is more succinct, but XML syntax is more regular.
Even if XHTML fell by the wayside, HTML is IMHO a stereotypical example of where XML is a good fit. Most of the complexity has valid use cases, and it's mostly obvious what should be an attribute and what should be the content of the tag. And at least in HTML 4 you even had a doctype declaration filling the role of specifying the schema used. Of course, SVG is a better showcase for some other aspects of XML, with every editor putting its own metadata in, nicely partitioned into separate namespaces.
I think it's not so much about readability but about complexity. XML is meant to represent complex data, like complex rich text or nested vector graphics. That makes XML complex, conceptually, visually, and in implementation. If you use it to represent something that could have been a csv you're going to have a bad time (as everyone had in the 90s).
Indeed, this was what XML was created for. From W3C's XML specification:
> The Extensible Markup Language (XML) is a subset of SGML that is completely described in this document. Its goal is to enable generic SGML to be served, received, and processed on the Web in the way that is now possible with HTML.
Honestly, what's absurd is GP comment's cluelessness.
Also, it's way better at transferring/storing big, complex, intricate data like 3D objects.
Curious how come?
Having the same tags repeated many times means the file compresses nicely, and its being XML means it can be verified independently with a schema (and the schema can be referenced at a remote location over HTTP if need be), too.
You can always store data more efficiently with binary formats, but XML DOM parsers allow instant access to arbitrary parts of the tree, so working with it is both easy and fast at the same time.
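As a small illustration with Python's stdlib ElementTree (the scene/mesh element names here are made up): once the document is parsed into a DOM, any node is directly addressable by path, with no streaming required.

```python
import xml.etree.ElementTree as ET

doc = ET.fromstring(
    "<scene>"
    "<mesh name='cube'><vertex x='0' y='0' z='0'/></mesh>"
    "<mesh name='sphere'><vertex x='1' y='2' z='3'/></mesh>"
    "</scene>"
)

# Jump straight to an arbitrary part of the tree:
vertex = doc.find("./mesh[@name='sphere']/vertex")
print(vertex.get("z"))  # prints: 3
```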
> Adam: [...] He also wanted people to use JavaScript properly – use semicolons, use a functional style, don’t use eval, use JSLint and so on.
They could have done the same with XML, i.e. define a simple-XML subset without schema, CDATA, entities, etc. Instead they built it on top of another language that is so infamous that they felt the need to write JSLint.
> Adam: The thing they came up with, Doug’s idea for sending JavaScript data back and forth, they didn’t even give it a name. It just seemed like the easiest way to talk between the client side and the backend, a way to skip having to build an XML parser in JavaScript.
So the original reason was that they could use eval(jsonstr)? Given the security implications, they'd have been better off writing a JSON parser. At that point, is it any better than writing a simple-XML parser? At least that would have saved them from the "it's not a standard" discussions.
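The hazard of eval-as-parser is the same in any dynamic language; a contrived Python analogue of the problem:

```python
import json

untrusted = '__import__("os").getcwd()'  # code arriving where data was expected

# eval() happily executes it...
result = eval(untrusted)  # runs arbitrary code; here, returns the current directory
print(type(result))       # <class 'str'>

# ...while a real parser rejects it as malformed data:
try:
    json.loads(untrusted)
except json.JSONDecodeError:
    print("rejected as invalid JSON")
```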
Not so different from today. That quote is about HyperCard, not JS, by the way.
The current state of JSON generation/validation is simpler than the XML ecosystem, but a bit hackish.
We can have a much better stack.
Seems politeness goes a long way when you're facing federal charges