The Pretty JSON Revolution

(ohler.com)

169 points · peterohler · 5y ago · 156 comments

Macha5y ago· 12 in thread

If you're on a Mac or Linux system, you likely have a JSON formatter already installed:

`python3 -m json.tool somefile.json` or `cat foo.json | python3 -m json.tool` will print it in "one line per node" format. A `--sort-keys` switch (available since Python 3.5) sorts object keys as well.
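For anyone who wants the same result from inside a script, here is a minimal sketch of roughly what that command does, using only the standard `json` module. The `pretty` helper name is made up for illustration; it is not part of `json.tool`.

```python
import json

def pretty(text, sort_keys=False):
    """Re-indent a JSON document, one node per line, optionally sorting object keys."""
    return json.dumps(json.loads(text), indent=4, sort_keys=sort_keys)

# Hypothetical sample input; any valid JSON text works.
print(pretty('{"b": [1, 2], "a": {"c": null}}', sort_keys=True))
```

The four-space indent matches the default of `json.tool`; pass a different `indent` to `json.dumps` if you prefer two.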

foobarian5y ago

jq is a very nice tool for JSON wrangling available to install on most distros. It also provides key sorting, which is great for diff-ing JSON.
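To illustrate why key sorting helps with diffs, here is a small Python sketch (standard library only, not jq itself). The two sample documents are invented: they hold the same keys in a different order, with one value changed.

```python
import difflib
import json

def sorted_lines(text):
    """Normalize a JSON document to sorted, indented lines suitable for diffing."""
    return json.dumps(json.loads(text), indent=2, sort_keys=True).splitlines()

old = '{"name": "oj", "version": 1, "pretty": true}'
new = '{"pretty": true, "version": 2, "name": "oj"}'

diff = list(difflib.unified_diff(sorted_lines(old), sorted_lines(new), lineterm=""))
# Keep only the added/removed lines, skipping the "---" and "+++" file headers.
changed = [
    line for line in diff
    if line.startswith(("+", "-")) and not line.startswith(("+++", "---"))
]
print("\n".join(changed))
```

Without the normalization step, a line diff of the two raw strings would flag the whole document as changed even though only `version` differs.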

peterohlerOP5y ago

jq is a nice tool. oj is similar in many ways but different in others. jq has its own proprietary query language while oj uses JSONPath. The output options are also different, with some overlap. Maybe jq will get a pretty output option after reading the article. :-)

taeric5y ago

I would be a little nervous sorting the keys. I thought it was not too uncommon for parsers to treat them as an alist where order matters. I guess so long as it is a stable sort, no big deal?

Macha5y ago

Pretty printing JSON is mostly for developer consumption, I'm not sure how the pretty printed JSON would end up being fed automatically to another system?

(I have actually encountered an order-dependent JSON-subset parser before, but to my mind, that code is broken)

igetspam5y ago

If you have a parser that's looking at keys in a hash as sorted, you should change your parser. Lists sure but not keys.

peterohlerOP5y ago

It is a stable sort, since for almost every parser out there there can be no duplicate keys. It would be dangerous for a JSON parser to assume JSON object keys are in some specific order. It is certainly not something that can be counted on in golang, Ruby, or Python.

lucideer5y ago

If you have an application that's dependent on JSON key order, then you're not passing it prettified JSON.

The main apps I've seen that depend on JSON structure are for hashing, which would also be broken by whitespace / linebreak variances in pretty-printers.
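A rough Python sketch of the hashing point: hashing the raw bytes is sensitive to layout and key order, while hashing a normalized re-serialization is not. The `canonical_digest` helper and the sample strings are assumptions for illustration, and sorted keys plus compact separators is only an ad hoc normal form, not a standardized canonicalization.

```python
import hashlib
import json

def canonical_digest(text):
    """Hash the parsed value rather than the bytes, so layout and key order do not matter."""
    canonical = json.dumps(json.loads(text), sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()

compact = '{"b":1,"a":[1,2]}'
pretty = '{\n  "a": [1, 2],\n  "b": 1\n}'

# Raw bytes differ, so naive hashes differ; the normalized digests agree.
raw_equal = hashlib.sha256(compact.encode()).hexdigest() == hashlib.sha256(pretty.encode()).hexdigest()
print(raw_equal)
print(canonical_digest(compact) == canonical_digest(pretty))
```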

billpg5y ago

I would rather pretty printing retained the original order. Some JSON schemas I've seen put a version or class type at the top of every object and I wouldn't want that to be ordered elsewhere.

augusto-moura5y ago

I prefer Nushell[1] for data processing. It's a full-fledged shell, but I rarely use it as an interactive shell; mostly I use it as a scripting language and for one-off one-liners. It supports CSV, JSON and other formats by default and gives the data a much nicer common interface.

Something like:

    open file.json | select colors | each { ^echo $it.hex }

is much nicer than jq

[1]: https://www.nushell.sh/

igetspam5y ago

I did not know about sort keys. I'm adding that to my alias. Thank you.

peterohlerOP5y ago

Oj supports a config file as well. .oj-config.sen. -help-config will describe it in more detail.

jandrese5y ago

I use json_pp a lot. It's usually installed in the base OS.

json_pp < somefile.json

The only thing I don't like is that it doesn't process commandline arguments. You have to pipe the file in. It is also fairly strict, I've run into a number of malformed JSON files that it rejects but other parsers would accept. Naked TRUE/FALSE statements are one thing it hates that are super common, especially from places like Google.

thechao5y ago· 12 in thread

Please:

    {
      "colors": [
        { "color": "black",   "hex": "#000", "rgb": [ 0,   0,   0   ] },
        { "color": "red",     "hex": "#f00", "rgb": [ 255, 0,   0   ] },
        { "color": "yellow",  "hex": "#ff0", "rgb": [ 255, 255, 0   ] },
        { "color": "green",   "hex": "#0f0", "rgb": [ 0,   255, 0   ] },
        { "color": "cyan",    "hex": "#0ff", "rgb": [ 0,   255, 255 ] },
        { "color": "blue",    "hex": "#00f", "rgb": [ 0,   0,   255 ] },
        { "color": "magenta", "hex": "#f0f", "rgb": [ 255, 0,   255 ] },
        { "color": "white",   "hex": "#fff", "rgb": [ 255, 255, 255 ] }
      ]
    }
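A layout like this can be approximated mechanically. Below is a minimal Python sketch, assuming every row is a flat object with the same keys in the same order; `aligned_rows` is a hypothetical helper, not something oj or any other tool provides, and it does not align the numbers inside nested arrays.

```python
import json

def aligned_rows(rows):
    """Render flat dicts one per line, padding each value so the columns line up."""
    keys = list(rows[0])
    last = len(keys) - 1
    # Encode every cell once; all but the last column carry their trailing comma.
    cells = [
        [json.dumps(row[key]) + ("," if i < last else "") for i, key in enumerate(keys)]
        for row in rows
    ]
    widths = [max(len(row[i]) for row in cells) for i in range(len(keys))]
    lines = []
    for row in cells:
        parts = [json.dumps(k) + ": " + c.ljust(w) for k, c, w in zip(keys, row, widths)]
        lines.append("{ " + " ".join(parts) + " }")
    return lines

rows = [{"color": "black", "hex": "#000"}, {"color": "magenta", "hex": "#f0f"}]
lines = aligned_rows(rows)
print("\n".join(lines))
```

Because the padding is plain whitespace between tokens, every emitted line is still valid JSON on its own.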

augusto-moura5y ago

The position of commas in the rgb array are triggering me

Numbers to the right make it much more pleasant to my eyes

    {
      "colors": [
        { "color": "black",   "hex": "#000", "rgb": [   0,   0,   0 ] },
        { "color": "red",     "hex": "#f00", "rgb": [ 255,   0,   0 ] },
        { "color": "yellow",  "hex": "#ff0", "rgb": [ 255, 255,   0 ] },
        { "color": "green",   "hex": "#0f0", "rgb": [   0, 255,   0 ] },
        { "color": "cyan",    "hex": "#0ff", "rgb": [   0, 255, 255 ] },
        { "color": "blue",    "hex": "#00f", "rgb": [   0,   0, 255 ] },
        { "color": "magenta", "hex": "#f0f", "rgb": [ 255,   0, 255 ] },
        { "color": "white",   "hex": "#fff", "rgb": [ 255, 255, 255 ] }
      ]
    }

Zecc5y ago

> The position of commas in the rgb array are triggering me

You mean:

    {
      "colors": [
        { "color": "black"  , "hex": "#000", "rgb": [   0,   0,   0 ] },
        { "color": "red"    , "hex": "#f00", "rgb": [ 255,   0,   0 ] },
        { "color": "yellow" , "hex": "#ff0", "rgb": [ 255, 255,   0 ] },
        { "color": "green"  , "hex": "#0f0", "rgb": [   0, 255,   0 ] },
        { "color": "cyan"   , "hex": "#0ff", "rgb": [   0, 255, 255 ] },
        { "color": "blue"   , "hex": "#00f", "rgb": [   0,   0, 255 ] },
        { "color": "magenta", "hex": "#f0f", "rgb": [ 255,   0, 255 ] },
        { "color": "white"  , "hex": "#fff", "rgb": [ 255, 255, 255 ] }
      ]
    }

falcolas5y ago

Opinionated Opinion:

That's an incredibly XML-ified version of a color table. I can clearly see the tags now. I can't just look up a color by name; instead I would have to iterate over the members or store it in a different data structure.

Why even use JSON? Blech.

selfmodruntime5y ago

Just because the data is structured this way in the example doesn't mean a keyed lookup isn't possible. What's hindering you from defining a class property for each color?

"colors": { "red": { "rgb": "f00" }, ... }

zeroimpl5y ago

Perhaps the purpose is to list colors in a specific order, such as to show to the user in a desired order?

And before you argue that dictionaries can still be iterated in order, you better check the sibling threads where people are arguing you shouldn’t rely on that.

leoedin5y ago

I don't get why you're railing against this comment - that's the same data structure used in the article we're discussing. The only difference between the proposal in the grandparent comment and the article is the indentation and spacing.

peterohlerOP5y ago

An issue has been added to the repo. It is on the list now. Great idea, thanks. https://github.com/ohler55/ojg/issues/35

Latty5y ago

Personally, aligning columns like that drives me nuts. It's awkward to maintain and makes things harder to actually read most of the time.

jng5y ago

How does it make it harder to read? Patterns and repetitions jump out of the page. Indeed, maintaining it takes time and effort, I wish the editor were smart enough, but I only see advantages with regards to reading the code.

d0mine5y ago

visidata shows it as:

  color   | hex  | rgb ║
  red     | #f00 | [3] ║
  black   | #000 | [3] ║
  yellow  | #ff0 | [3] ║
  green   | #0f0 | [3] ║
  cyan    | #0ff | [3] ║
  blue    | #00f | [3] ║
  magenta | #f0f | [3] ║
  white   | #fff | [3] ║

It would be nice to inline the rgb column here.

ivanche5y ago

Ouch! This is borderline unreadable to me. Even more so when there's a value of length, say, 50 for one of the color keys.

stouset5y ago

What on earth is difficult to read about aligned data?

More importantly, what’s more readable to you?

Garlef5y ago· 11 in thread

> JSON can be made prettier by sorting the JSON object members by element keys.

This seems to be a bad idea. The JSON language spec has ORDERED object members. But the order is arbitrary (precisely the one given in the JSON string) and does not have to be lexicographic.

Sorting the object members by default would introduce problems whenever the order matters to the consumer of the JSON.

dragonwriter5y ago

> The JSON language spec has ORDERED object members.

False. “An object is an unordered collection of zero or more name/value pairs, where a name is a string and a value is a string, number, boolean, null, object, or array.” [emphasis added][0]

[0] https://tools.ietf.org/html/rfc8259

ucarion5y ago

Not exactly, unfortunately. The text you cite is in the introduction, which is non-normative. That text talks about the conceptual data model, but that's just to frame the reader's thinking.

The normative text has the "real" answer, and the real answer is that it's basically undefined behavior. It starts by saying "The names within an object SHOULD be unique", and then elaborates:

  An object whose names are all unique is interoperable in the sense that all
  software implementations receiving that object will agree on the name-value
  mappings.  When the names within an object are not unique, the behavior of
  software that receives such an object is unpredictable.  Many implementations
  report the last name/value pair only.  Other implementations report an error
  or fail to parse the object, and some implementations report all of the
  name/value pairs, including duplicates.
  
  JSON parsing libraries have been observed to differ as to whether or not they
  make the ordering of object members visible to calling software.
  Implementations whose behavior does not depend on member ordering will be
  interoperable in the sense that they will not be affected by these
  differences.

https://tools.ietf.org/html/rfc8259#section-4

msluyter5y ago

In theory, order shouldn't matter, right? But I recall seeing this trick for adding comments to json:

  { 
    "foo": "this is a comment about foo",
    "foo": "actual value of foo that overwrites the comment"
  }

The trick is that the second value of foo overwrites the first. But, clearly, sorting would wreak havoc here (if the value was used in the sort key). ;)
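As one concrete data point, here is how Python's standard `json` module treats that document. This is a sketch of a single parser's behavior, not a statement about parsers in general; the sample text is shortened from the comment above.

```python
import json

doc = '{"foo": "this is a comment about foo", "foo": "actual value of foo"}'

# The default decoder builds a dict, so the later pair replaces the earlier one.
last_wins = json.loads(doc)
print(last_wins)

# object_pairs_hook receives every pair in document order, duplicates included.
pairs = json.loads(doc, object_pairs_hook=list)
print(pairs)

# Sorting by key alone keeps duplicates in their original relative order,
# because Python's sort is stable; sorting by (key, value) would not guarantee that.
by_key = sorted(pairs, key=lambda pair: pair[0])
print(by_key == pairs)
```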

Buttons8405y ago

Yes. That is valid JSON.

A fun fact about MongoDB is it will actually store that JSON, both duplicate keys. The implication is that whatever MongoDB client you're using, that maps Mongo data to dictionaries/maps, is not capable of representing all valid MongoDB documents. It's important to recognize that Mongo may be storing data your client will not be able to access.

I learned this when the Python client was showing one value for a key, and the Ruby client was showing another value for the same key, and neither client was showing the whole document.

Latty5y ago

The spec[1] says:

> An object is an unordered collection of zero or more name/value pairs, where a name is a string and a value is a string, number, boolean, null, object, or array.

"whenever the order matters to the consumer of the JSON" should be never.

More pragmatically, regardless of what the spec says, a ton of JSON tooling assumes the order doesn't matter and relying on it would be a big mistake.

[1]: https://tools.ietf.org/html/rfc7159#section-1

gmfawcett5y ago

The other spec (ECMA-404) disagrees with RFC 7159 on this point.

That's the great thing about specs -- if you don't like what one says, there's always another to support your position. :)

Garlef5y ago

> More pragmatically, regardless of what the spec says, a ton of JSON tooling assumes the order doesn't matter and relying on it would be a big mistake.

Agreed. But this does not mean that a tool should break it.

My assumption would be that

   fn(parse(pretty_print(someJSONString)))

should always evaluate to the same as

   fn(someJSONString)

(for all functions fn)
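A small Python sketch of where that assumption holds and where it breaks when the pretty printer sorts keys. The sample document is invented, and `json.dumps(..., sort_keys=True)` stands in for a sorting pretty printer; it is not oj.

```python
import json

original = '{"version": 1, "name": "oj"}'
pretty_sorted = json.dumps(json.loads(original), indent=2, sort_keys=True)

a = json.loads(original)
b = json.loads(pretty_sorted)

# A function that treats objects as mappings sees no difference.
print(a == b)

# A function that observes member order does see a difference.
print(list(a), list(b))
```

So the property holds for every `fn` that ignores member order, and fails for those that depend on it.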

Garlef5y ago

I checked the spec again:

It's unclear: At one point it says "An object is an unordered set of name/value pairs." while in the actual grammar it is ordered:

    object
        '{' ws '}'
        '{' members '}'
    
    members
        member
        member ',' members

gmfawcett5y ago

I recommend reading the ECMA-404 spec instead. It's less ambiguous, and basically says that you can treat the pairs as ordered if you want, as the syntax itself doesn't imbue the order with any meaning:

https://www.ecma-international.org/wp-content/uploads/ECMA-4...

dragonwriter5y ago

Grammar is inherently ordered, semantics is different from syntax.

peterohlerOP5y ago

I think you will find the JSON object members are not ordered while JSON array members are ordered. Since the JSON object members are not ordered, changing the order for display purposes does not change the data in any material way.

breck5y ago· 6 in thread

Relevant plug: if Pretty Notations interest you, then you should keep an eye on Tree Notation https://treenotation.org/.

theamk5y ago

The Tree Notation is like the opposite of pretty notation though, no?

The whole idea of pretty notation is automatically inserting non-significant whitespace to make it look nice. Step 2, "one line per node", inserts spaces and newlines. Step 4, "human style" strategically removes some of those so the lines look nice -- the 2nd level dict has lots of content, so it was split across multiple lines... while the 3rd level dict has fewer data, so it all fits on one line.

As opposed to this, Tree Notation is all about single canonical representation. So whitespace is significant, and you can never add or remove it to make output look nicer. You do whatever your schema tells you, and I hope you like many short lines.
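The "keep a node on one line when it fits" idea can be sketched in a few lines of Python. This is only an illustration of the general technique, not oj's actual algorithm: the `pretty` function and its `edge` parameter are assumptions, and the fit check ignores the width of the preceding key and any trailing comma.

```python
import json

def pretty(value, edge=80, indent=0):
    """Render value as JSON, keeping a node on one line when it fits within edge."""
    flat = json.dumps(value)
    is_container = isinstance(value, (dict, list)) and len(value) > 0
    if not is_container or indent + len(flat) <= edge:
        return flat
    # Too wide: put each child on its own line and let each child decide for itself.
    pad = " " * (indent + 2)
    if isinstance(value, dict):
        items = [json.dumps(k) + ": " + pretty(v, edge, indent + 2) for k, v in value.items()]
        opener, closer = "{", "}"
    else:
        items = [pretty(v, edge, indent + 2) for v in value]
        opener, closer = "[", "]"
    body = ",\n".join(pad + item for item in items)
    return opener + "\n" + body + "\n" + " " * indent + closer

data = {"colors": [{"color": "black", "rgb": [0, 0, 0]}, {"color": "red", "rgb": [255, 0, 0]}]}
out = pretty(data, edge=40)
print(out)
```

With a narrow `edge` the outer levels are split across lines while the small inner objects stay on one line each, which is the effect described above.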

breck5y ago

The OP is sort of all over the place (their favorite "Pretty JSON" is not actually JSON at all, but SEN, which is definitely not JSON, which has a very discrete specification).

So what they are really talking about is just pretty code. Their favorite examples utilize alignment (Tree Notation does that better: every tree doc is isomorphic to a spreadsheet and you don't have to align things to the left spine, and there are grid langs that don't do that).

The colors et al are called "secondary notations" and again Tree Notation can't be beat. Adding secondary notations is simple. Here's an example: https://www.youtube.com/watch?v=vn2aJA5ANUc

stevenpetryk5y ago

This site uses only images to show the code but doesn't provide any text alternative for the image. Every image just has a `title` attribute of "Code you could hold in your hand"

breck5y ago

Lots of code examples here: https://jtree.treenotation.org/designer/

And the source for that homepage is here: https://github.com/treenotation/treenotation.org

Always open to PR!

adwn5y ago

a) That's not relevant, b) it's not nearly as new, interesting, or revolutionary as you think it is, and c) please stop spamming links to your website in every second thread here.

breck5y ago

https://giphy.com/gifs/smile-clap-laff-3oEjHI8WJv4x6UPDB6

taeric5y ago· 4 in thread

I would prefer one that aligned the like named keys, if it fits in screen. Makes it dead easy to scan the values.

That said, it is just another pun on the text as art thing. In that it doesn't really scale, and you are going to upset someone by not having a codified tool for automatically doing this. (I don't recall seeing align-regex in any popular tool.)

thechao5y ago

As a joke I developed a format called "KVIN" which is like GRON but "context-sensitive":

    foo.bar.baz = 10
           .biz = 12 // foo.bar.biz
      ..boz.baz = 31 // foo.boz.baz

etc. It basically combines really brittle context-sensitive grammar production with complete lack of greppability.

peterohlerOP5y ago

Can you explain what you mean by "like named"? The sorting helps a lot but I'm always interested in additional features.

dan-robertson5y ago

I assume they mean vertical alignment like:

  [ { foo: a   bar: 123.45 }
    { foo: abc bar:   6.7  } ]

taeric5y ago

The current top comment is what I meant. On my phone, so couldn't put an example easily.

vicpara5y ago· 4 in thread

JSON is mostly for machines not people. When needed, developers format their json in their code editor of choice or bash.

peterohlerOP5y ago

There are a lot of people who store JSON in NoSQL databases. After fetching a JSON record you generally view the JSON, or at least you do as a developer. That is where the tool is handy, as you can get something like a FHIR record on a single page instead of crunched into a single line or expanded over multiple pages.

gpvos5y ago

True, and this is a nice tool to do so.

slingnow5y ago

JSON is mostly for people and not machines, in that it is meant to be easily readable and editable by humans. If you wanted something for machines you would store your data in a compressed/binary format.

Pxtl5y ago

Which demonstrates that JSON is pointless.

It's too ugly for humans (too many quotes, too many escape characters, and no comments) and too texty for machines.

specialist5y ago· 3 in thread

Nicely done. First I've seen "SEN".

Treating the colons as white space, as you've done with the commas, will move you one step closer to The Correct Answer™.

peterohlerOP5y ago

I suppose it is possible to remove the colons, but it is nice having the extra reminder that the left side of the colon is a key and the right a value. That could easily become lost if a new line is inserted after the key.

SEN is new. After dealing with broken JSON due to commas missing or one at the end of an array and some of the team using Javascript this was a way of sucking in the broken JSON and fixing it.

specialist5y ago

> After dealing with broken JSON...

Postel tried to warn us.

> ...nice having the extra reminder that the left side of the colon is a key and the right a value.

Totally. IMHO: whitespace, formatting, delimiters are for humans. The parsers can do without. With some exceptions, like your examples of quoting strings to remove ambiguity.

kccqzy5y ago

The colons really aren't necessary in most cases. Clojure does away with the colons for instance. And I'm pretty sure GP is referring to some kind of Lisp.

cratermoon5y ago· 3 in thread

I recall seeing something that claimed all JSON is syntactically valid JavaScript. If that's correct, shouldn't it be possible to use JS code formatting engines to intelligently format JSON?

sefrost5y ago

Yes, Prettier can format JSON.

https://prettier.io/docs/en

patdx5y ago

Basically yes, though `{` and `}` also start and end expression blocks in JavaScript. So the JS formatter needs to be aware that it is working with a JSON object and not a complete expression.

Valid JSON:

  {
    "key1": "hello",
    "key2": "world"
  }

You could "trick" a JS formatter to format it by wrapping with a fake function, etc. Some minimum valid JS:

  json({
    key1: "hello",
    key2: "world",
  });

MongoDB has some JS libraries that use similar tricks to use JS parsers for their shell query format (which is similar to JSON). For example, around line 597: https://unpkg.com/browse/ejson-shell-parser@1.1.1/dist/ejson...

binarymax5y ago

Easy node one-liner:

console.log(JSON.stringify(JSON.parse(require('fs').readFileSync('myfile.json', 'utf8')), null, 2));

AzzieElbab5y ago· 3 in thread

I expected a Mona Lisa as json by the end of this article

flaie5y ago

Here you go: https://pastiebin.com/603528abd6813

AzzieElbab5y ago

Just look at that #smile

peterohlerOP5y ago

Now that would be cool! :-)

benatkin5y ago· 2 in thread

In addition to outputting HTML, it could output JSON that could be rendered to HTML, the terminal, or JSX:

https://github.com/wooorm/lowlight#projects

chrisweekly5y ago

oo cool, wooorm looks useful, thanks for the link

benatkin5y ago

You're not wrong, but wooorm is a person.

Remark and Unified are some well-known projects that wooorm maintains.

https://github.com/remarkjs/remark https://unifiedjs.com/

ehnto5y ago· 2 in thread

> Those two parameters are specified as a float where the whole number part is the edge and the fractional part or the number of 10ths is the maximum depth on a single line.

Is that a convention I'm not aware of? Seems a little obtuse and unnecessary, why not just accept two arguments? One less arbitrary usage detail to remember.

peterohlerOP5y ago

I keep the `-p` (pretty) option as a single option. Not a convention at all. I toyed with 80x3 and 80:3 but ended up with 80.3. You are right though, it probably makes sense to support two options as well to avoid the unusual convention. Maybe `-edge` and `-max-depth` options in addition. Issue created: https://github.com/ohler55/ojg/issues/36
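For readers trying to picture the combined value, here is a hypothetical Python sketch of splitting something like `80.3` into its two parts. It is not oj's implementation (oj is written in Go); the function name and the `None` default are assumptions. Parsing the text rather than a float sidesteps floating-point rounding of the fractional part.

```python
def parse_pretty_option(text):
    """Split a value like '80.3' into (edge, max_depth); max_depth is None if absent."""
    whole, _, fraction = text.partition(".")
    edge = int(whole)
    max_depth = int(fraction) if fraction else None
    return edge, max_depth

print(parse_pretty_option("80.3"))
print(parse_pretty_option("120"))
```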

ehnto5y ago

Thanks for the reply, the combined parameter was borne out of your real world usage so I'm glad it sounds like you'll keep it in. Hope I didn't come off cynical, this is a very cool feature for OjG.

Pxtl5y ago· 1 in thread

I have trouble getting excited about tools to prettify JSON as long as the guy controlling the standard has a stubborn attitude about allowing comments or decent storage for long/multiline strings.

At this point I honestly take XML over JSON where I have a choice because of CDATA and comments.

ohitsdom5y ago

I was interested in this link just for its take on comments, bummed to see that skipped over.

_flux5y ago· 1 in thread

Also the revolution of two-letter command names :/.

I mean, at least before one has proven a tool's ubiquitous use, use a longer name.

jq just got lucky but I don't think it was because of its name ;).

peterohlerOP5y ago

Oj is actually pretty well known as a Ruby JSON parser. The OjG project is in the same family.

enriquto5y ago· 1 in thread

This beautiful post is missing a last section called "Gron: do away with json altogether and print something actually readable". That would be a good punchline!
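For those who have not seen the idea, a gron-like flattening can be sketched in Python as below. This is only an approximation of the concept: the `flatten` helper and its `path = value` output format are assumptions and do not reproduce gron's exact syntax, and empty objects and arrays produce no lines at all.

```python
import json

def flatten(value, path=""):
    """Yield one 'path = value' line per scalar leaf, so the output is greppable."""
    if isinstance(value, dict):
        for key, child in value.items():
            yield from flatten(child, f"{path}.{key}" if path else key)
    elif isinstance(value, list):
        for index, child in enumerate(value):
            yield from flatten(child, f"{path}[{index}]")
    else:
        yield f"{path} = {json.dumps(value)}"

doc = {"foo": {"bar": {"baz": 10, "biz": 12}, "boz": {"baz": 31}}, "tags": ["a", "b"]}
flat_lines = list(flatten(doc))
print("\n".join(flat_lines))
```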

peterohlerOP5y ago

Interesting idea. Solves the issue of trying to find data in JSON fairly well. I'm a bit biased but I like using the oj with a JSONPath extraction (-x option) to do something similar but with the power of JSONPath.

warmfuzzykitten5y ago· 1 in thread

It seems a mistake to format JSON as non-JSON text (SEN Format) in the name of "pretty". That will inevitably lead to copy/paste and monkey-see errors.

theamk5y ago

I think that even if you discard the last step ("sen") and stick to plain "human style with colors", this is already much prettier than what many languages support.

I like the idea that the incompatible format is off by default.

Phrodo_005y ago· 1 in thread

> the conversion from SEN to JSON and the reverse is lossless

How does SEN deal with numbers encoded as strings? Is it something like .4? That's a bit confusing.

peterohlerOP5y ago

SEN still uses quotes when necessary. Any string that starts with a number is quoted. Note that in the example the hex colors are in quotes, since the `#` character is not a valid unquoted token character.

olafure5y ago· 1 in thread

Funny coincidence for cli tool name and the Icelandic meaning: https://en.wiktionary.org/wiki/oj#Interjection

peterohlerOP5y ago

Quite a variety of meanings. Some pretty funny.

asaph5y ago· 1 in thread

An incremental formatting tweak is not a revolution.

peterohlerOP5y ago

The title was meant to be fun. JSON format is a pretty light topic.

croes5y ago· 1 in thread

What if the JSON has multiple nested objects?

peterohlerOP5y ago

Works fine. Give it a try.

derefr5y ago· 1 in thread

IMHO, this is what YAML is actually for.

YAML is “a superset of JSON”, yes, but there are two separate meanings to that:

• YAML has alternative syntactic sugar for expressing the same underlying JSON-equivalent semantics (sort of the same as Avro being canonically a binary compact expression of underlying JSON — in both cases, libraries for the codec expect JSON-encodable data structures as #encode input, and produce JSON-encodable data structures as #decode output)

• YAML has its own semantics (like node type annotations, or references) that JSON doesn’t have, such that documents that use these are no longer transposable into JSON.

I love bullet point #1. I hate bullet point #2.

Personally, I wish there was a name for the reduced subset of YAML that is still a “syntactic superset of JSON”, but which has none of the extended semantics of bullet-point #2.

Many systems that “consume YAML” already actually require their documents to be this “strictly-JSONifiable YAML”! Kubernetes, for example: it might seem to expose a YAML manifest API, but actually, internally, it does everything in JSON. All the resources in k8s etcd are stored in canonicalized JSON. The k8s controller just prettifies that JSON to YAML on its way out to you; and uglifies it back to JSON when you send it in. Which means that any YAML features that don’t survive that translation, can’t be used.

IMHO, if YAML hadn’t been designed with any extended semantics, but instead had strictly targeted being a “sugared alternative encoding of JSON”, I think everyone would have switched to sending YAML in place of JSON a long time ago. Browsers would have likely added YAML parsing as well.

But those added semantics are just so much extra work for everybody. Type annotations are source of so many vulnerabilities in programs that were unaware their input could “reach in and do things” through those types; and yet many YAML parser libs don’t have any flag to restrict them from decoding these type annotations (i.e. no way to “defuse the bomb.”) References change the entire way you have to write a YAML parser, disallowing some types of parsing grammar altogether, meaning you might no longer have access to the first-class parsing solution of your language runtime; meaning that for many runtimes, the YAML codec lib for that runtime is much slower — and memory-intensive! — than the JSON codec lib for the same runtime. Etc.

Honestly, if we could all agree on a name for “strict, JSONifiable YAML”, and create libraries that only parse/validate/accept that subset of YAML while rejecting the higher-level semantics, those libs—and that interchange format—would be immediately more popular than YAML. The time for this to happen hasn’t passed! We still have a chance!

elliottinvent5y ago

In the python world this is StrictYAML: https://hitchdev.com/strictyaml/

gpvos5y ago

I hadn't seen the SEN format before. I would like the keys unquoted as far as possible, but the commas kept in, and otherwise to keep it 100% Javascript-compatible so that it can be cut and pasted into Javascript code.

squaresmile5y ago

The human style format reminds me of the default format of js-beautify [1]. We use it to get the "human-style" instead of the "One Line Per Node" for a project where we store json files in a git repo. That way the git diff is pretty easy to read and not bloated. Too bad, not many tools have the "human-style" option.

[1] https://github.com/beautify-web/js-beautify

soheilpro5y ago

Another way to display JSON files in a more readable format is catj (https://github.com/soheilpro/catj)

dan-robertson5y ago

I tried using a json pretty printer in the lisp pp family of pretty printers (but it didn’t have miser mode.) Maybe we were just formatting things wrong and should have put brackets or breaking rules in different places, but changing that sort of code is hard and the results weren’t particularly great and people preferred the standard JSON.stringify(_, null, 2) method. We switched to this and it was simpler and better. This format is also easier to process with something like grep or sed or awk or editor macros when needed.

EdwardDiego5y ago

The biggest advantage of the one line format is the ndjson/jsonl file where one line = one record.
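A minimal Python sketch of that one-record-per-line convention, using an in-memory buffer in place of a real file. The sample records are invented.

```python
import io
import json

records = [{"color": "red", "hex": "#f00"}, {"color": "blue", "hex": "#00f"}]

# Write: json.dumps without indent never emits a raw newline, so each record is one line.
buffer = io.StringIO()
for record in records:
    buffer.write(json.dumps(record) + "\n")

# Read: parse each non-empty line independently of the others.
buffer.seek(0)
loaded = [json.loads(line) for line in buffer if line.strip()]
print(loaded == records)
```

Since every line stands alone, tools like grep, head, or a streaming reader can work on the file without parsing the whole thing.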

noxer5y ago

Slightly off topic but I sometimes use https://json.pizza (a site I know from HN) to format JSON. It however does not have different ways to format just the standard indentation.

AtlasBarfed5y ago

I don't think YAML is perfect, but it is better than every one of these pretty formats.

Pretty JSON is inevitably for either logging or config files, and YAML is better at both of those.

austincheney5y ago

I maintained a code beautification tool for about a decade. Here is what I learned from code beautification.

1. First notice that there is a world of difference between what users want and what they are willing to achieve. Know this more than anything else. People will ask for all kinds of shit, and.... A wish list is not a fully explored business requirement with known sub-tasks and test cases. A simple ask can become something worthy of a different independent project.

2. Too subjective. Everybody has subtly different personal preferences. In some cases the inability to support some edge case of some language will cause certain users to have an emotional episode. WTF. This is free software providing a convenience that you can easily live without.

3. A lot of work. You have to be very clear about what language, grammar, class of languages, or other variety of characters you are willing to support. For example, there is HTML, and then there are about a billion trillion different HTML template schemes, each with their own syntax, and inside that syntax is a wildly different language than the surrounding HTML.

4. Carve out a measurable portion of your life. This is an investment of time you will never get back. Writing a code beautifier is far more work than it sounds. First, you need a parser. If one does not exist for the language you wish to support in the language or format of your tool you will need to write one. Be careful though, because that parser will have to support conventions that are unique to beautification and not necessarily useful elsewhere. In the case of the HTML example above you will need multiple different parsers that can achieve a nesting of parse trees or achieve harmony of a uniform parse tree beloved by all languages. This is achievable, as I have done it, but good luck.

5. Maintenance. There are always new edge cases, new languages, new grammars, new features and your users will want them all. Set hard boundaries.

------

With the amount of work required you will begin to ask yourself some basic life questions:

Does this tool bring me more money or a better job? Does it bring me prestige AND satisfy a craving for attention? Does it improve my work, as in other real work outside your beautification tool?

In my case, for a while, the tool did allow me access to better jobs with increased pay. It demonstrated I could do things many other developers could not and that I was willing to dedicate some absurd amount of effort into something people actually used. But that will only take your career so far, after which you are just spinning your wheels and burning time.

When I got further in my career I realized I wasn't beautifying my code ever. I had no need for the tool I was maintaining and despite continuous maintenance by me the tool started to decay, because the requirements had grown out of control and I was no longer an end user.

slingnow5y ago

If you want something human readable why limit yourself to printing the raw JSON with different indentation rules? Just write a JSON "visualizer" that does something smart with the data.

Your final example is just approaching a JSON -> YAML converter. If your complaint about your chosen human readable serialization format is that it isn't human readable enough, then switch to something more inherently human readable instead of writing tools to temporarily transform it.

jwfearn5y ago

`oj` looks like a useful tool. I wish it was Homebrew-installable.

j / k navigate · click thread line to collapse


billpg5y ago

I would rather pretty printing retain the original order. Some JSON schemas I've seen put a version or class type at the top of every object and I wouldn't want that to be ordered elsewhere.
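To illustrate the point about keeping a leading version field in place, here is a minimal Python sketch. The document and its field names are made up for the example; it only shows that the standard `json` module keeps input order by default and reorders members only when `sort_keys=True` is requested.

```python
import json

# Hypothetical document with a "version" field placed first on purpose.
text = '{"version": 2, "type": "color", "name": "red", "hex": "#f00"}'

doc = json.loads(text)  # dicts keep insertion order in Python 3.7+

# Default: keys come out in the order they appeared in the input.
original_order = json.dumps(doc, indent=2)

# Opt-in sorting: "version" is no longer at the top.
sorted_order = json.dumps(doc, indent=2, sort_keys=True)

print(original_order)
print(sorted_order)
```

Whether sorting is on or off by default is a tool design choice; the sketch only shows that both behaviours are one flag apart.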

augusto-moura5y ago

Something like:

    open file.json | select colors | each { ^echo $it.hex }

is much nicer than jq

[1]: https://www.nushell.sh/

igetspam5y ago

I did not know about sort keys. I'm adding that to my alias. Thank you.

peterohlerOP5y ago

Oj supports a config file as well. .oj-config.sen. -help-config will describe it in more detail.

jandrese5y ago

I use json_pp a lot. It's usually installed in the base OS.

    json_pp < somefile.json

thechao5y ago· 12 in thread

Please:

    {
      "colors": [
        { "color": "black",   "hex": "#000", "rgb": [ 0,   0,   0   ] },
        { "color": "red",     "hex": "#f00", "rgb": [ 255, 0,   0   ] },
        { "color": "yellow",  "hex": "#ff0", "rgb": [ 255, 255, 0   ] },
        { "color": "green",   "hex": "#0f0", "rgb": [ 0,   255, 0   ] },
        { "color": "cyan",    "hex": "#0ff", "rgb": [ 0,   255, 255 ] },
        { "color": "blue",    "hex": "#00f", "rgb": [ 0,   0,   255 ] },
        { "color": "magenta", "hex": "#f0f", "rgb": [ 255, 0,   255 ] },
        { "color": "white",   "hex": "#fff", "rgb": [ 255, 255, 255 ] }
      ]
    }
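A column-aligned layout like the one above can be approximated in a few lines of Python. This is a rough sketch, not how any existing tool does it: `format_aligned` is a hypothetical helper, it assumes every row is a flat object with the same keys in the same order, and it keeps nested arrays compact rather than aligning their elements.

```python
import json

def format_aligned(rows):
    """Render a list of objects, one per line, with values padded into columns.

    Assumes every row has the same keys in the same order.
    """
    keys = list(rows[0])
    # Serialize each value once; nested values are kept compact.
    cells = [[json.dumps(row[k]) for k in keys] for row in rows]
    # Width of each column, +1 for the trailing comma.
    widths = [max(len(r[i]) for r in cells) + 1 for i in range(len(keys))]

    lines = []
    for r in cells:
        parts = []
        for i, k in enumerate(keys):
            if i == len(keys) - 1:
                # Last column: no comma, but still padded so braces line up.
                value = r[i].ljust(widths[i] - 1)
            else:
                value = (r[i] + ",").ljust(widths[i])
            parts.append(f"{json.dumps(k)}: {value}")
        lines.append("{ " + " ".join(parts) + " }")
    return "[\n  " + ",\n  ".join(lines) + "\n]"

colors = [
    {"color": "black", "hex": "#000", "rgb": [0, 0, 0]},
    {"color": "red", "hex": "#f00", "rgb": [255, 0, 0]},
    {"color": "magenta", "hex": "#f0f", "rgb": [255, 0, 255]},
]
print(format_aligned(colors))
```

The output is still valid JSON, since the padding is only whitespace between tokens.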

augusto-moura5y ago

The position of commas in the rgb array are triggering me

Numbers to the right make it much more pleasant to my eyes

    {
      "colors": [
        { "color": "black",   "hex": "#000", "rgb": [   0,   0,   0 ] },
        { "color": "red",     "hex": "#f00", "rgb": [ 255,   0,   0 ] },
        { "color": "yellow",  "hex": "#ff0", "rgb": [ 255, 255,   0 ] },
        { "color": "green",   "hex": "#0f0", "rgb": [   0, 255,   0 ] },
        { "color": "cyan",    "hex": "#0ff", "rgb": [   0, 255, 255 ] },
        { "color": "blue",    "hex": "#00f", "rgb": [   0,   0, 255 ] },
        { "color": "magenta", "hex": "#f0f", "rgb": [ 255,   0, 255 ] },
        { "color": "white",   "hex": "#fff", "rgb": [ 255, 255, 255 ] }
      ]
    }

Zecc5y ago

> The position of commas in the rgb array are triggering me

You mean:

    {
      "colors": [
        { "color": "black"  , "hex": "#000", "rgb": [   0,   0,   0 ] },
        { "color": "red"    , "hex": "#f00", "rgb": [ 255,   0,   0 ] },
        { "color": "yellow" , "hex": "#ff0", "rgb": [ 255, 255,   0 ] },
        { "color": "green"  , "hex": "#0f0", "rgb": [   0, 255,   0 ] },
        { "color": "cyan"   , "hex": "#0ff", "rgb": [   0, 255, 255 ] },
        { "color": "blue"   , "hex": "#00f", "rgb": [   0,   0, 255 ] },
        { "color": "magenta", "hex": "#f0f", "rgb": [ 255,   0, 255 ] },
        { "color": "white"  , "hex": "#fff", "rgb": [ 255, 255, 255 ] }
      ]
    }

2 more replies

falcolas5y ago

Opinionated Opinion:

Why even use JSON? Blech.

selfmodruntime5y ago

Just because the data is structured this way in the example, doesn't mean it's not possible. What's hindering you from defining a class property for each color?

"colors": { "red": {"rgb": "fff"}, ... }

2 more replies

zeroimpl5y ago

Perhaps the purpose is to list colors in a specific order, such as to show to the user in a desired order?

And before you argue that dictionaries can still be iterated in order, you better check the sibling threads where people are arguing you shouldn’t rely on that.

leoedin5y ago

1 more reply

peterohlerOP5y ago

An issue has been added to the repo. It is on the list now. Great idea, thanks. https://github.com/ohler55/ojg/issues/35

Latty5y ago

Personally, aligning columns like that drives me nuts. It's awkward to maintain and makes things harder to actually read most of the time.

jng5y ago

d0mine5y ago

visidata shows it as:

  color   | hex  | rgb ║
  red     | #f00 | [3] ║
  black   | #000 | [3] ║
  yellow  | #ff0 | [3] ║
  green   | #0f0 | [3] ║
  cyan    | #0ff | [3] ║
  blue    | #00f | [3] ║
  magenta | #f0f | [3] ║
  white   | #fff | [3] ║

It would be nice to inline the rgb column here.

ivanche5y ago

Ouch! This is borderline unreadable to me. Even more so when there's a value of length, say, 50 for one of the color keys.

stouset5y ago

What on earth is difficult to read about aligned data?

More importantly, what’s more readable to you?

1 more reply

Garlef5y ago· 11 in thread

> JSON can be made prettier by sorting the JSON object members by element keys.

This seems to be a bad idea. The JSON language spec has ORDERED object members. But the order is arbitrary (precisely the one given in the JSON string) and does not have to be lexicographic.

Sorting the object members by default would introduce problems whenever the order matters to the consumer of the JSON.

dragonwriter5y ago

> The JSON language spec has ORDERED object members.

False. “An object is an unordered collection of zero or more name/value pairs, where a name is a string and a value is a string, number, boolean, null, object, or array.” [emphasis added][0]

[0] https://tools.ietf.org/html/rfc8259

ucarion5y ago

Not exactly, unfortunately. The text you cite is in the introduction, which is non-normative. That text talks about the conceptual data model, but that's just to frame the reader's thinking.

The normative text has the "real" answer, and the real answer is that it's basically undefined behavior. It starts by saying "The names within an object SHOULD be unique", and then elaborates:

  An object whose names are all unique is interoperable in the sense that all
  software implementations receiving that object will agree on the name-value
  mappings.  When the names within an object are not unique, the behavior of
  software that receives such an object is unpredictable.  Many implementations
  report the last name/value pair only.  Other implementations report an error
  or fail to parse the object, and some implementations report all of the
  name/value pairs, including duplicates.
  
  JSON parsing libraries have been observed to differ as to whether or not they
  make the ordering of object members visible to calling software.
  Implementations whose behavior does not depend on member ordering will be
  interoperable in the sense that they will not be affected by these
  differences.

https://tools.ietf.org/html/rfc8259#section-4

3 more replies

msluyter5y ago

In theory, order shouldn't matter, right? But I recall seeing this trick for adding comments to json:

  { 
    "foo": "this is a comment about foo",
    "foo": "actual value of foo that overwrites the comment"
  }

The trick is that the second value of foo overwrites the first. But, clearly, sorting would wreak havoc here (if the value was used in the sort key). ;)
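The duplicate-key behaviour is easy to observe with Python's standard `json` module, which is one of the "last pair wins" implementations the RFC describes. A small sketch, using shortened strings from the example above:

```python
import json

text = '{"foo": "this is a comment about foo", "foo": "actual value of foo"}'

# Default behaviour: a plain dict, so the last duplicate wins.
as_dict = json.loads(text)

# object_pairs_hook receives every pair in document order, duplicates included.
as_pairs = json.loads(text, object_pairs_hook=list)

print(as_dict)
print(as_pairs)
```

A formatter that parses into a dict before printing would silently drop the "comment" entry, regardless of whether it sorts keys.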

Buttons8405y ago

Yes. That is valid JSON.

I learned this when the Python client was showing one value for a key, and the Ruby client was showing another value for the same key, and neither client was showing the whole document.

1 more reply

Latty5y ago

The spec[1] says:

> An object is an unordered collection of zero or more name/value pairs, where a name is a string and a value is a string, number, boolean, null, object, or array.

"whenever the order matters to the consumer of the JSON" should be never.

More pragmatically, regardless of what the spec says, a ton of JSON tooling assumes the order doesn't matter and relying on it would be a big mistake.

[1]: https://tools.ietf.org/html/rfc7159#section-1

gmfawcett5y ago

The other spec (ECMA-404) disagrees with RFC 7159 on this point.

That's the great thing about specs -- if you don't like what one says, there's always another to support your position. :)

2 more replies

Garlef5y ago

> More pragmatically, regardless of what the spec says, a ton of JSON tooling assumes the order doesn't matter and relying on it would be a big mistake.

Agreed. But this does not mean that a tool should break it.

My assumption would be that

   fn(parse(pretty_print(someJSONString)))

should always evaluate to the same as

   fn(someJSONString)

(for all functions fn)
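The round-trip property can be checked against a deliberately order-sensitive consumer. In this Python sketch `pretty_print` and `first_key` are hypothetical stand-ins (the consumer takes JSON text directly); the point is only that a sorting printer keeps the mapping intact but breaks the property for a function that looks at member order.

```python
import json

source = '{"b": 1, "a": 2}'

def pretty_print(text, sort_keys=False):
    # Stand-in pretty printer built on the standard library.
    return json.dumps(json.loads(text), indent=2, sort_keys=sort_keys)

def first_key(text):
    # An order-sensitive consumer: reports the first member name it sees.
    return next(iter(json.loads(text)))

kept = pretty_print(source)
resorted = pretty_print(source, sort_keys=True)

# Both outputs describe the same mapping...
same_mapping = json.loads(kept) == json.loads(resorted) == json.loads(source)

# ...but only the order-preserving output agrees with the source for first_key.
print(same_mapping, first_key(source), first_key(kept), first_key(resorted))
```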

1 more reply

Garlef5y ago

I checked the spec again:

It's unclear: At one point it says "An object is an unordered set of name/value pairs." while in the actual grammar it is ordered:

    object
        '{' ws '}'
        '{' members '}'
    
    members
        member
        member ',' members

gmfawcett5y ago

https://www.ecma-international.org/wp-content/uploads/ECMA-4...

dragonwriter5y ago

Grammar is inherently ordered, semantics is different from syntax.

peterohlerOP5y ago

breck5y ago· 6 in thread

Relevant plug: if Pretty Notations interest you, then you should keep an eye on Tree Notation https://treenotation.org/.

theamk5y ago

The Tree Notation is like the opposite of pretty notation though, no?

breck5y ago

The OP is sort of all over the place (their favorite "Pretty JSON" is not actually JSON at all, but SEN, which is definitely not JSON, which has a very discrete specification).

The colors et al are called "secondary notations" and again Tree Notation can't be beat. Adding secondary notations is simple. Here's an example: https://www.youtube.com/watch?v=vn2aJA5ANUc

1 more reply

stevenpetryk5y ago

This site uses only images to show the code but doesn't provide any text alternative for the image. Every image just has a `title` attribute of "Code you could hold in your hand"

breck5y ago

Lots of code examples here: https://jtree.treenotation.org/designer/

And the source for that homepage is here: https://github.com/treenotation/treenotation.org

Always open to PR!

adwn5y ago

a) That's not relevant, b) it's not nearly as new, interesting, or revolutionary as you think it is, and c) please stop spamming links to your website in every second thread here.

breck5y ago

https://giphy.com/gifs/smile-clap-laff-3oEjHI8WJv4x6UPDB6

taeric5y ago· 4 in thread

I would prefer one that aligned the like named keys, if it fits in screen. Makes it dead easy to scan the values.

thechao5y ago

As a joke I developed a format called "KVIN" which is like GRON but "context-sensitive":

    foo.bar.baz = 10
           .biz = 12 // foo.bar.biz
      ..boz.baz = 31 // foo.boz.baz

etc. It basically combines really brittle context-sensitive grammar production with complete lack of greppability.

peterohlerOP5y ago

Can you explain what you mean by "like named"? The sorting helps a lot but I'm always interested in additional features.

dan-robertson5y ago

I assume they mean vertical alignment like:

  [ { foo: a   bar: 123.45 }
    { foo: abc bar:   6.7  } ]

1 more reply

taeric5y ago

The current top comment is what I meant. On my phone, so couldn't put an example easily.

vicpara5y ago· 4 in thread

JSON is mostly for machines not people. When needed, developers format their json in their code editor of choice or bash.

peterohlerOP5y ago

gpvos5y ago

True, and this is a nice tool to do so.

slingnow5y ago

Pxtl5y ago

Which demonstrates that JSON is pointless.

It's too ugly for humans (too many quotes, too many escape characters, and no comments) and too texty for machines.

specialist5y ago· 3 in thread

Nicely done. First I've seen "SEN".

Treating the colons as white space, as you've done with the commas, will move you one step closer to The Correct Answer™.

peterohlerOP5y ago

SEN is new. After dealing with broken JSON due to commas missing or one at the end of an array and some of the team using Javascript this was a way of sucking in the broken JSON and fixing it.

specialist5y ago

> After dealing with broken JSON...

Postel tried to warn us.

> ...nice having the extra reminder that the left side of the colon is a key and the right a value.

Totally. IMHO: whitespace, formatting, delimiters are for humans. The parsers can do without. With some exceptions, like your examples of quoting strings to remove ambiguity.

kccqzy5y ago

The colons really aren't necessary in most cases. Clojure does away with the colons for instance. And I'm pretty sure GP is referring to some kind of Lisp.

cratermoon5y ago· 3 in thread

I recall seeing something that claimed all JSON is syntactically valid JavaScript. If that's correct, shouldn't it be possible to use JS code formatting engines to intelligently format JSON?

sefrost5y ago

Yes, Prettier can format JSON.

https://prettier.io/docs/en

patdx5y ago

Basically yes, though `{` and `}` also start and end expression blocks in JavaScript. So the JS formatter needs to be aware that it is working with a JSON object and not a complete expression.

Valid JSON:

  {
    "key1": "hello",
    "key2": "world"
  }

You could "trick" a JS formatter to format it by wrapping with a fake function, etc. Some minimum valid JS:

  json({
    key1: "hello",
    key2: "world",
  });

binarymax5y ago

Easy node one-liner:

JSON.stringify(JSON.parse(require('fs').readFileSync('myfile.json')),null,2);

AzzieElbab5y ago· 3 in thread

I expected a Mona Lisa as json by the end of this article

flaie5y ago

Here you go: https://pastiebin.com/603528abd6813

AzzieElbab5y ago

Just look at that #smile

peterohlerOP5y ago

Now that would be cool! :-)

benatkin5y ago· 2 in thread

In addition to outputting HTML, it could output JSON that could be rendered to HTML, the terminal, or JSX:

https://github.com/wooorm/lowlight#projects

chrisweekly5y ago

oo cool, wooorm looks useful, thanks for the link

benatkin5y ago

You're not wrong, but wooorm is a person.

Remark and Unified are some well-known projects that wooorm maintains.

https://github.com/remarkjs/remark https://unifiedjs.com/

1 more reply

ehnto5y ago· 2 in thread

> Those two parameters are specified as a float where the whole number part is the edge and the fractional part or the number of 10ths is the maximum depth on a single line.

Is that a convention I'm not aware of? Seems a little obtuse and unnecessary, why not just accept two arguments? One less arbitrary usage detail to remember.
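For readers trying to picture the combined parameter, this is one way to read the quoted description in Python. It is a sketch of the encoding as described in the quote, not oj's actual implementation, and `split_edge_depth` is a made-up name.

```python
def split_edge_depth(value):
    """Split e.g. 80.3 into (80, 3): edge column 80, max single-line depth 3."""
    edge = int(value)
    # Round to absorb floating-point error such as 0.2999999... for 80.3.
    depth = round((value - edge) * 10)
    return edge, depth

print(split_edge_depth(80.3))
print(split_edge_depth(120.4))
```

Under this reading the depth is limited to a single digit, which is part of the trade-off against passing two separate arguments.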

peterohlerOP5y ago

ehnto5y ago

Thanks for the reply, the combined parameter was borne out of your real world usage so I'm glad it sounds like you'll keep it in. Hope I didn't come off cynical, this is a very cool feature for OjG.

1 more reply

Pxtl5y ago· 1 in thread

I have trouble getting excited about tools to prettify JSON as long as the guy controlling the standard has a stubborn attitude about allowing comments or decent storage for long/multiline strings.

At this point I honestly take XML over JSON where I have a choice because of CDATA and comments.

ohitsdom5y ago

I was interested in this link just for its take on comments, bummed to see that skipped over.

_flux5y ago· 1 in thread

Also the revolution of two-letter command names :/.

I mean, at least before one has proven a tool's ubiquitous use, use a longer name.

jq just got lucky but I don't think it was because of its name ;).

peterohlerOP5y ago

Oj is actually pretty well known as a Ruby JSON parser. The OjG project is in the same family.

enriquto5y ago· 1 in thread

This beautiful post is missing a last section called "Gron: do away with json altogether and print something actually readable". That would be a good punchline!
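For anyone who has not seen that style of output, the idea is to print one assignment per leaf value so that every line is greppable on its own. A simplified Python sketch follows; it is gron-like rather than a reimplementation, and the exact path syntax here is an assumption made for the example.

```python
import json

def flatten(value, path="json"):
    """Yield one 'path = value' line per leaf, in a gron-like style."""
    if isinstance(value, dict):
        if not value:
            yield f"{path} = {{}}"
        for key, child in value.items():
            yield from flatten(child, f"{path}[{json.dumps(key)}]")
    elif isinstance(value, list):
        if not value:
            yield f"{path} = []"
        for index, child in enumerate(value):
            yield from flatten(child, f"{path}[{index}]")
    else:
        yield f"{path} = {json.dumps(value)}"

doc = {"colors": [{"color": "black", "hex": "#000"}, {"color": "red", "hex": "#f00"}]}
for line in flatten(doc):
    print(line)
```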

peterohlerOP5y ago

warmfuzzykitten5y ago· 1 in thread

It seems a mistake to format JSON as non-JSON text (SEN Format) in the name of "pretty". That will inevitably lead to copy/paste and monkey-see errors.

theamk5y ago

I think that even if you discard the last step ("sen") and stick to plain "human style with colors", this is already much prettier than many languages support.

I like the idea that the incompatible format is off by default.

Phrodo_005y ago· 1 in thread

> the conversion from SEN to JSON and the reverse is lossless

How does SEN deal with numbers encoded as strings? Is it something like .4? That's a bit confusing.

peterohlerOP5y ago

olafure5y ago· 1 in thread

Funny coincidence for cli tool name and the Icelandic meaning: https://en.wiktionary.org/wiki/oj#Interjection

peterohlerOP5y ago

Quite a variety of meanings. Some pretty funny.

asaph5y ago· 1 in thread

An incremental formatting tweak is not a revolution.

peterohlerOP5y ago

The title was meant to be fun. JSON format is a pretty light topic.

croes5y ago· 1 in thread

What if the JSON has multiple nested objects?

peterohlerOP5y ago

Works fine. Give it a try.

derefr5y ago· 1 in thread

IMHO, this is what YAML is actually for.

YAML is “a superset of JSON”, yes, but there are two separate meanings to that:

• YAML has its own semantics (like node type annotations, or references) that JSON doesn’t have, such that documents that use these are no longer transposable into JSON.

I love bullet point #1. I hate bullet point #2.

Personally, I wish there was a name for the reduced subset of YAML that is still a “syntactic superset of JSON”, but which has none of the extended semantics of bullet-point #2.

elliottinvent5y ago

In the python world this is StrictYAML: https://hitchdev.com/strictyaml/

gpvos5y ago

squaresmile5y ago

[1] https://github.com/beautify-web/js-beautify

soheilpro5y ago

Another way to display JSON files in a more readable format is catj (https://github.com/soheilpro/catj)

dan-robertson5y ago

EdwardDiego5y ago

The biggest advantage of the one line format is the ndjson/jsonl file where one line = one record.
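The one-record-per-line convention needs nothing beyond the standard library. A minimal Python sketch, using an in-memory buffer in place of a real `.jsonl` file and made-up records:

```python
import io
import json

records = [{"color": "red", "hex": "#f00"}, {"color": "blue", "hex": "#00f"}]

# Write: no indent and compact separators keep each record on a single line.
buffer = io.StringIO()
for record in records:
    buffer.write(json.dumps(record, separators=(",", ":")) + "\n")

# Read: each non-empty line is parsed independently of the others.
buffer.seek(0)
loaded = [json.loads(line) for line in buffer if line.strip()]

print(buffer.getvalue())
```

Because `json.dumps` escapes newlines inside strings, a record never spans more than one line, which is what makes line-based tools safe to use on such files.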

noxer5y ago

Slightly off topic but I sometimes use https://json.pizza (a site I know from HN) to format JSON. It does not, however, offer different ways to format, just the standard indentation.

AtlasBarfed5y ago

I don't think YAML is perfect, but it is better than every one of these pretty formats.

Pretty JSON is inevitably for either logging or config files, and YAML is better at both of those.

