Well, considering that jq development was halted for five years and only recently revived, it's no wonder that bug reports, both well-known and new ones, have been sitting there all that time. I bet the maintainers will get up to speed and slowly but surely clear the backlog that has built up.
That's not to take away from jaq by any means; I just find the jq-style syntax uber hard to grok, so jql makes more sense for me.
Everyone seems to want to invent their own new esoteric symbolic query language, as if everything they do is a game of code golf. I really wish everyone would move away from this old Unix mentality of extremely concise yet not-self-evident syntax and do more like the PowerShell way.
With somewhat tabular data, you can use sqlite to read the data into tables and then work from there.
Example 10 from https://opensource.adobe.com/Spry/samples/data_region/JSONDa... (slightly fixed by removing the ellipsis) results in this interaction:
sqlite> select json_extract(value, '$.id'), json_extract(value, '$.type') from json_each(readfile('test.json'), '$.items.item[0].batters.batter');
1001|Regular
1002|Chocolate
1003|Blueberry
1004|Devil's Food
sqlite> select json_extract(value, '$.id'), json_extract(value, '$.type') from json_each(readfile('test.json'), '$.items.item[0].topping');
5001|None
5002|Glazed
5005|Sugar
5007|Powdered Sugar
5006|Chocolate with Sprinkles
5003|Chocolate
5004|Maple
Instead of "select" this could also flow into freshly created tables using "insert into" for more complex scenarios.

I personally don't understand why people aren't willing to learn instead. It's not hard to sit down and pick up a new skill, and it's good to step out of one's comfort zone. I personally hate PowerShell syntax; brevity is the soul of wit, and PS could learn a thing or two from bash and "the Linux way".
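As a sketch of that "insert into" route: the same extraction can land in a real table. This uses Python's bundled sqlite3 module rather than the sqlite3 shell, since readfile() exists only in the shell, so the JSON is passed as a query parameter instead; the data is a trimmed, hypothetical copy of the batters example above.

```python
import json     # only used to serialize the sample document
import sqlite3

# Trimmed sample mirroring the structure of the example JSON above.
data = {"items": {"item": [{"batters": {"batter": [
    {"id": "1001", "type": "Regular"},
    {"id": "1002", "type": "Chocolate"},
]}}]}}

con = sqlite3.connect(":memory:")
con.execute("CREATE TABLE batter (id TEXT, type TEXT)")

# json_each (from SQLite's JSON1 functions, bundled in most builds)
# expands the array; INSERT ... SELECT lands the rows in a real table.
con.execute("""
    INSERT INTO batter (id, type)
    SELECT json_extract(value, '$.id'), json_extract(value, '$.type')
    FROM json_each(?, '$.items.item[0].batters.batter')
""", (json.dumps(data),))

for row in con.execute("SELECT id, type FROM batter ORDER BY id"):
    print(row)
```

From there, joins, aggregates, and further inserts work as with any ordinary table.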
We seem obsessed with molding the machine to our individual preferences. Perhaps we should obsess over the opposite: molding our mind to think more like the machine. This keeps a lot of things simple, uncomplicated, and flexible.
Does a painter wish for paints that were more like how he wanted them to be? Sure, but at the end of the day he buys the same paint everyone else does and learns to work with his medium.
- https://steampipe.io/docs/sql/querying-json#querying-json # example with the AWS Steampipe plugin (I think this is a wrapper around the AWS Go SDK)
- https://hub.steampipe.io/plugins/turbot/config # I think this lets you query arbitrary JSON files.
(edited to try to fix the bulleting)
> do more like the PowerShell way
I just checked the GitHub page [1] for Microsoft PowerShell. It looks like it's written in C# and available on Win32/macOS/Linux, wherever .NET is now supported. Do you use PowerShell only on Win32, or on other platforms as well?

> Everyone seems to want to invent their own new esoteric symbolic query language
Can you give an example of something that PS has built in for text processing, instead of a proprietary symbolic query language?

You could ask the same with respect to XML too: why XPath/XSLT instead of SQL?
The problem is that SQL isn't that convenient when you're querying data with a free-form, recursive schema. Especially the latter, because recursive queries in SQL are just not pithy. I say this as someone who loves SQL.
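To illustrate how un-pithy that gets: walking every node of a JSON document, which jq spells as the single token `..`, takes a full recursive CTE in SQLite. A sketch assuming the bundled JSON1 functions, with path building simplified to object keys; the document is made up.

```python
import sqlite3

doc = '{"a": {"b": {"c": 1}}, "d": 2}'
con = sqlite3.connect(":memory:")

# Recursively descend through every node of the document, then keep
# only the integer leaves. jq would express this as `.. | numbers`.
rows = con.execute("""
    WITH RECURSIVE walk(path, value, type) AS (
        SELECT '$', ?, json_type(?)
      UNION ALL
        SELECT walk.path || '.' || je.key, je.value, je.type
        FROM walk, json_each(walk.value) AS je
        WHERE walk.type IN ('object', 'array')
    )
    SELECT path, value FROM walk WHERE type = 'integer'
""", (doc, doc)).fetchall()
print(rows)
```

Correct, but a far cry from two jq tokens.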
N.B. those aliases are not created by default on *nix
It's pipeline-based and procedural, but you can be very declarative in data processing
'|={"b""d"=2, "c"}'
this appears to be something like jq's: 'select(."b"."d" == 2 or ."c" != null)'
which is obviously longer, but I think I prefer it; it's clearer? (Actually it would be `.[] | select(...)`, but I'm not sure something similar isn't true of jql too without trying it. I don't know if the example is intended to be complete, and I don't think it affects my verdict.)
You're not alone. ChatGPT (3.5) is terrible at it also, for anything non-trivial.
I'm not sure if that's because of the nature of the jq syntax, but I do wonder.
Sadly 99% of what I do with jq is “| jq .”
That's a lot of dependencies..
Also re "lots of dependencies": This is kind of unavoidable in Rust because the stdlib is deliberately very lean, and focuses on basic data structures that are needed for interop (e.g. having common string types is important for different libraries to work together with each other) or not possible to implement without specific compiler support (e.g. marker traits or boxing). Contrast this with Go where the stdlib contains things like a full-fledged HTTP server and regex engine. It's easy to build things in Go with a rather short go.mod file, but only because the go.mod file does not show all the stdlib packages that you're using.
SQL is a much more natural language if the data is somewhat tabular.
It is somewhat similar to LINQ in C#, although SQL is more standardised, so I like it more. Also, it would be fantastic to have in-language support for querying raw collections with SQL. Even better: to be able to transparently store collections in SQLite.
It is always sad to see code that takes some data from a DB or elsewhere and then does simple processing using loops or the stream API. SQL is a much higher-level and more concise language for these use cases than Java/Kotlin/Python/JavaScript.
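For now, the "query raw collections with SQL" wish can be approximated, if not transparently, by round-tripping through an in-memory SQLite database. A Python sketch with made-up data:

```python
import sqlite3

# A plain in-language collection...
orders = [("alice", 30), ("bob", 12), ("alice", 5)]

# ...queried with SQL by loading it into an in-memory SQLite table.
con = sqlite3.connect(":memory:")
con.execute("CREATE TABLE orders (customer TEXT, amount INTEGER)")
con.executemany("INSERT INTO orders VALUES (?, ?)", orders)

# Grouping and filtering that would otherwise be loops and dicts.
totals = con.execute("""
    SELECT customer, SUM(amount) FROM orders
    GROUP BY customer HAVING SUM(amount) > 20
    ORDER BY customer
""").fetchall()
print(totals)  # [('alice', 35)]
```

The copy in and out is the part that true in-language support would make disappear.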
I've noticed that what I'm creating are DAGs, and that I'm constantly restarting them from the last-successfully-processed record. Is there a `Make`-like tool to represent this? Make doesn't have SQL targets, but full-featured DAG processors like Airflow are way too heavyweight for gluing together shell snippets.
Also hey, been a while ;)
Edit: I stand corrected, the latest spec (rfc8259) only formally specifies the textual format, but not the semantics of numbers.
However, it does have this to say:
> This specification allows implementations to set limits on the range and precision of numbers accepted. Since software that implements IEEE 754 binary64 (double precision) numbers [IEEE754] is generally available and widely used, good interoperability can be achieved by implementations that expect no more precision or range than these provide, in the sense that implementations will approximate JSON numbers within the expected precision.
In practice, most implementations treat JSON as a subset of JavaScript, which implies that numbers are 64-bit floats.
However what you say is good practice anyway. The spec (RFC 8259) has this note on interoperability:
> This specification allows implementations to set limits on the range and precision of numbers accepted. Since software that implements IEEE 754 binary64 (double precision) numbers [IEEE754] is generally available and widely used, good interoperability can be achieved by implementations that expect no more precision or range than these provide, in the sense that implementations will approximate JSON numbers within the expected precision. A JSON number such as 1E400 or 3.141592653589793238462643383279 may indicate potential interoperability problems, since it suggests that the software that created it expects receiving software to have greater capabilities for numeric magnitude and precision than is widely available.
Are you sure? Looking at https://www.json.org/json-en.html I don't see anything about 64 bit floats.
Many implementations will produce higher precision, but parse as float64 by default. A maximally compatible JSON system should always handle arbitrary precision.
Also, what!! Hey! Miss you man.
> Use decimal number literals to preserve precision. Comparison operations respect precision, but arithmetic operations might truncate.
I tried to do `echo *json | rush -- jaq -rf ./this-program.jq {} | datamash ...` and in that context I don't think it's appropriate to try to get artistic with the tty.
The cause of the errors, for whatever it's worth, is that `jaq` lacks `strftime`.
$ ./jaq-v1.2.0-x86_64-unknown-linux-gnu -sf aoc22-13.jq input.txt
Error: undefined filter
╭─[<unknown>:30:18]
│
30 │ ╭─▶ "bad input" | halt_error
31 │ ├─▶ end;
│ │
│ ╰───────────────── undefined filter
────╯
and (after commenting out halt_error) it's slower than both jq and gojq:

$ time jq -sf aoc22-13.jq input.txt
6415
20056
real 0m0.023s
user 0m0.010s
sys 0m0.010s
$
$ time gojq -sf aoc22-13.jq input.txt
6415
20056
real 0m0.070s
user 0m0.030s
sys 0m0.000s
$
$ time ./jaq-v1.2.0-x86_64-unknown-linux-gnu -sf aoc22-13.jq input.txt
6415
20056
real 0m0.103s
user 0m0.065s
sys 0m0.000s
aoc22-13.jq is here https://pastebin.com/raw/YiUjEu2n
and input.txt is here https://pastebin.com/raw/X0FSyTNf

- yq changed its syntax between version 3 and 4 to be more like jq (but not quite the same, for some reason)
- yq has no if-then-else https://github.com/mikefarah/yq/issues/95 which is a poor design (or omission) in my opinion
So yq works when you need to process YAML; it can even handle comments quite well. But for pure JSON processing, jq is the better tool.
Is this wrong behavior from jq, or some artifact of how the floating-point spec is defined: surprising, but faithful to IEEE 754 nonetheless?
It may be more verbose, but I never have to google anything, which makes a bigger difference in my experience
Not really in "production", but I have a lot of small-ish shell scripts all over the place, mostly in ~/bin, and some in CI (GitHub Actions) as well.
$ echo '{"a": 1, "b": 2}' | jaq 'add'
3
Construct an array from an object in two ways and show that they are equal:
$ echo '{"a": 1, "b": 2}' | jaq '[.a, .b] == [.[]]'
true
But I just looked at jql and I liked it even less. The pedantry about requiring all keys in selectors to be double quoted is, um, painful for a CLI tool.
I think they're kind of stuck in development; even the Mule engine only has one active developer, judging from the GitHub commits...
You learn something new every day. Does anyone have any idea why this might be happening? Seems like more than just a bug...
`jq` is a really powerful tool and `jaq` promises to be even more powerful. But as a system administrator, most of the time that I'm dealing with JSON files, something that behaves more like grep would be sufficient.
It converts your nested JSON into a line-by-line format, which plays better with tools like `grep`.
From the project's README:
▶ gron "https://api.github.com/repos/tomnomnom/gron/commits?per_page..." | fgrep "commit.author"
json[0].commit.author = {};
json[0].commit.author.date = "2016-07-02T10:51:21Z";
json[0].commit.author.email = "mail@tomnomnom.com";
json[0].commit.author.name = "Tom Hudson";
https://github.com/tomnomnom/gron
It was suggested to me in HN comments on an article I wrote about `jq`, and I have found myself using it a lot in my day to day workflow
It flattens the structure. And makes for easy diffing.
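The flattening itself is simple enough to sketch. A minimal gron-like generator in Python (the real gron also handles identifier quoting, arrays of mixed depth, and an ungron direction):

```python
import json

def gron(value, path="json"):
    """Yield gron-style assignment lines for a nested JSON value."""
    if isinstance(value, dict):
        yield f"{path} = {{}};"
        for key, item in value.items():
            yield from gron(item, f"{path}.{key}")
    elif isinstance(value, list):
        yield f"{path} = [];"
        for i, item in enumerate(value):
            yield from gron(item, f"{path}[{i}]")
    else:
        yield f"{path} = {json.dumps(value)};"

doc = {"commit": {"author": {"name": "Tom Hudson"}}}
for line in gron(doc):
    print(line)
# json = {};
# json.commit = {};
# json.commit.author = {};
# json.commit.author.name = "Tom Hudson";
```

Because every leaf carries its full path, plain grep, sort, and diff all work on the output.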
yq -o=props my-file.yaml
The idea is that you get awk/grep like commands for operating on structured data.
Since JSON is JavaScript Object Notation, an obvious non-special-snowflake language for such expressions on the CLI is JavaScript: https://fx.wtf/getting-started#json-processing
It does require justifying a move to a completely different shell, but the way you deal with data there is not restricted to manipulating JSON; it also covers the output of many commands, so you end up with one unified piping interface for all these structured-data manipulations, which I think is neat.
But jq's strength is its syntax; the difficulty is the semantics.
These little one-off unique syntaxes that I'm never going to properly learn are one of my favourite uses of ChatGPT.
Or https://github.com/AtomGraph/JSON2XML which is based on https://www.w3.org/TR/xslt-30/#json-to-xml-mapping
It even looks like we could use an XSLT 3 processor with the json-to-xml function (https://www.w3.org/TR/xslt-30/#func-json-to-xml) and then use XQuery or stay with XSLT 3.
Now I have to test it.
(: file json2xml.xq :)
declare default element namespace "http://www.w3.org/2005/xpath-functions";
declare option saxon:output "method=text";
declare variable $file as xs:string external;
json-to-xml(unparsed-text($file))/<your xpath goes here>
java -cp ~/Java/SaxonHE12-3J/saxon-he-12.3.jar net.sf.saxon.Query -q:json2xml.xq file='/path/to/file.json'

For example, with this query body:

for $price in json-to-xml(unparsed-text($file))/map/map/number[@key="price"]
return $price + 2
For the following JSON document:

{
"fruit1": {
"name": "apple",
"color": "green",
"price": 1.2
},
"fruit2": {
"name": "pear",
"color": "green",
"price": 1.6
}
}
The call to json-to-xml() produces this XML document:

<?xml version="1.0" encoding="UTF-8"?>
<map xmlns="http://www.w3.org/2005/xpath-functions">
<map key="fruit1">
<string key="name">apple</string>
<string key="color">green</string>
<number key="price">1.2</number>
</map>
<map key="fruit2">
<string key="name">pear</string>
<string key="color">green</string>
<number key="price">1.6</number>
</map>
</map>

I simply gave up understanding the whole thing, and restored the balance in the universe by rewriting it in Perl.
But it's such a painful language to look at.
‘cat’ your JSON file and describe what you want: that, I think, should be the way to go.
We have so many JSON query tools now, it's insane.
Another likely reason is that it seems a motivation for jaq is improving on jq's performance. Any low-hanging fruit in the jq implementation was likely picked a long time ago, so further gains there are likely to be hard-won. Writing a brand-new implementation allows trying out different ways of implementing the same functionality, and using a different language known for its performance helps too.
Using a language like Rust also helps with the goal of ensuring correctness and safety.
There are two classes of performance problems:
- implementation issues
- language issues
The latter is mainly a problem in `foreach`, along with some missing ways to help programmers release references (via `$bindings`) that they no longer need.
The former is mostly a matter of doing a variety of bytecode interpreter improvements, and maybe doing more inlining, and maybe finding creative ways to reduce the number of branches.
The closest I’ve gotten is to wrap the APIs with GraphQL. This achieves joining, but requires strict typing and coding the schema and relationships ahead of time, which restricts query flexibility for unforeseen edge cases.
Another is a workflow automation tool like n8n which isn’t as strict and is more user-friendly, but still isn’t very dynamic either.
Postman supports chaining, but in a static way with getting/setting env variables in pre/post request JS scripts.
Bash piping is another option, and seems like a more natural fit, but isn’t super reusable for data sources (e.g. with complex client/auth setup) and I’m not sure how well it would support batch requests.
It would be an interesting tool/language to build, but I figure there has to be a solution out there already.
open http://… | select * where …
# FROM can be omitted because you’re loading a pipe
https://murex.rocks/optional/select.html

[1]: https://github.com/jinyus/related_post_gen
[2]: https://github.com/jinyus/related_post_gen/blob/main/jq/rela...