Major mode for editing SEML (S-Expression Markup Language) files

(github.com)

95 points · somezero · 7y ago · 27 comments

zeveb · 7y ago · 6 in thread

I used to spend a lot of time manually typing up examples of why S-expressions are cleaner and more readable than XML, HTML, JSON, YAML &c. They are, they really are. And yet for some reason there's a population of people who prefer:

    <!DOCTYPE html>
    <html lang="en">
        <head>
            <meta charset="utf-8"/>
            <title>sample page</title>
            <link rel="stylesheet" href="sample1.css"/>
        </head>
        <body>
            <h1>sample</h1>
            <p>
                text sample
            </p>
        </body>
    </html>

to:

    (html ((lang . "en"))
          (head nil
                (meta ((charset . "utf-8")))
                (title nil "sample page")
                (link ((rel . "stylesheet") (href . "sample1.css"))))
          (body nil
                (h1 nil "sample")
                (p nil "text sample")))

I don't understand it, but it seems to be true. The egotistical part of me feels that they just haven't experienced the enlightenment of understanding the benefits of having all data & code be manipulable, structured data rather than dead text which must be painstakingly parsed, combined with the benefits of a single, general, universal, cheaply-parseable representation.

But the professional, open-minded part of me wonders if maybe I am missing the point. Maybe all that painful-to-parse, irregular syntax is buying something. Maybe there's a reason every generation for the last 50 years has been approximating some but not all of the features of Lisp. Maybe those other languages and formats have worthwhile benefits. Maybe they're even superior.

Or maybe most folks really are stuck in a local maximum, like kids who like being read to and don't see the advantage of learning to read. I honestly don't know.

Regardless, SEML looks great.
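
To make the "manipulable, structured data" point concrete, here is a minimal sketch in Python rather than Lisp, purely for illustration. It assumes a SEML-shaped node is a tuple of `(tag, attrs, *children)`, where `attrs` is `None` or a list of `(name, value)` pairs; the `render` function and the `VOID_TAGS` set are hypothetical names, not part of SEML itself.

```python
from html import escape

# Assumption: a small, incomplete set of elements written as self-closing tags.
VOID_TAGS = {"meta", "link", "br", "img", "hr", "input"}


def render(node):
    """Render a SEML-shaped node, (tag, attrs, *children), to an HTML string."""
    if isinstance(node, str):
        # Text nodes only need the markup characters escaped.
        return escape(node, quote=False)
    tag, attrs, *children = node
    attr_text = "".join(
        f' {name}="{escape(value)}"' for name, value in (attrs or [])
    )
    if tag in VOID_TAGS:
        return f"<{tag}{attr_text}/>"
    inner = "".join(render(child) for child in children)
    return f"<{tag}{attr_text}>{inner}</{tag}>"


page = ("html", [("lang", "en")],
        ("head", None,
         ("meta", [("charset", "utf-8")]),
         ("title", None, "sample page"),
         ("link", [("rel", "stylesheet"), ("href", "sample1.css")])),
        ("body", None,
         ("h1", None, "sample"),
         ("p", None, "text sample")))

print(render(page))
```

The whole serializer is one recursive function, because the tree is already parsed by the time the program sees it.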

nfoz · 7y ago

Your example (and all the SEML documentation it seems) is missing marked-up text. For example, how would you represent:

    <p>This is a <b>really cool</b> sentence.</p>

Your solution will probably have to splice the text segments around the embedded markup, something like:

    (p nil (text "This is a " (b nil "really cool") " sentence."))

In particular, notice the careful whitespace at the edges of the strings.

IMO this Sexpr is now more obtuse than the XML, and the more markup you have within text-spans (e.g. nested markup), the worse it gets. This is also a major difference between XML (markup language) vs JSON (data-structure notation).

Maybe you don't need the (text ...) thing, but either way you're changing the grammar. How do SEML and SXML handle this?

undersuit · 7y ago

The issue is the spaces, so let's get rid of them!

    (p nil (text (string-join '("This" "is" "a" '(b nil "really") '(b nil "cool") "sentence.") " ")))

No more wondering if the space between 'really' and 'cool' should be bold and no need to have awkward preceding and trailing padding.

zeveb · 7y ago

I think that's mostly an artifact of SEML. In SXML that would be:

    (p "This is a " (b "really cool") " sentence.")

Which seems a-okay to me.
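
The whitespace concern can be checked mechanically. This is a rough sketch in Python, assuming an SXML-shaped node is a tuple of `(tag, *children)` with an optional `("@", ...)` attribute list among the children; `text_content` is a hypothetical helper, not an SXML function.

```python
def text_content(node):
    """Concatenate the text of an SXML-shaped node, skipping attribute lists."""
    if isinstance(node, str):
        return node
    _tag, *children = node
    parts = []
    for child in children:
        # An ("@", ...) child holds attributes, not content.
        if isinstance(child, tuple) and child and child[0] == "@":
            continue
        parts.append(text_content(child))
    return "".join(parts)


careful = ("p", "This is a ", ("b", "really cool"), " sentence.")
careless = ("p", "This is a", ("b", "really cool"), "sentence.")

print(text_content(careful))   # This is a really cool sentence.
print(text_content(careless))  # This is areally coolsentence.
```

The spaces at the string edges carry meaning in both notations; the XML version has the same spaces, they are just not set off by quotes.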

MikusR · 7y ago

Quoting the page: "SEML is short and easy to understand for Lisp hacker."

For someone that has edited a couple of html pages but is not a programmer SEML looks like gibberish (who is nil?).

zeveb · 7y ago

Honestly, SXML is probably cleaner and easier than SEML. The example in SXML would be:

    (html (@ (lang "en"))
          (head
           (meta (@ (charset "utf-8")))
           (title "sample page")
           (link (@ (rel "stylesheet") (href "sample1.css"))))
          (body
           (h1 "sample")
           (p  "text sample")))

The great thing about standards is that there are so many to choose from.
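
Since the two shapes differ only in how attributes are spelled, converting between them is mechanical. Here is a sketch in Python, assuming the SEML shape is `(tag, attrs, *children)` with `attrs` as `None` or a list of `(name, value)` pairs, and the SXML shape is `(tag, ("@", (name, value), ...), *children)` with the attribute list omitted when empty. `seml_to_sxml` is a made-up name for illustration.

```python
def seml_to_sxml(node):
    """Convert a SEML-shaped tuple to an SXML-shaped tuple."""
    if isinstance(node, str):
        return node
    tag, attrs, *children = node
    converted = tuple(seml_to_sxml(child) for child in children)
    if attrs:
        # Attributes move into a leading ("@", ...) list.
        attr_list = ("@", *((name, value) for name, value in attrs))
        return (tag, attr_list, *converted)
    # No attributes: the nil placeholder simply disappears.
    return (tag, *converted)


seml = ("html", [("lang", "en")],
        ("body", None,
         ("h1", None, "sample"),
         ("p", None, "text sample")))

print(seml_to_sxml(seml))
```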

kazinator · 7y ago

> who is nil

A noun in the English language, of Latin origin.

https://www.merriam-webster.com/dictionary/nil

tgbugs · 7y ago · 4 in thread

I've been using SXML [0] for all my SGML needs in Racket, and the quality-of-life improvement from having a sane and regular syntax for everything is hard to overstate. SEML looks like it might bring the same kind of improvement to some of my elisp-only code. I'm not sold on the way missing attributes are handled using nil; that seems like a design decision made to simplify parsing at the expense of a more cluttered representation.

[0] https://docs.racket-lang.org/sxml/

neilv · 7y ago

My old Scheme/Racket permissive HTML parser[0] initially used what might've been exactly this SEML format. Because it's perhaps the most natural choice for a Lisp person -- HTML element is a list, item 0 of that list is a symbol for the HTML element name, item 1 is an alist of HTML attributes, tail items are HTML element content.

However, I changed the format when I saw Oleg Kiselyov's SXML work, to make my Web-scraping and other tools able to use his XML tools. I later made a few other tools that used SXML, such as a simple HTML-writing template embedded in Racket that does some of the checking and work at compilation time.[1]

At a library level, SXML's arbitrary nested lists make some things computationally harder to do (e.g., find all the attributes, depending on the "normal form" of SXML), but some other things easier to do (e.g., some kinds of functional editing, due to arbitrary nested lists). Aside from those considerations, SXML is the closest to a de facto standard for XML and HTML tools in Scheme.
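
As a rough illustration of the "find all the attributes" case, here is a sketch in Python. It assumes an SXML-shaped node is a tuple `(tag, *items)` in which an `("@", (name, value), ...)` list may or may not be present, so every item has to be inspected. The helper names are hypothetical, and this does not model the actual SXML normal forms.

```python
def split_element(node):
    """Split an SXML-shaped tuple into (tag, attrs dict, children list)."""
    tag, *items = node
    attrs = {}
    children = []
    for item in items:
        # The attribute list is optional, so each item must be checked.
        if isinstance(item, tuple) and item and item[0] == "@":
            attrs.update((name, value) for name, value in item[1:])
        else:
            children.append(item)
    return tag, attrs, children


def all_attributes(node):
    """Return (tag, name, value) for every attribute in the tree, in document order."""
    if isinstance(node, str):
        return []
    tag, attrs, children = split_element(node)
    found = [(tag, name, value) for name, value in attrs.items()]
    for child in children:
        found.extend(all_attributes(child))
    return found


doc = ("html", ("@", ("lang", "en")),
       ("head", ("meta", ("@", ("charset", "utf-8")))),
       ("body", ("p", "text sample")))

print(all_attributes(doc))
```

With a fixed attribute slot, as in the SEML shape, `split_element` would reduce to plain positional unpacking.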

If I ever happen to have funding to do so, I'd like to revisit the exact representations, to try to come up with an end-all-be-all for all purposes, and redesign/reimplement all the tools from scratch. Until then, there's SXML.

[0] https://www.neilvandyke.org/racket/html-parsing/

[1] https://www.neilvandyke.org/racket/html-template/

agumonkey · 7y ago

It's funny how Lisp/FP often removes irregularity at the cost of abstraction [0], which has people pulling their hair out to the point of going back to simpler but irregular, separate logical tools. Even if they complain about it very .. regularly.

[0] some people can't bear lisp uniformity for instance.

chriswarbo · 7y ago

Keep in mind that Lisp (including SXML) can be written in many ways (parenthesised, indentation based, braces, infixed, prefixed, etc.), which can be mixed and matched within the same expression, and can be trivially converted between automatically.

I bring this up so often that I have a go-to blog post for it: http://chriswarbo.net/blog/2017-08-29-s_expressions.html

noir_lord · 7y ago

Although different, I got a similar quality-of-life improvement from using pug for HTML. Once you get used to it, it's faster to write but, crucially, much easier to read; it makes the intent so much clearer.

Lowkeyloki · 7y ago · 3 in thread

This is interesting. It reminds me of the API behind JSX. But I'm not sure what problem this is seeking to solve exactly. Is it showing that HTML and s-expressions are technically interchangeable?

txru · 7y ago

If you have time, this[0] is the canonical article usually shared around this concept. The thrust of it is that yes, XML (or x-expressions) and s-expressions are very similar, and that s-expressions are a less verbose and simpler way to represent data.

[0] https://www.defmacro.org/ramblings/lisp.html

js8 · 7y ago

Actually, there is a fundamental difference between XML and sexps. In XML, text is unescaped, while the metadata are escaped. In sexps, the metadata are unescaped, while the text is escaped.

Most text formats fall into one of these two categories. Formats primarily for storing text (like XML or SGML or TeX) are in the former, formats primarily for storing (unstructured) data (like sexps or JSON or YAML) are in the latter.
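
A small sketch of that distinction in Python. `as_xml` and `as_sexp` are hypothetical helpers, and `json.dumps` is used only as a stand-in for Lisp string quoting, which it resembles for plain ASCII text.

```python
import json
from html import escape


def as_xml(tag, text):
    """Markup is delimited by angle brackets; text is bare and only escaped where it collides."""
    return f"<{tag}>{escape(text, quote=False)}</{tag}>"


def as_sexp(tag, text):
    """The symbol is bare; text is always wrapped in quotes."""
    return f"({tag} {json.dumps(text)})"


print(as_xml("p", "plain text"))   # <p>plain text</p>
print(as_sexp("p", "plain text"))  # (p "plain text")

# Each format pays an escaping cost only when content collides with its delimiters.
print(as_xml("p", "a < b"))        # <p>a &lt; b</p>
print(as_sexp("p", 'say "hi"'))    # (p "say \"hi\"")
```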

jakear · 7y ago

Skimmed the article; it left me a bit confused. Am I missing something big, or is this not particularly novel? The similarities between s-expressions and XML seem fairly obvious to me.

notduncansmith · 7y ago · 2 in thread

This reminds me very much of Hiccup[1]. Both nested s-expressions and HTML describe trees.

[1] https://github.com/weavejester/hiccup

StreamBright · 7y ago

I really like Hiccup, it is my favorite part of the Clojure web kit.

agumonkey · 7y ago

lisp and trees, you know

tlavoie · 7y ago

Makes me think of Edi Weitz's CL-WHO, which works very nicely if creating web pages from Common Lisp. https://edicl.github.io/cl-who/
