Steve Yegge and Grok

(bsumm.net)

163 points · bryansum · 13y ago · 47 comments

39 comments · 13 top-level

hackinthebochs · 13y ago · 8 in thread

Am I the only one that really, really hates the word "grok"? I was hoping it would eventually lose favor among us geeks, but now with this project's name I fear I may just have to get used to it.

gruseom · 13y ago

I get irritated by words too (for example, Twitter's use of "tweet" bugs me) so I sympathize in general, plus I suppose "grok" is rather unpleasant sounding. But the reason I like it, and why I suspect it (sort of) caught on, is that English doesn't have any other word for this. "Understand" is too general. "Grok" means to understand something intuitively, in its essence and as a whole. Perhaps a synonym would be "get" as in "to get it". But "get" is so overloaded that there's room for a more specific alternative.

(Actually the above isn't what "grok" originally meant, but I think it means that now.)

endtime · 13y ago

What did grok originally mean, then? I thought Heinlein defined it pretty much as you did.

columbo · 13y ago

I'm curious, why? I have no problem with grok as long as anyone that uses the term has actually read 'Stranger in a Strange Land'. Honestly if there ever was a word that needed to be tossed out with the bathwater I believe it would be hacker.

hackinthebochs · 13y ago

Scaevolus's answer is exactly how I feel. It's just very harsh sounding to my mind's ear. It literally grates on my psyche every time I have to say it in my head.

And I'm with you on "hacker". I don't have a problem with the word itself, but it's so overused in these circles that it's irritating every time I see someone try to stuff yet another everyday activity under the hacker term.

Scaevolus · 13y ago

It's very harsh sounding-- like an expletive or a loud bird.

wamatt · 13y ago

You are not alone, it's a horrible horrible word.

justinmk · 13y ago

> I was hoping [grok] would eventually lose favor among us geeks

Yet you do not seem to mind the word "geek".

hackinthebochs · 13y ago

I don't care for that word much either. But it's normalized in our culture so I'm used to it, which makes it less abrasive. Another problem with "grok" is that I can't convince myself of how to pronounce it. This fact alone probably causes half of the internal discord I feel when I try to verbalize it internally (does it rhyme with rock or broke?). Also its spelling doesn't have any precedent in English, so every time I see it my subconscious pattern matching system goes haywire.

You know when you read or hear something that is contradictory, or something that is just painfully wrong, and you can almost feel your psyche twinge as it recognizes the error? I get that feeling every time I see this word. Perhaps that's just me.

SoftwareMaven · 13y ago · 4 in thread

I'm very happy that Steve is such an Emacs fan. I have little doubt that the Emacs support will be top notch when this hits the street.

why-el · 13y ago

He has been an Emacs fan for a long time now. You should probably check out his "Effective Emacs" article. Lots of good advice there.

eta_carinae · 13y ago

The fact that he's such an Emacs fan makes me skeptical about project Grok, to be honest. There is only so far you can take Emacs, and for statically typed languages, IDEs leave Emacs in the dust.

I wish he were more knowledgeable on IDEs (IDEA, Eclipse, Visual Studio, Xcode) so he had a better idea of what modern developers expect from their tools.

I'm hoping Grok will succeed, but I wouldn't be surprised if in the coming year, it gets completely canceled.

jrockway · 13y ago

So, as Steve mentions in his post, Grok already exists. It's designed to work with all editors, including IDEs. IDEs exhibit ... how shall I say ... interesting failure modes ... on the Google codebase. Grok will benefit users of Eclipse and IntelliJ as much as it will benefit users of Emacs and Vim.

Also worth noting: he is pretty knowledgeable about IDEs too. Why do you think he isn't?

Finally, I wouldn't count on Grok being canceled.

tree_of_item · 13y ago

The whole point of Grok is that the phrase "IDEs leave Emacs in the dust" is a function of the poor design of today's tools, not anything inherent in IDEs or Emacs.

"IDE as a service" in the style of Grok (or Ensime for something currently available) puts Emacs on a level playing field with any IDE.

knowtheory · 13y ago · 4 in thread

This isn't a pipe dream; it's just dumb. Different languages are suitable for different tasks, and communities grow up with different goals and achievements to reflect that. Certain cultural movements are language agnostic (say, testing) but DSLs? HM type theory? Not so much.

We need only look at how hard it is to compare programming languages through common benchmarks to realize what a bad idea this is.

zaphar · 13y ago

I don't know about you but getting real autocomplete and refactoring in Emacs for M languages is a dream come true for me. I don't want a different IDE for each language. I want one Editor to Rule Them All.

Making the infrastructure to understand a programming language shareable between all the ways to edit/refactor/analyze code in that language is far from dumb. It's something Eclipse/IntelliJ/VisualStudio should have done a long long time ago. Heck, I once attempted to pick apart the tooling in Eclipse into something I could use headless without booting the entire thing, and the code was so tightly integrated into their GUI I gave up in disgust. I mean this is architecture 101. Clear APIs and division of labor should have been on everyone's mind but instead the whole thing is a pile of spaghetti code.

Even something as simple as a CLI interface to your IDE's language support features so I could integrate them into the editor of my choice would have been smart. But no one did that. I work with 4 different languages minimum every day at work. I'm not going to context switch my editor/IDE for each one of them. For this reason I'm cheering Yegge on like mad.

(Full disclosure: I work at Google same as Yegge. I've had these opinions for longer than I've worked there though.)

masklinn · 13y ago

> getting real autocomplete and refactoring in Emacs for M languages is a dream come true for me

Even just one. The Python RefactoringBrowser takes tens of seconds just to make a mess of the whole thing. It's a nice effort, but it fails on both reliability and speed.

shasta · 13y ago

It is dumb the way it's phrased. The easiest way to achieve it is to define a super language that all other languages can be shallowly encoded into. Describing this as ambitious is dumb.

jrockway · 13y ago

Incidentally, check out how Java indexing is implemented in Grok :)

ZitchDog · 13y ago · 3 in thread

"We would like to be able to Grok all open-source code in the world. For any given Java library in the world, find all callers of this function!"

Whether it will work or not, this is a truly audacious and remarkable vision.
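
As a toy illustration of the "find all callers of this function" query quoted above, here is a minimal sketch that uses Python's standard `ast` module as a stand-in for a real compiler front end. It is not how Grok works (nothing in this thread describes its implementation), and it uses Python rather than Java only to stay self-contained. The `index_callers` helper and the two sample files are hypothetical, and calls are matched by bare name only, ignoring scoping, imports, and dynamic dispatch.

```python
import ast
from collections import defaultdict


def index_callers(sources):
    """Map each called name to the (file, enclosing function, line) sites calling it.

    `sources` is a dict of {filename: source text}. Hypothetical helper:
    names are matched textually, so two unrelated functions that share a
    name are not distinguished, and calls inside nested functions are also
    attributed to the outer function.
    """
    index = defaultdict(list)
    for filename, text in sources.items():
        tree = ast.parse(text, filename=filename)
        for func in ast.walk(tree):
            if not isinstance(func, (ast.FunctionDef, ast.AsyncFunctionDef)):
                continue
            for node in ast.walk(func):
                if not isinstance(node, ast.Call):
                    continue
                callee = node.func
                if isinstance(callee, ast.Name):         # parse(...)
                    name = callee.id
                elif isinstance(callee, ast.Attribute):  # obj.parse(...)
                    name = callee.attr
                else:
                    continue
                index[name].append((filename, func.name, node.lineno))
    return index


# Illustrative inputs only; these files do not come from the article.
sources = {
    "lib.py": "def parse(s):\n    return s.strip()\n",
    "app.py": (
        "from lib import parse\n"
        "\n"
        "def main():\n"
        "    return parse(' hi ')\n"
        "\n"
        "def helper():\n"
        "    return parse('x').upper()\n"
    ),
}

callers = index_callers(sources)
for site in callers["parse"]:
    print(site)
```

Doing this correctly across a whole ecosystem needs the compiler's own name resolution and type information, which is the hard part the article is talking about.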

hythloday · 13y ago

You can do stuff that's quite a bit more amazing than that (though I guess this goes against his point of "manage expectations"):

For any given C++ refactoring (rename method + swap args), propagate that refactoring automatically to every caller of the refactored function, such that upgrading to a new version of the library is automatically refactored correctly.

Parse this Python function and emit it as C, for optimization. Parse this C function and emit it as Python, for readability.

Cross-layer data-flow analysis - so you could track that a lock that's taken out in Java and released in C++ code would never produce deadlocks.

mej10 · 13y ago

I have been dreaming of something like this for years. So glad that it is being worked on.

spullara · 13y ago

Can't wait to refactor a library and have pull requests auto generated for all the code that depends on it.

mynegation · 13y ago · 2 in thread

This is quite an undertaking and even if they do not achieve all the lofty goals, the experience and the code (if they open-source it) will be extremely valuable.

For many years I made a living working on static analysis tools and that included a lot of compiler construction stuff. Unfortunately not all languages are equally amenable to static analysis.

Take C++ for example. On one hand C++ is statically typed and that helps in static analysis and refactoring. On the other hand, its semantics are so horribly complicated that it is almost impossible to write static analyzers for it, and useful refactoring tools for C++ are almost non-existent. Pointer aliasing and the macro preprocessor just add insult to injury.

Python on the other hand is a very clean language with no macro preprocessor, but it is dynamically typed, which makes reasoning about code very difficult.

Java (and after it, C#) managed to strike a very nice balance, which is why the tooling ecosystems for these languages are very good. The road to good tools starts with a carefully designed language.

cageface · 13y ago

This is why I have high hopes for the Kotlin language. I think it makes a lot of sense to design a language with tooling in mind from the beginning to avoid nightmares like C++.

I think nobody really understands the pitfalls of language tooling better than the JetBrains guys.

rabbitfang · 13y ago

Either way their free Scala plugin for IntelliJ is pretty sweet. Kotlin looks very similar, will be interesting to see how it develops.

quanticle · 13y ago · 2 in thread

Idea: pry open compilers, run their “guts” on distributed clusters, output a language neutral index. Serve the index via service APIs. Write client plugins, etc.

Isn't this similar to what Microsoft is trying to achieve with Project Roslyn [1]? Was someone at Microsoft inspired by Yegge's vision and decided to implement it for the .Net Common Language Runtime?

[1] http://www.microsoft.com/en-us/download/details.aspx?id=2774...

magic_haze · 13y ago

From Eric Lippert's blog posts about this, the Roslyn project most definitely doesn't seem like a one-person-getting-inspired kind of thing. I was talking to a guy in the Roslyn team last week, and the project scope, and the way they have decided to implement it, is quite frankly astonishing. I mean, a dynamic parser/lexer with all the underlying data structures immutable [1]? Impressive.

[1] http://blogs.msdn.com/b/ericlippert/archive/2012/06/08/persi...

saraid216 · 13y ago

> Was someone at Microsoft inspired by Yegge's vision and decided to implement it for the .Net Common Language Runtime?

I feel a bit obligated to point out that co-incidental invention/discovery is very possible, especially when trying to solve a problem large numbers of smart people would have exposure to.

gbog · 13y ago · 1 in thread

I think there is an elephant in the room of Yegge's article: he is apparently leading the development of a toolchain for code analysis and he is at the same time a self-declared "liberal", who advocates use of metaprogramming, evaled code, and all sorts of black magic.

But then almost all the fights he fights every day are precisely against these techniques, which make code analysis a nightmare, even when simply grepping for the place where a thing has been set. (Just imagine you want to know why ab == 2, and it is so because in another line of another file you have eval("a" + x + " = 2"), and because of another remote line of code x happens to equal "b".)
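
To make that parenthetical concrete, here is a minimal Python sketch of the same situation. Python's `eval` only accepts expressions, so the sketch assumes `exec` in its place. The `statically_assigned_names` helper is hypothetical and stands in for any tool that looks only at the source text or syntax tree.

```python
import ast


def statically_assigned_names(source):
    """Return the names that appear as plain assignment targets in `source`.

    Hypothetical helper: it inspects only the syntax tree, the way a
    grep-like or parser-only tool would, and never runs the code.
    """
    names = set()
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.Assign):
            for target in node.targets:
                if isinstance(target, ast.Name):
                    names.add(target.id)
    return names


# The program under analysis: `ab` is only ever defined through exec.
source = 'x = "b"\nexec("a" + x + " = 2")\n'

# Running it really does define `ab` with the value 2 ...
namespace = {}
exec(source, namespace)
print(namespace["ab"])

# ... but neither a syntax-level scan nor a text search for "ab =" sees it.
print(statically_assigned_names(source))
print("ab =" in source)
```

The assignment to `ab` exists only at run time, so an index built from the syntax tree alone has nothing to point at.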

So, we can be optimistic and hope Grok will work. But another possibility is that Yegge's sweet-and-sour rant is the sign that it is unbearable for him to see his "software political belief" to be the cause of the failure of his gigantic project.

The fact that he is talking publicly about an internal project, apparently without authorisation, may mean he is looking for the emergency exit sign over the door...

georgemcbay · 13y ago

"The fact that he is talking publicly about an internal project, apparently without authorisation, may mean he is looking for the emergency exit sign over the door..."

From everything said publicly it seems like "grok" isn't a secret for Google, the team members just haven't talked about it much because it is so likely to fail or at least fall short of its goals given how ambitious it is.

The fact that the project wouldn't be explicitly secret is perfectly believable, just look at Go or Dart which have been developed fully in the open. Like those projects, grok seems like the sort of thing that could potentially help Google and everyone but isn't seen as a competitive advantage for advertising/social, so it is perfectly reasonable to believe they aren't trying to keep it a secret.

wsxiaoys · 13y ago · 1 in thread

hmm interesting, just read another article on Grok yesterday posted by a previous intern of that team. (sadly it's in Chinese, but it covered the detail of Python Indexer's implementation) http://blog.sina.com.cn/s/blog_5d90e82f010191rh.html

EDIT: Another article on the analyzer by this author, in English. http://yinwang0.wordpress.com/2010/09/12/pysonar/

j-kidd · 13y ago

Thanks for sharing. The first article is really interesting. The author worked 3 months as an intern at Google, and his much under-appreciated works saved Steve Yegge's Grok team that has been struggling for years.

EDIT: Found another gem from the same author: http://yinwang0.wordpress.com/2012/04/09/reinvent-y/

> Today I found that I forgot how to derive the definition of the Y combinator. I learned it several years ago from an online article, but now the search term “Y combinator” gives me all the news about startups (sigh).

pwpwp · 13y ago · 1 in thread

What does "All non-compiler languages are ad-hoc." mean?

stan_rogers · 13y ago

I'll take a stab at it. (That is to say, "I think I understand how the phrase is being used, at least to a degree that it makes perfect sense to me. I may, of course, be wrong. But I'll try to translate my understanding.")

At any level above the trivial, the code we write (or work with) is very often not a direct set of instructions for the machine to carry out. Here, I don't simply mean a high-level language is not machine code or assembly. I think it's safe to assume that we all get that much at least.

Rather, it's a lot like everyday conversation, but with Lewis Carroll's Humpty Dumpty character. There will be common (and less-than-common) cultural references, along with a whole bunch of private definitions for things. Actually understanding the conversation means not only having those cultural references in common[1], but also having access to the private definitions[2] (such as "glory" meaning "a nice knock-down argument").

So the text on the screen (or in the file) is not the actual code, or even a complete description of the code, that the machine will run. It's only a sort of short-hand representation, a language we created along the way to talk about the instructions (even if, at the basic level of the language facilities, the operators and keywords are directly translatable). Simply working with the text content, then, as one would do with a simple parser, or as a human programmer would do with a single text file in isolation, is insufficient; the person or tool would have to have a much deeper understanding of the implications of the text in order to make any real sense of it.[3] For a tool to offer useful help (beyond the simple matter of altering text), it has to be able to "understand" what the text means, and that, in turn, means looking beyond the simple arrangement of text characters.

[1] It might help to think about the Darmok episode of Star Trek: TNG. Library calls are very much like the metaphorical language of the Tamarians, in that they make perfect sense if we know the "story" being referenced, but not knowing means either the reference is meaningless or can even be misleading.

[2] That applies as much to preprocessing directives like constant definitions as it does to things like function/method calls.

[3] We do this in natural languages as well. Every time you've run across a cant, an argot or a jargon specific to one field of endeavor, you either understand it because you are familiar with the displaced vocabulary, or you're left in that uncomfortable state of knowing (or thinking you know) what most of the words mean, but failing entirely to understand what's being said.

mark_l_watson · 13y ago

"Crucially, consistent tools make it easier to switch languages. Devs would then be more likely to use the best language for the job. IDE authors can focus on presentation and editing. This leads to more configurability, scriptability, accessibility in languages."

I am a (sometimes reluctant) polyglot programmer. Sticking with a single IDE product line (I use IntelliJ, RubyMine and PyCharm for some consistency across Clojure, Java, Ruby, and Python development) helps a lot. Eclipse is another great alternative for a unified development platform.

At SAIC in the late 1980s, we had a small team put together for a job - we were all Lisp hackers and we had a great dev community built around sharing Emacs configurations. A lot of fun, but our ad hoc tooling would not scale up to a Google or Facebook size engineering infrastructure.

skrebbel · 13y ago

Sounds lovely, but I've one worry:

Given N languages and M editors / IDEs, total toolchain effort is N x M… Any toolchain support for this number of systems is non-trivial. [SNIP] How do you solve matrix problems like this? Use a hub and spoke model.
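
The arithmetic behind the quoted N x M claim can be sketched in a few lines of Python. The language and editor lists are illustrative placeholders rather than anything the article enumerates, and the hub name `"index"` is an assumption standing in for a shared, language-neutral service.

```python
from itertools import product

# Illustrative placeholders: N = 4 languages, M = 3 editors.
languages = ["Java", "Python", "C++", "JavaScript"]
editors = ["Emacs", "Vim", "Eclipse"]


def pairwise_integrations(languages, editors):
    """Every editor implements its own support for every language: N x M pieces."""
    return list(product(languages, editors))


def hub_and_spoke_integrations(languages, editors, hub="index"):
    """Each language feeds the hub once and each editor reads it once: N + M pieces."""
    return [(lang, hub) for lang in languages] + [(hub, ed) for ed in editors]


print(len(pairwise_integrations(languages, editors)))       # 4 * 3
print(len(hub_and_spoke_integrations(languages, editors)))  # 4 + 3
```

The saving grows with both lists, but every integration now depends on the hub, which is where the indirection cost comes in.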

Indeed, every problem in computer science can be solved by yet another layer of indirection.

(and no, not putting every imaginable layer of indirection in there already is not flunking CS 101, like the author suggests. I hope he never becomes an architect somewhere).

The usual dangers of this extra layer of indirection are performance and configuration complexity. In fact, platforms like Eclipse have added many such layers already, and we can feel that (sluggish UI performance, conflicting plugins, random crashes).

I really hope Grok will work as well in practice as it could in theory.

lobster_johnson · 13y ago

Very interesting project. I hope documentation (i.e., docstrings/documentation comments) will be part of it; it seems like a natural extension of indexing mere grammar. It would be a boon to editors/IDEs to be able to extract documentation alongside autocompleted symbols.

nic-ferrier · 13y ago

This is interesting, I was only talking about doing this the other day, just for EmacsLisp on github and in the package repositories. Because of the constraint mine would be worse. So maybe better. Hmmmm.
