A Simple AI Capable of Basic Reading Comprehension (opens in new tab)

(blog.ayoungprogrammer.com)

195 pointsyoungprogrammer10y ago61 comments

61 comments

47 comments · 16 top-level

The difference between this graph and propositional logic is only that the predicates joining concepts are arbitrary instead of logic operators. In that sense, this is like Google Knowledge Graph / Freebase.

https://en.wikipedia.org/wiki/Propositional_calculus#Solvers

Solving a set of propositional logic statements is NP-Complete. I'd argue "reading comprehension" is actually knowing the state of the world after a piece of text, which requires solving how these predicates interact. For example, if the paragraph is

  Bobby picked up the toy. Then he put down the toy.

This "semantic memory" does not "comprehend" where the toy is, and this is a relatively simple example. I think the title "basic reading comprehension" is thus inaccurate. Perhaps a better title is "A simple knowledge graph" or "A simple semantic memory"

aamar10y ago

Classic example:

    The iron ball fell on the glass table, and it shattered.
    The glass ball fell on the iron table, and it shattered.

Human readers will pick the correct antecedent for "it" in each case, so it's not ambiguous. But correct interpretation depends on knowing something about how likely glass and iron are to shatter.

I love these kinds of examples, but I don't think they ought to significantly deter the kind of work shown here; it depends on the application, but basic interpretation could be very useful even if it can't handle every case.

mrec10y ago

These are called Winograd Schemas; you can read more about them and see more examples at https://www.cs.nyu.edu/davise/papers/WS.html

kajecounterhack10y ago

Yeah, it's a bit of a nitpick over definition of "comprehension" but the program itself is obviously useful, I mean it's a poor man's version of Freebase. Freebase/Google Knowledge Graph doesn't claim to "comprehend" anything, it is just a large graph datastructure with an efficient querying mechanism. That's what this is (...given this is a poor man's version, efficient querying mechanism may be lacking for larger graphs)

1 more reply

oh_sigh10y ago

For those curious, this process is called "anaphora resolution", which is the process of understanding what a reference(usually a pronoun) actually refers to.

reptation10y ago

While this is a interesting point, grammarians will bristle at such an imprecise use of 'it', which should unambiguously refer to one antecedent.

1 more reply

T-Roy10y ago

OpenCyc is a really cool implementation of one solution to this problem.

The way it was explained to me, computers don't know that when you turn a full cup of coffee upside down, the liquid will fall out. Cyc attempts to provide a framework for that.

http://sw.opencyc.org

youngprogrammerOP10y ago

> Bobby picked up the toy. Then he put down the toy.

When we read this sentence, our brains automatically augment additional information based on the verb. However, in this example, my program will fail to answer because my program does not augment any additional information but it can be extended to.

This can be implemented in our program by created a new property for each object called "location". If a verb is location based, we can set the location of the object based on what the verb describes. For example, "the toy"'s location could be "Bobby's hands" after the first sentence based on the verb phrase "pick up". So the program will understand where the toy is and be able to understand queries related to "where".

As you can imagine, implementing this would be very tedious since there are too many cases for all the verbs. My program may not be able to do advanced reading comprehension (reading between lines and augmenting information) but I argue that it can do simple reading comprehension, in that it can understand the relationship between objects. There is still a long way to go before my program is capable of more sophisticated reading comprehension, but in theory, I think my approach seems possible.

markjspivey10y ago

there is a linguistic research called "usage based" and also "emergent grammar" that i find interesting regarding subjects like this:

1. meaning is usage. 2. structure emerges from usage.

meaning we don't use grammar to produce language, it is an emergent property (irreducible) of "pre-linguistic pragmatics" ...

borkabrak10y ago· 6 in thread

I love the ambition. But when you stated your next goals, I'm afraid I thought of this:

http://xkcd.com/1425/

I may very well be wrong, and I'd love it if you showed me I am. Good luck. As I said, I really would be excited to see this go farther.

meeper1610y ago

Using Word2Vec or better yet, what it was based on, https://www.kaggle.com/c/word2vec-nlp-tutorial/forums/t/1234... might make this possible and would be a good approach toward achieving future goals.

Take a look at text sentiment analyzers and processors that employ advanced machine learning algorithms like this too, http://52.10.12.34/TuataraSum

nyamhap10y ago

When was that xkcd posted? - couldn't not find a date.

Since machine vision is such a fast moving field, I think date is relevant for understanding the context of this xkcd. Is image recognition of a bird so inconceivable at the moment? Perhaps the joke now would be - "I'll need one researcher and one year"

wslh10y ago

You are taking the xkcd strip very literally instead of understanding the central concept on it.

You comment reminds me of this: http://www.uh.edu/engines/epi879.htm

2 more replies

seiji10y ago

Yeah, Randy was a bit miscalculating the estimate there. Within a month of that comic, flickr made http://code.flickr.net/2014/10/20/introducing-flickr-park-or...

1 more reply

ganarajpr10y ago

One researcher and a month ( perhaps even less ! ) is more like it..

1 more reply

sp33210y ago

September 24, 2014

mratzloff10y ago· 6 in thread

Some thoughts about creating a system like this:

Any successful implementation of comprehension must progressively enhance the world model based on additional information. Furthermore it must understand some basic rules, such as, "Any subject, set of subjects, or actions can be represented multiple ways."

So if you said, "Mary's brother is Sam," or "Mary has a brother named Sam," or "Mary's brother is named Sam," the world model must collocate the meanings "Sam" and "brother" for Mary and be able to respond to queries about either.

Further, if you mention that John also has a brother named Sam, and then you mention Sam in an ambiguous context, the program should be smart enough to ask, "Which Sam? Mary's brother or John's?" Infocom games did this; you are building a more flexible world model builder, but the parser would operate similarly.

There is also the concept of recency. If I talk about "John's brother Sam", ignoring the fact that pronoun references to "he" should be contextually mapped correctly, and then mention Sam, the program should not need to ask which Sam I mean. It would be like talking to someone who wasn't paying attention.

Finally, there is also the concept of confidence. In the face of ambiguity that can't be resolved, a confidence rating should be assigned based on available information and future answers should be based on that confidence.

I suspect that if someone were to create a language parser that could create a mostly-accurate world model AND modify itself based on new rules it read (e.g., "When some says 'he' after referring to someone's name, they are almost certainly talking about the man they previously referred to"), you would be 90% of the way to creating a useful virtual intelligence. It would of course not be able to reason or have opinions of its own, but it would be extremely useful as a virtual assistant that could learn your preferences over time.

nileshtrivedi10y ago

Not only this, the system would have to maintain multiple levels of world models and simulate them as well. Consider a sentence: "If he goes there, and then she does not reach in time, he will surely be disappointed."

There has to be a hierarchy of world models, and only the bottom-most layer would be the "real" one. Any kind of planning will require simulating hypothetical worlds.

phkahler10y ago

>> Finally, there is also the concept of confidence. In the face of ambiguity that can't be resolved, a confidence rating should be assigned based on available information and future answers should be based on that confidence.

That should probably be the first thing ;-) Every word has a probable meaning, and when all of them fit together into a coherent way including context, then the meaning is correct (probably). This also encompasses the use of pronouns, which you touch on. An inability to resolve ambiguity should also point to the appropriate question to ask for clarification, which you also mentioned. I think your points are all great, just wanted to point out that I think the notion of confidence should be more central.

I'd also go as far as saying part of the internal model of the world should also have a confidence. Any failure to understand a sentence may actually be a problem with its world view. But now I'm rambling.

mratzloff10y ago

I'd also go as far as saying part of the internal model of the world should also have a confidence. Any failure to understand a sentence may actually be a problem with its world view.

Yes, but that happens automatically by virtue of learning through natural language teaching. Put the VI through school.

phkahler10y ago

In the blog comments someone raises proposes the sentence "Mary saw the jaguar". And points out that it wouldn't know if the jaguar was an animal or a car. Once again, confidence comes in but it would be based on context. Or perhaps it would become clear after more conversation.

RHSeeger10y ago

> Further, if you mention that John also has a brother named Sam, and then you mention Sam in an ambiguous context, the program should be smart enough to ask, "Which Sam? Mary's brother or John's?"

Option 3: The only Sam, since Mary and John are brothers :)

jessaustin10y ago

Back to the drawing board, you've failed this Turing test! (Mary is no one's brother.)

logicallee10y ago· 5 in thread

I was intrigued by the initial transcript, but disappointed that it was edited (lightly). For the question "Why did Mary cheer?" the screenshot showed that the simple AI literally answered "because IT be HER FIRST TIME WINNING". This is correct, but the author edited it into a correct sentence for us.

I think it is too much to call this "capable of basic reading comprehension". Surely, "simple sentence parser can answer reading comprehension questions" would be more correct?

I mean it is quite an accomplishment, but there is no understanding here. In some natural languages with less inflection or change in word order, for example, you could answer any "why" questions with the regex /($question_string) because (.+?)\./ against some source corpus, and then $2 will contain your answer. It doesn't work with English due to slight changes in word order, but surely it would be too much to state that this regex is capable of basic reading comprehension in languages it does work in.... If I'm allowed to massage the question the way the author massaged the output, look at this fine result:

http://ideone.com/84QSHC (output at bottom)

Would you say that 14-line Perl program is capable of basic reading comprehension? I wouldn't!

youngprogrammerOP10y ago

> I was intrigued by the initial transcript, but disappointed that it was edited (lightly). For the question "Why did Mary cheer?" the screenshot showed that the simple AI literally answered "because IT be HER FIRST TIME WINNING". This is correct, but the author edited it into a correct sentence for us.

For some reason, the library I used to get the present tense of "was" is "be". I had a manually fix for this but accidentally removed it when I was cleaning up the code. Sorry if I disappointed you

> I think it is too much to call this "capable of basic reading comprehension". Surely, "simple sentence parser can answer reading comprehension questions" would be more correct?

I would say it is capable of basic reading comprehension because it is attempts to build relationships between different objects although the relationships are weak. When humans do reading comprehension, we do what my program does which is try to parse the sentence and understand the relationships. But brains are also able to augment a lot more information and thus be more flexible with understanding.

> Would you say that 14-line Perl program is capable of basic reading comprehension? I wouldn't!

I would say it is not because it does not understanding the relation between objects; it only understands that sentences starting with "why" should be answered with everything after the string "because". Also, in your program, if there are two instances of "because" in the source, your program will only choose the first one.

kajecounterhack10y ago

  > Would you say that 14-line Perl program is capable of
  basic reading comprehension? I wouldn't!

  I would say it is not because it does not understanding the 
  relation between objects; it only understands that sentences   
  starting with "why" should be answered with everything after 
  the string "because". Also, in your program, if there are 
  two instances of "because" in the source, your program will 
  only choose the first one.

This line is arbitrary. These programs aren't "understanding" anything, they are just able to generate different facts. Your "AI" is very input-dependent too, which was his point.

I am still bothered by the title, as your program doesn't "comprehend" the text -- only you do that. Your program restructures the text and answers to a very limited querying mechanism; there's no comprehension, only information retrieval. They're not the same. Knowledge Graph and Freebase are comparable (/ much larger) efforts.

logicallee10y ago

Thanks for the reply! Thanks for sharing your work with us as well. It's impressive.

It's not disappointing, it's quite a feat. I suppose I would say that what I was trying to say is that "parsing", while impressive, is not a large part of "understanding" in my opinion. To use an analogy: almost by definition, compilers parse languages like C++ much better than humans do (basically perfectly, unless there's literally a bug in the compiler or it doesn't follow the standard due to some error).

But that doesn't mean they understand the programs (at all.) A compiler has no idea on an algorithmic level what a program might be doing (and if you remove comments, maybe a person won't understand it either, if they're not familiar with the algorithm.)

So my basic objection is that you're really calling this reading comprehension, but I don't think anything is actually being "understood"; just parsed. A better title would be as I suggested: simple AI correctly answers reading comprehension questions.

The reason that I object to "comprehension" is that these days there really are a few "deep learning" systems, that can possibly synthesize information. (I don't know that much about them.) I don't think it's fair to elevate semantic parsing to the level of comprehension.

However the comment by tariqali34 makes a good point, that perhaps this is a criticism of reading comprehension tests. I know in multiple-choice tests from standardized exams, I've been able to correctly answer reading comprehension questions about texts that I didn't even read (by finding just the sentence that talks about it), or in other cases, texts that were too technical and that I didn't understand.

I would say that I would be able to answer a question about some biomedical excerpt that I can't understand a word of, I just can't make heads or tails of it, let's say:

In order to study the physiological roles of AGC kinases, a commonly used approach has been to over-express the active forms in cells. However, due to the overlapping substrate specificities of many AGC kinases, it is likely that the over-expression of one member of this kinase subfamily will result in the phosphorylation of substrates that are normally phosphorylated by another AGC kinase. Another strategy has been to over-express catalytically inactive ‘dominant negative’ mutants of AGC kinases in cells. However, such mutants are likely to interact with and inhibit the upstream protein kinase(s) that they are is activated by, and thus prevent the ‘upstream’ kinase(s) from phosphorylation of other cellular substrates. For example, a dominant negative RSK may interact with ERK1/ERK2 preventing the activation of MSK isoforms and hence the phosphorylation of CREB (cAMP-response-element-binding protein) [9]. Furthermore, in Saccharomyces cerevisiae, over-expression of catalytically inactive Rck2p, a kinase that binds to and is activated by the Hog1P MAPK, sequestered the substrate-docking site of the Hog1P kinase, thereby preventing Hog1P from interacting with other substrates. Thus catalytically inactive Rck2P is acting as a dominant negative mutant of Hog1P and not Rck2P.

I couldn't answer a REAL reading-comprehension test about this: I just have no idea what it's REALLY talking about, I don't actually understand it. (Obviously on a syntactic level, it's not hard to parse.) I don't know what over-expression is, I don't know what a kinase is, I don't know what phosphorylation is. I don't understand the text. But if the questions are simple, perhaps I could answer some reading comprehension questions about this by parroting back quotations from it. Syntactically, there's nothing difficult here. I just don't understand it.

So I think it's unfair to call sentence parsing real reading comprehension, even if sometimes reading comprehension tests fail to differentiate between the two. You can parse sentences perfectly while understanding nothing. For example, I could answer the question "what is wrong with studying the physiological role of AGC kinases by overexpressing the active forms in cells?" which the first sentence refers to. I can just quote the second sentence "Due to the overlapping substrate specificities of many AGC kinases, it is likely that the over-expression of one member of this kinase subfamily will result in the phosphorylation of substrates that are normally phosphorylated by another AGC kinase". I don't understand, but I think I correctly parroted.

So there are real issues in determining comprehension. The higher the level of the question that is asked, the harder it is to answer without actually understanding the text.

I did find your work very interesting, thank you.

1 more reply

tariqali3410y ago

>I think it is too much to call this "capable of basic reading comprehension". Surely, "simple sentence parser can answer reading comprehension questions" would be more correct?

We test whether a human have "basic reading comprehension" by asking them reading comprehension questions. If they answer the questions successfully, then we assume that they have this "reading comprehension" skill. Therefore, if an AI can answer these questions, then it must also have "reading comprehension".

Maybe these "reading comprehension questions" don't actually test reading comprehension, just pattern matching and sentence parsing. In which case, we shouldn't be asking these questions to anyone (human or AI). So what we need are new questions.

kajecounterhack10y ago

We test human "reading comprehension" with the assumption that humans have an underlying world model.

This is starkly different from a machine whose only ability is information retrieval -- this program (and more advanced versions e.g Freebase) parse and restructure text to be more retrievable. But there is no comprehension because there is no world model.

You wouldn't call parsing a google search query "reading comprehension."

1 more reply

dave_sullivan10y ago· 3 in thread

Not to be discouraging, but I think research along these lines http://arxiv.org/abs/1503.08895 stands an exponentially better chance of leading towards what OP is talking about. For anyone interested in "building AI", read that paper and all it's references.

marcosdumay10y ago

You know, high order logic memories are a completely different kind of intelligence, that, if somebody makes it work, can be much more capable than plain pattern matching techniques.

Granted that it's a fundamentally more difficult problem, so the pattern matching will probably get there earlier.

kaffeemitsahne10y ago

Just because it could be done with the current hip thing (neural nets) doesn't mean all other approaches should be disregarded.

dave_sullivan10y ago

They're the current hip thing because they work really well and keep working better. They take a fundamentally different (I think better) approach than he's taking.

My recommendation comes from a more informed place than "Dur, neural networks!" Of course, this is my opinion, and you are welcome to have a different one.

What is your recommendation on the topic of most promising research areas for teaching reading comprehension to computers? Skip deep learning and read what instead? Or the OP has it figured out?

1 more reply

creyer10y ago· 2 in thread

I think the AI should do more than just correlate some verbs. It should also be capable of understanding concepts like. If I give the following: "John and I are brothers. My mother has a brother named James. " And we ask: "What is the name of my uncle?" The initial results are great but in my humble opinion the big quest is to make computer learn concepts.

veb10y ago

You obviously sound like you're interested! Why not fork the repo, contribute? :) It's a great demo project, and I love seeing these on HN.

OP - Instead of linking directly to en/Stanford Parser etc, you should get together a list of dependencies people need to run your application. Usually as easy as 'pip install pattern' (for the 'ImportError: No module named en') which is `import pattern.en` :-)

I like it! It's nearly 2am so better catch some sleep, but I'm definitely going to have a look further tomorrow.

youngprogrammerOP10y ago

> It's a great demo project, and I love seeing these on HN.

Thanks!

> OP - Instead of linking directly to en/Stanford Parser etc, you should get together a list of dependencies people need to run your application. Usually as easy as 'pip install pattern' (for the 'ImportError: No module named en') which is `import pattern.en` :-)

I didn't actually pip install anything for my project, I just downloaded and extracted the Stanford Parser, and Nodebox Linguistics libraries. The setup should be in the readme. I'll try to see if I can find the pip dependencies and update the readme.

vonnik10y ago· 1 in thread

This is a really neat, rules-based, Chomskyian NLP (as opposed to the statistical kind represented by Word2vec). It's an old division...

http://norvig.com/chomsky.html

The essential question is: Can it go from basic reading comprehension to advanced just by adding more rules. Is intelligence simply 10 million rules? If so, how do we go about creating new rules as language evolves? By hard-coding them, as in the example code?

The real test for general AI and NLP is how well it, well, generalizes; i.e. how well does it deal with situations we have not explicitly anticipated?

In my opinion, the fuzzy, statistical methods @davesullivan mentions have a better chance at generalizing (although they may well be augmented by rules-based AI).

If an AI doesn't have a good way of transferring what it knows to novel problems, then it is severely limited. It's treating the world like a canned problem with a finite number of possibilities, like chess or checkers, when in fact the world is much more complex.

The way DeepMind combines deep learning and reinforcement learning is one way of acknowledging that complexity.

Deep learning learns patterns in raw sensory data, which means it can ingest and handle the new. Reinforcement learning learns to perform actions over a series of unknown states, improving its choices by monitoring the rewards it receives for those actions. They both maximize within uncertainty, and I think that's our best bet going forward.

Because the world, and language, cannot be known in their entirety. The number, motion and interrelation of the atoms of air in the room where I'm typing this are all too large and complex to be computable. Their fluid dynamics can only be vaguely guessed at, not deterministically predicted in a few lines of code.

The trick will be to bridge the gap between the hard-coded, limited rules and the unlimited recombinations of language, which is inventing new rules and words all the time.

youngprogrammerOP10y ago

> The essential question is: Can it go from basic reading comprehension to advanced just by adding more rules. Is intelligence simply 10 million rules? If so, how do we go about creating new rules as language evolves? By hard-coding them, as in the example code?

I believe that it can go from basic reading comprehension to more advanced by adding many rules but of course manually adding them is not very feasible or scaleable.

> In my opinion, the fuzzy, statistical methods @davesullivan mentions have a better chance at generalizing (although they may well be augmented by rules-based AI).

I agree that a statistical model would be better since it will be able to handle more complexity. It would be much easier to train the rules from a dataset instead of hard coding all of them and it would be able to adapt to new rules as well. However, I could not find a good data set for the task I wanted.

ClintEhrlich10y ago

Thanks for sharing your work. As a hobby, I have spent years working on extracting semantic ontologies from natural language, so it was fun to see someone else's take on the problem.

As other have mentioned, you will make progress more efficiently if you survey the linguistics literature, where a tremendous number of very smart people have spent decades grappling with the same essential problems.

Heterodox linguistics is a veritable goldmine of ideas that can be implemented in AI. My favorite approach is Richard Hudson's "word grammar," which you can read about here: http://www.phon.ucl.ac.uk/home/dick/wg.htm

Word grammar is particularly well suited for coding, because it strips away lots of arbitrary linguistic formalisms in favor of a flexible, network-centric framework. Some of the core principles, like default inheritance, were actually taken directly from computer science.

anigbrowl10y ago

Nice work, but but isn't this way of breaking down sentence structure already a standard thing? I get the impression that the writer didn't find much on this in the AI literature, but reinvented a wheel that has been long-established in linguistics departments.

https://en.wikipedia.org/wiki/Sentence_diagram

I don't mean that in a dismissive way - even if it was a reinvention of the wheel it's still an elegant and useful one. It would be interesting to work up to larger chunks of text, and also to encapsulate ambiuities in some way, such that if presented with a sentence that admits of two meanings the program could honestly say 'I don't know, tell me more.'

antome10y ago

I always find it interesting how many things can be represented by finite state automata, and related concepts. I wonder what languages/libraries specialise specifically in handling machines of a directed-graph style? I would imagine VHDL and co. have functionality for it.

markjspivey10y ago

this post and many of the comments here don't necessarily have a "semiotic" or "usage-based language acquisition" or "emergent-grammar" nature about them ...

which is fine ... just different results ...

specifically:

1. meaning is usage. 2. structure emerges from usage.

this post and many of the comments have a world view akin to:

1. meaning is structure. 2. usage emerges from structure.

what i mean by this is that the "analysis" of the text doesn't exist in the same "world" as the text.

meaning that its nothing like "real" (natural) language.

what i mean further by this is simple:

humans don't "use" natural language... it is an emergent property of other systems of externalized behaviors and such by individual humans.

and such emergent properties and systems are also evident in any development of this actual system and many of the comments here.

producing the comprehension introduces in-comprehensible things ... or at best just divorces systems (discontinuous) ... at which point any thing can be any thing, so debating it as such here doesn't even matter (value) .

im not exactly sure what im saying here but it is akin to:

1. nthorder cybernetics (mostly like 3rd and 4th and such) 2. autopoesis (humberto maturana and francisco valera)

im going through the same type of analysis regarding "activity stream" type APIs which use an "actor verb object" type form ... (usage from structure) ...

sylphiae10y ago

Really impressive program! I'm a beginner as well and I'd like to know about how you learned to program. Your code looks advanced to me:)

Tobu10y ago

This rules-based framework is way too rigid for “reading comprehension”. It would fail to recognize much from naturally written sentences.

acd10y ago

Very cool project. I would be good to have an AI similar to this and if it could read and comprehend lots of research articles. I dream of a AI capable of reading all research articles on the latest battery tech and then be able to understand and make recomendations from that. The question I would like us to ask the AI how would you create the worlds most efficient battery?

ThomPete10y ago

It's thanks to people like you things move forward for that I thank you.

Of course it's going to be challenging, who cares – you are going to learn a lot of things even if this attempt fail and you are going to allow other people to stand on your shoulders.

sabujp10y ago

this is first year ai stuff

j / k navigate · click thread line to collapse

61 comments

47 comments · 16 top-level

kajecounterhack10y ago· 8 in thread

https://en.wikipedia.org/wiki/Propositional_calculus#Solvers

  Bobby picked up the toy. Then he put down the toy.

aamar10y ago

Classic example:

    The iron ball fell on the glass table, and it shattered.
    The glass ball fell on the iron table, and it shattered.

Human readers will pick the correct antecedent for "it" in each case, so it's not ambiguous. But correct interpretation depends on knowing something about how likely glass and iron are to shatter.

mrec10y ago

These are called Winograd Schemas; you can read more about them and see more examples at https://www.cs.nyu.edu/davise/papers/WS.html

kajecounterhack10y ago

1 more reply

oh_sigh10y ago

For those curious, this process is called "anaphora resolution", which is the process of understanding what a reference(usually a pronoun) actually refers to.

reptation10y ago

While this is a interesting point, grammarians will bristle at such an imprecise use of 'it', which should unambiguously refer to one antecedent.

1 more reply

T-Roy10y ago

OpenCyc is a really cool implementation of one solution to this problem.

The way it was explained to me, computers don't know that when you turn a full cup of coffee upside down, the liquid will fall out. Cyc attempts to provide a framework for that.

http://sw.opencyc.org

youngprogrammerOP10y ago

> Bobby picked up the toy. Then he put down the toy.

markjspivey10y ago

there is a linguistic research called "usage based" and also "emergent grammar" that i find interesting regarding subjects like this:

1. meaning is usage. 2. structure emerges from usage.

meaning we don't use grammar to produce language, it is an emergent property (irreducible) of "pre-linguistic pragmatics" ...

borkabrak10y ago· 6 in thread

I love the ambition. But when you stated your next goals, I'm afraid I thought of this:

http://xkcd.com/1425/

I may very well be wrong, and I'd love it if you showed me I am. Good luck. As I said, I really would be excited to see this go farther.

meeper1610y ago

Take a look at text sentiment analyzers and processors that employ advanced machine learning algorithms like this too, http://52.10.12.34/TuataraSum

nyamhap10y ago

When was that xkcd posted? - couldn't not find a date.

wslh10y ago

You are taking the xkcd strip very literally instead of understanding the central concept on it.

You comment reminds me of this: http://www.uh.edu/engines/epi879.htm

2 more replies

seiji10y ago

Yeah, Randy was a bit miscalculating the estimate there. Within a month of that comic, flickr made http://code.flickr.net/2014/10/20/introducing-flickr-park-or...

1 more reply

ganarajpr10y ago

One researcher and a month ( perhaps even less ! ) is more like it..

1 more reply

sp33210y ago

September 24, 2014

mratzloff10y ago· 6 in thread

Some thoughts about creating a system like this:

nileshtrivedi10y ago

There has to be a hierarchy of world models, and only the bottom-most layer would be the "real" one. Any kind of planning will require simulating hypothetical worlds.

phkahler10y ago

mratzloff10y ago

I'd also go as far as saying part of the internal model of the world should also have a confidence. Any failure to understand a sentence may actually be a problem with its world view.

Yes, but that happens automatically by virtue of learning through natural language teaching. Put the VI through school.

phkahler10y ago

RHSeeger10y ago

> Further, if you mention that John also has a brother named Sam, and then you mention Sam in an ambiguous context, the program should be smart enough to ask, "Which Sam? Mary's brother or John's?"

Option 3: The only Sam, since Mary and John are brothers :)

jessaustin10y ago

Back to the drawing board, you've failed this Turing test! (Mary is no one's brother.)

logicallee10y ago· 5 in thread

I think it is too much to call this "capable of basic reading comprehension". Surely, "simple sentence parser can answer reading comprehension questions" would be more correct?

http://ideone.com/84QSHC (output at bottom)

Would you say that 14-line Perl program is capable of basic reading comprehension? I wouldn't!

youngprogrammerOP10y ago

For some reason, the library I used to get the present tense of "was" is "be". I had a manually fix for this but accidentally removed it when I was cleaning up the code. Sorry if I disappointed you

> I think it is too much to call this "capable of basic reading comprehension". Surely, "simple sentence parser can answer reading comprehension questions" would be more correct?

> Would you say that 14-line Perl program is capable of basic reading comprehension? I wouldn't!

kajecounterhack10y ago

  > Would you say that 14-line Perl program is capable of
  basic reading comprehension? I wouldn't!

  I would say it is not because it does not understanding the 
  relation between objects; it only understands that sentences   
  starting with "why" should be answered with everything after 
  the string "because". Also, in your program, if there are 
  two instances of "because" in the source, your program will 
  only choose the first one.

This line is arbitrary. These programs aren't "understanding" anything, they are just able to generate different facts. Your "AI" is very input-dependent too, which was his point.

logicallee10y ago

Thanks for the reply! Thanks for sharing your work with us as well. It's impressive.

I would say that I would be able to answer a question about some biomedical excerpt that I can't understand a word of, I just can't make heads or tails of it, let's say:

So there are real issues in determining comprehension. The higher the level of the question that is asked, the harder it is to answer without actually understanding the text.

I did find your work very interesting, thank you.

1 more reply

tariqali3410y ago

>I think it is too much to call this "capable of basic reading comprehension". Surely, "simple sentence parser can answer reading comprehension questions" would be more correct?

kajecounterhack10y ago

We test human "reading comprehension" with the assumption that humans have an underlying world model.

You wouldn't call parsing a google search query "reading comprehension."

1 more reply

dave_sullivan10y ago· 3 in thread

marcosdumay10y ago

You know, high order logic memories are a completely different kind of intelligence, that, if somebody makes it work, can be much more capable than plain pattern matching techniques.

Granted that it's a fundamentally more difficult problem, so the pattern matching will probably get there earlier.

kaffeemitsahne10y ago

Just because it could be done with the current hip thing (neural nets) doesn't mean all other approaches should be disregarded.

dave_sullivan10y ago

They're the current hip thing because they work really well and keep working better. They take a fundamentally different (I think better) approach than he's taking.

My recommendation comes from a more informed place than "Dur, neural networks!" Of course, this is my opinion, and you are welcome to have a different one.

What is your recommendation on the topic of most promising research areas for teaching reading comprehension to computers? Skip deep learning and read what instead? Or the OP has it figured out?

1 more reply

creyer10y ago· 2 in thread

veb10y ago

You obviously sound like you're interested! Why not fork the repo, contribute? :) It's a great demo project, and I love seeing these on HN.

I like it! It's nearly 2am so better catch some sleep, but I'm definitely going to have a look further tomorrow.

youngprogrammerOP10y ago

> It's a great demo project, and I love seeing these on HN.

Thanks!

vonnik10y ago· 1 in thread

This is a really neat, rules-based, Chomskyian NLP (as opposed to the statistical kind represented by Word2vec). It's an old division...

http://norvig.com/chomsky.html

The real test for general AI and NLP is how well it, well, generalizes; i.e. how well does it deal with situations we have not explicitly anticipated?

In my opinion, the fuzzy, statistical methods @davesullivan mentions have a better chance at generalizing (although they may well be augmented by rules-based AI).

The way DeepMind combines deep learning and reinforcement learning is one way of acknowledging that complexity.

The trick will be to bridge the gap between the hard-coded, limited rules and the unlimited recombinations of language, which is inventing new rules and words all the time.

youngprogrammerOP10y ago

I believe that it can go from basic reading comprehension to more advanced by adding many rules but of course manually adding them is not very feasible or scaleable.

> In my opinion, the fuzzy, statistical methods @davesullivan mentions have a better chance at generalizing (although they may well be augmented by rules-based AI).

ClintEhrlich10y ago

Thanks for sharing your work. As a hobby, I have spent years working on extracting semantic ontologies from natural language, so it was fun to see someone else's take on the problem.

anigbrowl10y ago

https://en.wikipedia.org/wiki/Sentence_diagram

antome10y ago

markjspivey10y ago

this post and many of the comments here don't necessarily have a "semiotic" or "usage-based language acquisition" or "emergent-grammar" nature about them ...

which is fine ... just different results ...

specifically:

1. meaning is usage. 2. structure emerges from usage.

this post and many of the comments have a world view akin to:

1. meaning is structure. 2. usage emerges from structure.

what i mean by this is that the "analysis" of the text doesn't exist in the same "world" as the text.

meaning that its nothing like "real" (natural) language.

what i mean further by this is simple:

humans don't "use" natural language... it is an emergent property of other systems of externalized behaviors and such by individual humans.

and such emergent properties and systems are also evident in any development of this actual system and many of the comments here.

im not exactly sure what im saying here but it is akin to:

1. nthorder cybernetics (mostly like 3rd and 4th and such) 2. autopoesis (humberto maturana and francisco valera)

im going through the same type of analysis regarding "activity stream" type APIs which use an "actor verb object" type form ... (usage from structure) ...

sylphiae10y ago

Really impressive program! I'm a beginner as well and I'd like to know about how you learned to program. Your code looks advanced to me:)

Tobu10y ago

This rules-based framework is way too rigid for “reading comprehension”. It would fail to recognize much from naturally written sentences.

acd10y ago

ThomPete10y ago

It's thanks to people like you things move forward for that I thank you.

Of course it's going to be challenging, who cares – you are going to learn a lot of things even if this attempt fail and you are going to allow other people to stand on your shoulders.

sabujp10y ago

this is first year ai stuff

j / k navigate · click thread line to collapse