I would appreciate very frank advice on what I should do with my start up, Anaphoric Systems (http://www.anaphoric.com).
The whole effort started as a research project several years ago, but when I was informed that I actually owned the system, I entered a business plan competition and then started to execute on the plan.
The problem is that I have been wearing two hats: 'researcher/educator' and 'entrepreneur'. This has led to conflicts of interest as well as diluted effectiveness. So basically I need to decide if I am going to 1.) quit my tenured position, move to a start-up hub, find a partner, get-funding, etc. or 2.) open source the whole project and treat it as a research project. (It's 18k lines of LISP). I suppose in case 2 I could always do something commercial if the system was of high impact.
I am leaning toward option 2, but I thought I should run my plan by the YC community first.
Regards,
Michael Minock (http://www.cs.umu.se/~mjm)
A serious problem I have however is that this geography database isn't so interesting. Who is really going to learn anything from this rather low quality database? What I am looking for is some database that is: 1.) complex enough to preclude standard forms interfaces and 2.) contains information worth asking about. Not an easy task to identify, obtain such data sources.
I was thinking maybe a sports scores database or something...
I see it augmenting the standard facet-filter style browse. For example http://nymag.com/restaurants/: I enter "indian food in the east village" -- your product helps pre-arrange the facet filters for me, but I can still add a price range or another neighborhood to the query through standard UI.
The obvious benefit is that natural language browse offers quick access to complex queries (within the user's mental model). But if the results are presented well, your product can be used to train the user to understand the mapping between natural language and executable query (and in that way inform their mental model).
If that means that some users only need to use the natural language feature a few times (before switching to standard UI), it's still successful.
what's the biggest city in the us?
The response was:
"Sorry, but 'what's the biggest city in the us?' is not understood by the system. The system does not understand what the word ''s' means."
Which was disappointing.
2. Next up:
does portland have a bigger population than seattle?
Response:
Sorry, but 'does portland have a bigger population than seattle?' is not understood by the system. The system does not understand what the word 'bigger' means.
Boo. Also the word 'system' has no meaning to ordinary people, and only vague meaning for the rest of us.
3. Have you considered using a greater dataset for a better demo? Ie, police crimes, or CC data from wikipedia?
4. Show us what you can do on the front page. I don't give a damn about your sepia toned dude writing on a blackboard, or your brown colored optical fibre stock image, or that you, like everyone else, intends to be the prime supplier of whatever the hell it is you do.
who had the most stolen bases from 1935 to 1955? how many people have had back to back 5 hit games?
Are you interested?
I like your videos. Very well done. We can discuss via email. mjm@cs.umu.se is probably the best mail to use.
Option 2 might be your only real option, but even then I would think you might be able to use your work as a springboard into something more lucrative...?
That said, if you really just 'give up' any commercial hopes, a BSD style license is friendlier to companies who may want to do something with your code, and, if they're smart, maybe even contribute something back.
It may not be the right time for him to make the leap, but there is always a right time to make the leap (or give up on the business or on entrepreneurship in general). Since he's asked us here, and seems to have put a lot of thought into it, I suspect he is now at that point. Thus it is not a "needless risk". It may be exactly the risk that needs to be taken, and it may be exactly the right time to take it.
Entrepreneurship does involve putting some skin in the game. It's not optional and you generally can't be a regular guy with a regular job, and also a big entrepreneurial success. At some point, you have to decide what you want, and go after it with vengeance.
To answer your question: If I were in your position, I'd try to work an option 1.5 -- see if you can take an unpaid leave of absence for 4 months and spend that time working on your startup. Most universities are happy to let their faculty do this sort of thing, since they usually come back far more excited about their research than when they left.
Spend those 4 months working on your startup full-time, and plan on deciding at the start of the 4th month whether you're going to take your option 1 or your option 2. I don't think you can really make a rational decision without working on this full-time for at least a few months.
Re your startup: Have you considered putting it together with freebase? I don't know how structured your input databases need to be; but since freebase is basically wikipedia (EDIT: and apparently other data sources now, too) parsed and converted into a database, being able to run natural language queries against it could be very interesting...
Yes I am considered taking a leave next Spring. It would be difficult to Swing this Fall. And if I did take a leave, I would definitely move to a start-up hub. I really agree with Paul on the need for that.
As for freebase, yes I had considered that earlier and indeed I should check back. Alas a nice thing about having my system as an open-source system, would be that I could get a masters student to do the integration with freebase as part of a project...
Another idea might be to find a partner first, and let that person handle most of the business, and just concentrate on the technical aspects of it.
Good luck!
I was thinking that if I open-source it I would put it under a very open license (e.g. BSD) and assume/hope that anyone that extended it would make contact with me for help/advice.
Try and search for the shortest river. No dice. Given that it knows about second, third and the like I tried to rephrase this as "last longest river". All I got back was the longest river, it dropped 'last'. It knows average but not mean. It reports 'average length of a river' as 1117.2391304347826087. That's a great deal of precision for something without a unit of measure. What is this in: meters, kilometres, centimetres, feet, inches, miles? Given that the data is from America I would expect this to be in miles but I suspect that it is probably in kilometres.
Has this really taken 18k lines of code?
Stay put and knock this thing into shape on the universities dime then start your own company.
> this system has a looong way to go
yes it does... Natural language interfaces are a very hard area. Lots of failed attempts (e.g. MS EnglishQuery). This in fact is what attracted me to the area. If as a researcher I can help make NLIs more robust/usable, then that would be a 'result'.
> Has this really taken 18k lines of code?
Actually that is an approximate figure and includes test routines, comments, etc. Plus what is a line of lisp code? A large amount of code is to support non-linguist authoring of the interface. So in fact I was able to patch the system for "shortest river" and "mean" in a jiffy.
> Stay put and knock this thing into shape
Yes that's what I will probably do. I toned down Anaphoric's web-site and will work with this system from more of a research perspective. I will try my best to avoid open sourcing it, but in the end circumstances may compel that move...
"Server Down on localhost:9009 Please visit again in a couple of hours."
1. Don't spend any huge money on this project until it picks up significantly.
2. treat this as a hobby, which means don't lose sleep over it.
3. focus on your work that makes money if money is a factor
4. take it easy. leave the company as it is.
5. try to make this not be written in LISP. that is a lot of lines of code. convert it.
6. Keep everything going the same time. don't quit your job.
1. You have to take risks sometimes. He's done good work and it's worth putting some cash behind it (if necessary).
2. It's not a hobby. It's a business or a research project. In either case, it's a serious thing.
3. This project could be a money-maker in two ways: corporate sponsors for a research project, or corporate clients for a business product.
4. I agree with the take it easy part, but not the other part.
5. That's a joke right?
6. That's also a joke right?
Personally, I've been running my own (much simpler) VPS hosting startup (http://prgmr.com) for more than two years. I'm on my second Porsche worth of personal money plowed into the thing (ok, second Porsche boxter... I'm probably still on my first, you know, real Porsche.)
I dono. I mean, my company breaks even these days... most of what I've gotten out of it is experience and contacts (and those are worth something... my bill rate as a consultant verges on ridiculous, and I have a book deal. Pretty good for a Midwestern hick with no formal education.) I don't regret doing it, but if I had a chance to do it over I'd focus more on being more public (blogging, documenting, etc...) and less on the actual business.
If I could 'open source' my project, I think that would magnify the benefits (that is, credibility and contacts) But you can't really 'open source' an ISP, which is essentially what I am.
So yeah, personally I'd advise you to go for option #2, unless you can get significant investment from someone else.
But then, you sound like you started from a different position than I did. It's possible that because I had so little when I started the credibility is worth a lot more to me than it is worth to you.
Also, I have my own conflict of interest... your project sounds pretty cool, and I'd like to be able to use that technology for my own projects, or use products built by other people using that technology. Open-sourcing it is likely the best way to get the tech out to the masses.
Why does it have to be #1 or the other? Could you see about taking a one-year leave of absence from your tenured position? Seems that if you're successful you'll generate a lot of good publicity for your university. Furthermore, you could try to find a visiting relationship with a university at one of the hubs. For you it would be a good way to meet people and your current university could see you're still serious about the academy even if you only "visit" a minority of your time.
Treating your project as a research project does not equate to giving up.
I don't know enough about this market/competitors/technology/market size, etc to define what you should do going forward.
The question I think you should be asking is 'is this a viable business'.
I suspect most of the YC community who are telling you to go for it! are potentially speaking from a start-up attitude of 'live the dream' etc. etc. They may be right, but not every business is a good business.
Your technology might be the best out there, I just don't know. Personally, I wasn't blown-away by the demos, but I'm not saying that to be harsh, just that from my perspective, I'm not seeing your competitive advantage. You've put a bunch of work into this, and it may be the best in the market and able to make a significant contribution to the world.
Does your university have a business school, or entrepreneurs club or something? Maybe you could get somebody there to help with defining the market and business planning.
You ask us to choose either 1 or 2, yet we have no idea what situation and/or persons brought you to this apparent fork in the road. I suspect there may be a 3rd, 4th, &etc paths that prove better. Nor do I see 1 and 2 as mutually exclusive as you state. Many, if not most, researcher/educators are simultaneously entrepreneurs. And giving up tenure may prove costly, especially in the long run.
Perhaps tell us more about the conflicts of interest that have arisen. I sense that they involve other peoples wishes or meddling, which are usually best ignored or stepped on.
- what is the benefit of your algorithm and code over Lucene/SOLR? quantifiable metrics, e.g.precision/recall-wise? What natural languages does it parse/work best in?
- what kind of analysis, in layman's terms (Layman: somebody who's implmeented Lucene, sphinx, the search engine, not CMU sphinx, MySQL fulltext search, not an NLP researcher) From the company name, i would assume anaphor resolution is included, or something related, named entity resolution.
My suggestion is that you use your salary to hire someone to work fulltime on the project. You are presumably in a good position to find a smart employee who is interested in your work and will be able to contribute within a short period of time.
If you cannot make headway in the market with two developers/proponents, it seems unlikely you will succeed alone. Not all of the work needs to be technical - there's no reason you can't hire someone and have them handle marketing/sales.
Actually we (reviewgist.com) could also do with something like this. Infact, we spent some time working on a similar system for our needs, but anticipating a dis-proportionate consumer response dropped it. But if you go through our site, you would know exactly what I am talking about.
Also to answer your question, imo go with 1.
Step 2: Quit and start your startup
Step 3: If things don't work out, then go to your step #2, and move onto other jobs