I highly doubt that your queries are primarily coming from your own engine. The results are too good. It has good ranking, spelling correction, super large indices, and is fast. For example searching "GTGACCTTGGGCAAGTTACTTAACCTCTCTGTGCCTCAGTTTCCTCATCTGTAAAATGGGGATAATA" works even though it only occurs on a few pages and as the blog post that string came from explains, you need super fancy indexing techniques to handle things like that quickly. You also talk about this as if it is a single-person project, which makes it even less likely you made all this from scratch.
I like the concept and the parts that you undoubtedly make yourself like the UI, image retrieval and caching, are really good. This is a great site don't get me wrong. I just think you should be more forthcoming about where your results are coming from.
oh man. My absolute favorite assignment in undergrad computer science classes involved searching ~50GB of compressed text files containing protein sequences, to see how many times "ATG" or something occurred.
It was a ton of fun. Minimum requirements were to get the correct counts. Then it turned in to a competition to see who could make the fastest solution. You had like 8 machines at your disposal, each with the full dataset, to distribute whatever you wanted.
I think we did it in 3 languages or so (java, erlang, something else...).
For example it would have been nice to see a "results from Bing" message somewhere on the search page, or an item on the about page saying that you use another search engine's results.
It would actually have increased my confidence and impression of your project. I know that you can't have made a fantastic query engine like Bing/Google's without a ton of engineers so you must be using someone else's, and it would have given me a better impression if you said that up front instead of my having to infer that.
What's really made it difficult for competitors is being able to match Google's index. Bing crawls a lot of pages but is much more picky about what it index?
Also can you say where you are pulling the results from the Yandex API, Bing, Yahoo or Google itself?