yeah my original idea was a sort of a Google Freebase
P2P Strikes Back Like KaZaAImagine a structured gigantic tree data where relational, tabular data of any data is available?
You can see why I was seduced by this idea of an all out liberalization of information by making the storage and computation decentralized and p2p reliant-selective one at that which allows you to create your own private "pool" of workers each with a rotation-des-adresses IP of your choosing.
Basically any data that is uploaded to a scrape:// URL which is seeded by other peers running the Scrape.it client (people you shared the URL with) will stay up there theoretically as long as there's "seeders", sorta like Bittorrent.
You could share a scrape:// URL and maybe somebody will knock down your doors (CIA_OPEN_UP) but if it's shared globally and theoretically if one uses other means of traditional anonymically inclined tools of your choosing online that rhymes with possibly XOR, then anything is possible ¯\_(ツ)_/¯
Large scale amounts of data can be crawled because speeding up the volume and speed is literally authorizing another peer to have write access to your local Data Sanctuary (by default only read access is granted), virtually even the most stubborn 2009 non-vanilla AJAXY-ANGULARLY-JQUERY-SPHAGEHTEI web apps where the backward navigation is broken, Scrape.it have powered right through, essentially dramatically lowering the cost barrier to that data available online.
Ex) it can try every order of form automation permutation combinations, for every option in select drop down, for instance: search this list of product id and crawl everything on this J2EE enterprisesque web app from 10 years ago.
The most recent discussion on how the web needs an open index, https://news.ycombinator.com/item?id=19713604, so that others may build on it will still the ideal standard.
Creating a completely free and decentralized bank for structured data that is impossible to completely take down once shared with other than your group, with full end to end encryption in-between, cryptographically verifiable list of order edits....
but alas, I'm really pressured to get this out the door so looks like I will just have to focus on the bare minimum in terms of client for now.
I originally aimed this to be a standalone self-hosted desktop/server tool...
anyways I digress, I need to get back to work. Lately honestly been a challenge mentally and emotionally(?) from some other life bullshit, its only when I get into the zone do I feel free