It's like a tweaked version of DataGrip + PyCharm, but catering specifically to the particular needs of data scientists.
It's pretty buggy, and they haven't made much progress on fixing the bugs. I've used it pretty much since it came out - I do Microsoft SQL Server & Postgres work on my Mac - and the thousands of open issues on a relatively new product say something:
That's generally a good thing. If a product had no issues that would mean that there's no uptick in usage.
> You may not sublicense the Software Code or any use of it
https://github.com/microsoft/azuredatastudio/blob/main/LICEN...
Would be super excited if you guys have any feedback. It's nowhere near perfect yet, but you can already use it to build some great data-centric use cases. Amongst others for sentiment analysis, conversational AI or finetuning of your embeddings (which you can check out here: https://github.com/code-kern-ai/refinery-sample-projects).
Let me know what you think :)
To exaggerate a bit, but I like that idea: With "regular programming" (not the best term, but I mean rule-based systems etc.), you used to develop via punch cards. You had to think multiple times before "compiling" something, right? I believe that we're currently in that phase regarding supervised learning development. If you have a labeling project, you need to plan this long in advance, ...
We're in love with VS Code, but we're missing something like this for AI. Our application tries to show how developers can program their training data, i.e. refine raw data into training data, and do so with many programmatic approaches. We're trying to show how something like this could look like (hence the title), and do so in NLP.
But again - I agree with you :)
[0]: https://marketplace.visualstudio.com/items?itemName=RandomFr...
If anyone is remotely interested in data-labelling/exploration, I would definitely recommend checking this out, it has some really exciting features, for example, built-in zero shot classification for heuristics/baselines: https://docs.kern.ai/docs/building-zero-shot-classifiers
I'm also really impressed with the architecture! Very neat.
Not affiliated with the project, just very pleased to see something like this as an open source release.
> I’m also really impressed with the architecture! Very neat.
thank you very much for your feedback, really appreciate it :) We work hard on the architecture of the product and are wrapping our heads constantly around how to make things faster, more stable, and scalable, which is much fun to do for such a data-intensive application. We are open to constructive feedback on our tool so please feel welcome to join our community on discord https://discord.gg/JzA3zDH2
For non-MSSQL things, it's usually DBeaver.
No need for a new IDE for me.
I know the price is going up in Oct, and I will likely 'buy ahead' 2-3 years at the current price now to save a bit extra.
generally anything that scales and "somewhat" guarantees the users to input valid labels.
But it definitely offers something that new tools don't: users are super familiar with it.
EDIT: I can some sources in other repo of the same org, for instance: https://github.com/code-kern-ai/refinery-ui so it's just a matter of making it easy for dev to navigate the code.
you can take a look at our architecture overview here: https://github.com/code-kern-ai/refinery#-architecture
A bit below it, you find a table with the links to all repositories. All of them are open-source. But thanks for the feedback, I'll try to make it a bit easier to understand! I appreciate that! :)
We've focused on JSON as the user-specified data model. So you can upload anything fitting into a JSON. We're using pandas to process the uploaded data, so spreadsheets or CSV-ish also work.
We've got a public roadmap (https://github.com/code-kern-ai/refinery/projects/1), and we're looking forward to also integrate e.g. native PDF labeling sometime soon.
We aim to extend on that idea by providing something that comes as close to a programmable interface for data-centric tasks as possible, and do so via open-source.
There are lots of cool tools out there btw. in that area. Definitely worth to have a look at a landscape (haha idea for the next HN post incoming I guess :D)