Like 4o with a CBT overview, or a succinct Claude. Excited to hear your thoughts!
It's a Tauri 2.0 desktop app (not Electron!), so it is using the Mac's native browser view and a Rust backend. It also makes DMG size relatively small (~25mb but we can get it much smaller once we get rid of some bloat).
Right now Chorus is proxying API calls to our server, so it's free to use. We didn't add bring-your-own-api key to this version because it was a bit quicker to ship. This was kind of an experimental winter break project, so didn't think too hard about it. Likely will have to fix that (and add bring your own key? or a paid version?) as more of you use it :)
Definitely planning on adding support for local models too. Happy to answer any other questions, and any feedback is super helpful (and motivating!) for us.
UPDATE: Just added the option to bring your own API keys! It should be rolling out over the next hour or so.
Another mind-numbingly obvious feature — hitting enter should just create a new-line. And cmd-enter should submit. Or at least have it configurable for this.
(EDITED for clarity)
> Typing messages in these chat apps quickly becomes tedious and autocompletions help a lot with this.
If you're on Mac, you can use dictation. focus text-input, double-tap control key and just speak.
My Mac now has built in copilot style completions (maybe only since upgrading to Sequoia?). They're not amazing but they're decent.
https://support.apple.com/guide/mac-help/typing-suggestions-...
Two or so years ago I built a localhost web app that lets me trivially fork convos, edit upstream messages (even bot messages), and generate an audio companion for each bot message so I can listen to it while on the move.
I figured these features would quickly appear in ChatGPT’s interface but nope. Why can’t you fork or star/pin convos?
Wow that's a bit scary (use GPT a lot) how bad a fail that is!
https://beta.gitsense.com/?chats=ba5f73ac-ad76-45c0-8237-57a...
The left window contains all the models that were asked and the right window contains a summary of the LLM responses. GPT-4o mini got it right but the super majority got it wrong, which is scary.
It wasn't until the LLM was asked to count out the R's that it acknowledges that GPT-4o mini was the only one that got it right.
Edit: I've disabled chatting in the app, since I don't want to rack up a bill. Should have mentioned that.
At a technical level, they don't know because LLMs "think" (I'd really call it something more like "quickly associate" for any pre-o1 model and maybe beyond) in tokens, not letters, so unless their training data contains a representation of each token split into its constituent letters, they are literally incapable of "looking at a word". (I wouldn't be surprised if they'd fare better looking at a screenshot of the word!)
Let me count carefully: s-t-[r]-a-w-b-e-[r]-[r]-y
There are 3 Rs in "strawberry".
It could make available only the LLLMs that your Mac is able to run.
Many Silicon owners are sitting on very able hardware without even knowing.
I use it mainly because my LLM runs on a server, not my usual desktop.
uvx --python 3.11 open-webui serveIt is fast, native and cross-platform (built with Qt using C++ and QML).
Likely going to add bring your own API keys (or a paid version) soon.
Update: just added option to bring your own keys! Should be available within an hour.
The app presented the option of prompting additional models, including Gemini Flash 2.0, one I'd never used before. It gave the best response and was surprisingly good.
Curious to know how Chorus is paying for the compute, as I was expecting to have to use my own API keys.
I just checked to see if it was signed, without running it. It is. I don't care to take the risk of running it even if it's signed. If it were a web app I'd check it out.
I don't know if there's any sort of login. With a login, they could throttle based on that. Without a login, it looks like they could use this to check if it's being used by an Apple computer. https://developer.apple.com/documentation/devicecheck/valida...
What isn't there and would be useful is to not have them side by side but rather swipable. When you're using for code comparisons even 2 gets stuffy
https://beta.gitsense.com/?chat=51219672-9a37-442d-80a3-14d8...
It provides a summary of all the responses and if you click on "Conversation" in the user message bubble, you can view all the LLM responses to the question of "How many r's in strawberry".
You can fork the message as well and say create a single response based on all responses.
Edit: The chatting capability has been disabled as I don't want to incur an unwanted bill.
A number of terminals can also do that natively (kitty comes to mind).
But the actual amount of effort to get to the level of dropbox in a multiple device context is a number of magnitude higher than the triviality of autoloading a handful of cli tool in different panes and synchronizing them in tmux.