Onboarding my non-software engineer teammates to it has super-charged them and essentially given them all their own personal developer that can automate tasks for them. Managing codebases, etc. is still a hassle though.
90% of the power of Excel was that it was functionally a database that a normal person could actually use. I think we'll see something similar with coding agents.
That's what they aim Claude Cowork at. Every executive/leader I've shown Claude Cowork to has gone from 'what is AI' to 'vibecoding whole apps' in weeks. Then when Claude is down for an hour, they get visibly angry and don't remember how to do anything pre-Claude :)
I understand the impulse to provide a UI to manage codebases, etc. But my observation is that these people just ask Claude to do whatever it is they need done. Codebase needs managing? They just ask Claude to do it. No idea how to deploy an app? They just ask Claude to do it.
Any app built on top of this stack to 'make it easier' is competing with 'I don't care what's happening, just ask Claude to do it'.
The drug is scary when everyone is depending on it. I wonder what is future like.
Do you, and those executives, own the risks associated with that practice? Are those risks actually indemnified?
Its neat that 'anyone can do anything' but if they don't actually know what the risk to business or 3rd parties, why is this a good thing, especially in the enterprise where there are actors who are explicitly looking for this type of environment to exploit?
Reading the first part, I was going to say they don’t even care about whether or not there’s a codebase. It doesn’t matter; it could be all gremlins and hamsters in wheels for all they care, and for all they should care. All that matters is the functionality, the value it gives them.
We’re even getting disposable code now. Entire single-use ephemeral web apps, built on the go to enable, visualise, or simplify a specific thing, then thrown away.
Will it all lead to some trouble? Definitely. So did computers, and so did the internet.
Weird times. Fun times.
To put it another way, the customers of these frontier models are implicitly being competed against by the model itself.
Withdrawal symptoms. We've all been there.
We are now measuring productivity in lines-of-code. This is not going to end well, not to mention introduce massive amounts of burnout.
That would be a capable 'personal assistant', or 'executive assistant', of 'chief of staff'.
Why? because the point is, just like in real life, to abstract away the complexity, irrespective of domain.
"Average user" implies someone not skilled or savvy in the domain you're thinking of. For a medical doctor, the 'average user' is not-a-doctor. For a technologist, the average user is not-a-technologist. For an insurance specialist, an average user is not-an-insurance-specialist. Etc. etc.
The personal assistant, exec assistant or chief of staff are themselves not necessarily experts in any domain, but they do rely on specialists to get stuff done.
So the UI for this killer app is basically voice input, keyboard input, camera input (mirros of human output) in the user's language with natural language interaction, and the output is voice and monitor/screen, and possibly a robotic arm/hand/body (mirrors of human input). Anything more complex than that would require tailoring it to a domain/domains.
If you doubt this analysis, think of all those folks for whom the IE/Chrome icon was/is "The Internet". Sure, you can go one level deeper with having them put in URLs, or operate email through the aol/gmail bookmark or desktop icon, maybe open documents/files from 'My Documents', but are they going to go any deeper than that, for the 'average user'?
You mean UX? Isn't Claude Cowork supposed to be 'Claude but for normies'? As for Claude Code / OpenAI Codex for non-programmers, believe Replit, Loveable, & others are trying & succeeding.
WhatsApp comes to mind in how its sole focus on replacing SMS (rather than Skype/AOL/MSN Messenger/YChat/GChat) meant it had no (user-facing) password/username, no elaborate signup, no login, no chat/friend requests, no sync etc. & became the biggest social network right under the nose of well resourced competitors with worldwide distribution, like Google & Facebook.
Probably phone operators were not impacted too: SMSes bundled with flat plans are still flat plans and Europe style unlimited calls + 100 SMS per month plans are still there and those SMSes are still mostly unused.
So we could have a killer app and yet nothing changes in the flow of money around it.
UX wise, WhatsApp is a big improvement over SMS. Vocal messages, I'm not a fan of them. A waste of my time.
It'll just be power users. We're moving toward a world of significantly fewer analysts and more into "Super SMEs" that can actually learn tools like Claude and manage enormous complexity with them.
Just giving average users these tools will produce garbage. This example from Claude is so contrived and any business analyst can see how a process that requires uploading additional data will fail. You can't expect users that don't even know their own data to be able to make this thing work.
There will be no "average" user in the future. It'll be multi-disciplinary SMEs that are extremely creative and knowledgeable about their businesses.
I think you’re underestimating “average users”. If we talk about the median, then probably you’re right, but if we talk about “the group of people clustered around the average” I think there’s a lot of untapped potential, especially in people who assumed data and programming were unknowable/impossible and have therefore been held back by “good” tools like excel
We're obviously going to be holding ourselves back in terms of scale and in terms of not being a "true" SaaS with this approach, but my thesis is that we get much higher quality results and higher compliance/activation and can charge more for the bespoke model backed by our own platform.
The power of Excel is not what it was. Nor is the power of ordinary thought.
These narrow integrations with specific software suites seems like a dead end.
If you look closely, people we already creating databases and doing computation. But on paper. Spreadsheet software move the medium to the digital and with that brings a lot of convenience. Same with email, instant chat, and shopping on the web. The killer app is not about bringing something new, but making an old problem easy to solve.
The issue with LLMs is that it makes errors. Uncontrollably. And even if you can spot the obvious ones, there’s always some you won’t be able to catch unless you’re a subject expert. I’ve never seen a random people willing to monitor a piece of tech.
If they can build an integrated AI assistant (what Siri should be) that can spin up and call agents it will be big (or it will flop but my money is on big if it’s the easiest way to use agents in your daily life)
I haven't tried it, or know a lot about it, but isn't this the whole claw thing?
ChatGPT/Claude's web ui is much more like something for average user, tbh.
I really thought Airtable would take off because it was even more of a "database that a normal person could actually use".
Learning how to type commands and use a terminal is not something people cannot already learn right now. And that was the way before.
I think the real killer app is making marketing and other non development (non analytical) work better. In case of marketing, we have tried many AI tools for marketing, and so far they mostly make campaigns more generic, less exciting, and often worse. They help a little but you need to careful that they do not to make it worse.
This is probably fine as long as the code is acting on local resources. The moment you have vibe coded software interacting with shared state or database the risk increases exponentially and all it takes to have a bad day is a poorly worded prompt from one of those users.
Some oversight by humans or automated guardrails will probably reduce those instances.
A figma like dashboard for turning ClaudeCode, Gemini Cli, Codex into an OpenClaw but with security measures to break the lethal trifecta while running on a VM.
But it's not quite there in terms of usability. I agree that is the hardest part of the equation. It's something I'm constantly experimenting with and haven't found the solution to it yet. Open to feedback!
excel isnt used because it's a database, it is because you can do things in it in relatively unstructured ways and reference things youve already done with a click. the future of databasing is bringing more spreadsheet UI to the database, not bringing more users away from spreadsheets. with AI i agree there could be some sort of UI that could pop off that leverages it well, but im not sure its going to be t bring users closer to coding. I think it is going to look more like a project management tool than anything else. i mean shit, it might even just be an excel add-on because excel is still where the data is
It's targeted for creatives atm. For the few in private testing, it's been amazing what they're able to do with the little tooling I've given them. It is a legitimate change in their daily drive.
I don't know anyone not building a product in that space
I'm currently doing something like this in the internal model-independent LLM chat app I work on at a F100, specifically targeted at our everyday users. <input type="file" webkitdirectory> lets the user give the model read and write access to a local folder (and OPFS lets us reuse the same fs tools we give the model for files manually attached to the chat, or for files tools want to create if they haven't granted folder access).
Every time we used to release a new version it was "still can't handle the 6MB Excel file I drop into it" when that was being extracted to CSV and added to context - now it can poke about in the big Excel file directly with SheetJS to pull the sheets/headers and inspect the shape of the data, and use locally sandboxed code execution to write code against either extracted data or the spreadsheet itself via SheetJS for pivot tables and such (all locally - none of which need go into the context).
The base models are good enough at tool calling (I really mean Claude, though, the GPTs just go on a tear calling tools with no context for the user) they're already decent at automating stuff for the user without a dedicated harness (our default system prompt is still "You are a helpful AI assistant", lol). Add tools for Graph API stuff, and now it can pull the nightly batch file from a support inbox, unzip the spreadsheet within, diff it against yesterday's and generate an import file for new users and draft an email to welcome them, something that used to be a daily support task (which I'd already automated most of - but now you don't need a dev for this kind of thing). Or go find the big 450,000+ row spreadsheet that's being automated somewhere on SharePoint, pull it down in 150,000 row chunks (Graph Excel REST API limit) and write code to go figure out whatever the user is asking.
Having implemented and used it, I like this setup so much it kinda ruined Claude.ai and ChatGPT.com for me, so I've hooked up similar access for them using a browser extension to add the folder picker input, with the extension talking to a local server to tell it which folder to give access to, and Claude/ChatGPT talking to the same server over MCP via a CloudFlare Tunnel to work with the selected folder.
Think the movie Her 2013. OS1 it's called.
Isn’t that literally Claude’s web UI?
Super early stage but I am really happy to read your comment.
Claude can write code pretty well, but there are just a few tasks that I need to do to orchestrate everything. If it could do those tasks well even some of the time it would be about 10x more useful.
It's called Zenning AI - we're a small team in London, testing it with a few companies at the moment!
Honestly though we are finding that a little FDE to set up pre-bake stuff that’s sufficiently specific to the customer is needed. Otherwise people are like, “I don’t need to close the books, I need to do a per-working-day profitability analysis for 10 EU countries with different public holidays”, and they get stuck there.
> Intuit QuickBooks handles payroll planning, the monthly close, and cash-flow, along with tools to help businesses prepare for tax season, and reconciliation work that touches every other system.
I can't wait for the horror stories, this is going to be fun. Remember last month when Anthropic was like: no, we're not going to refund you even though we admit we're in the wrong for anti-competitively burning credits? These are some of the last things I would trust an LLM with in a small business and on top of it Anthropic has shitty customer support. I will actively be telling prospects to avoid.
This is one of those areas I would spend more time checking the outputs than it would take me to click the button myself.
I run a small business (no employees) and GnuCash was ok. Then I got tired of battling it for years to do certain things.
Spent a few days human coding a command line income and expense tracker a little over a year ago at https://github.com/nickjj/plutus.
I do my estimated quarterly taxes with its assistance in literally 5 minutes. All I do is download the CSV files from my bank and run the reports I'm interested in seeing through it. At the end of the year I run through the full numbers and triple check things in about 10-15 minutes. These numbers give me complete confidence to file my taxes accurately from a business income / expense perspective.
Of course you can use the tool for personal income / expense tracking too. Personal vs business is an arbitrary category name.
Systems like quickbooks, hubspot, payment processors all have tiers where yes on paper they make it easy to properly setup good accounting practices, but you’ll spend an additional 500/month+ to get those features.
Hiring an accountant to clean up the books and do quarterly book keeping is equally as expensive if not more.
Especially for small service based businesses, where margins can be tight, revenue can fluctuate heavily MoM, committing an additional $6k+ per year just to keep books organized is non-trivial.
As an experiment, i gave all our finance data for 2025 to an agent, and it did quite well after spot checking. There may be a middle ground where users can do exports, verify with “real” software, and have agents handle contextual classification to considerably cut down costs
A computer can help you find that problem, but solving it is still a human issue. One of the things people want to know about invoices is which are likely to be paid on time, as some customers consistently delay or attempt to avoid paying.
It's not just about being able to balance Xero but knowing rules, procedures and the way the tax office works.
Full stop.
From the more obvious possible issues: no payroll, massive refund overpayment, legally binding agreement that puts the business at disadvantage.
FWIW, I like the idea, but I sure as fuck would not let LLM touch real money or pieces that can allow to move it around.
I'm quite sure at the time that they said they wouldn't give compensation, not that they wouldn't refund them.
[1] https://www.arte.tv/en/videos/126831-000-A/arte-reportage/
Example: https://www.theguardian.com/world/2024/dec/18/why-former-fac...
Or don’t tell me, if it’s well worth the 24min watch
AGI will solve poverty, btw. Any second now. Just need 500 bil more bro.
In my neck of the woods, B2B invoices are now required to be delivered over the Peppol network in UBL format, which further improves reliability.
Doesn't necessarily eliminate the need for an accountant, because the chosen UBL standard has lots of room for interpretation and ambiguity, and it's impossible to uniformly decide how process an invoice based on the invoice alone (e.g. is this deductible? is this even a business expense at all? which ledger should this go in? etc).
Murphy’s Law is undefeated. Add in a psycophantic hallucination black box to critical business data and you have a recipe for hilarity.
Normies cannot be trusted to hand off these functions to an LLM because they are mostly incapable of verifying the outputs. Worse yet - these tools are actually idiocratizing the masses to the point they don’t even think they need to.
And of course Anthropic will never have any liability for marketing and selling tools that are unfit for purpose.
Not exactly accounting, but ChatGPT (whatever the paid model was in March) told me that paying down principal early would have virtually no effect on interest over the remainder of the loan. It was confused by the fact that it was a short balloon with payments amortized using a 30 year schedule. I did the math by hand to check and told it it was incorrect and it gave me the classic “oh yeah, sorry about that”. It’s the type of thing where for someone that is knowledgeable about the domain, it wouldn’t pass the sniff test. I am not sure if LLMs have a sniff test.
I can’t imagine how hard this will hallucinate when there are layers of accounting, tax codes, etc. But who will notice when it sounds so convinced it is right?
If you thought society was just an imaginary collective delusion before, now it can be collective hallucination too.
The python script is basic enough that even I can figure out what it is doing, and I still have to review the import to GnuCash and reconcile with my bank.
It is saving me about an hour of work every week right now.
I think this is my biggest use of AI - making small tools to do the work locally rather than sending things to the cloud to be stolen and messed with.
Ironically for this thread, I think an AI redesigned website would do wonders here.
GNU cash website immediately tells that it is not a Saas and doesn't need to upsell the latest trendy addons, for it to survive.
It tells that it is not "investing" in marketing to eventually turn a profit.
It is not looking for acquisition opportunities or next funding rounds.
If you want to see what a trustworthy website looks like, take a look at SQLite or postgresql or even this website itself.
No need to have a desktop app to do entry.
Why would I worry about an LLM properly cataloging expenses (book keepers job) when we keep human in the loop with the CPA to check their work?
I think you don’t understand the problem the AI solved/reduced costs on.
IRS is going to make a ton of money off you naive people. Get a better CPA who's not committing malpractice like your current one.
In books of the future, if we ever hold one, I think this will be studied a lot. We have seen before competitions and rivals, but they mostly were rivalry of craft. Here it is a rivalry of velocity and reach. Who can first target user with whatever they have ready to offer.
What competition? To have competiton, you need to have a market. And to have a market, you need to have a well defined product or service. What these guys are offering is a toy, for which they desperately try and invent new potential use cases every week. Metaverse, NFT and Blockchain once again, "supercharged" by trillions of VC money, soon coming for your pension fund too. What could go wrong?
This opens that surface area of attack again, but now on a much larger scale, if not careful
* find invoice I_E for expense E
* associate and categorize E based on I_E and transaction field
These things are annoying but Claude Code is great at it and it leaves a much smaller set I have to manually resolve. This is a class of problems that are tractable and checkable, which I happily use LLMs on. If it miscategorizes it, I'm going to see it because I'm looking over the accounts. In fact, I was previously using a different accounting app which had poor API support, so I dumped it so I could use Claude and it's incredible how much this helps me.
There is an enormous number of use-cases that Claude/GPT are good for and the hard part is market penetration here. As an example, my dad was looking at some statistical health survey data in India and working out what things you could glean from it. Claude identified the things that would complicate his analysis in no time. He's 70 years old, and he'd done it all manually until he asked me (I've got a Mathematics degree) if something made statistical sense to do. I told him what it likely was and then asked him to try Claude. Knocked out his work and mine in moments. But he didn't think to use it. Now I have to get him a ChatGPT/Claude subscription.
It's like how if you go to the Datadog pricing page they don't list a feature set. They have all these use-case lists with prices. You can build things using their base metrics functionality and logs functionality but showing the use-cases must have more adoption.
1. Automated ingestion of hand-written tuition scholarship applications into Google Sheets. Near flawless OCR to structured spreadsheet ingestion and image extraction.
2. Revamped the website completely from a simple static website to a dynamic one which accepts donations (started with Claude Design, handed off to Claude Code). Old: https://csmforchrist.com --- New: https://stage.csmforchrist.com
3. Included sponsorship applicant pages (from #1) to let supporters read profiles and choose who to support through the website (this used to be a fully phone/email process before)
As an aside, it feels great to use AI for something that improves people's lives today.
I wonder if this is going to give text based accounting a boost. Reviewing clearly worded git commits is so much more reassuring then letting an LLM drive your accounting package and hoping it doesn't mess up somewhere.
Interesting, sometimes they want to show you they’ll simply charge 2-3 percent of your monthly spend (https://www.datadoghq.com/pricing/?product=audit-trail#produ...)
This is dangerous. Relying on so much of your business on a third party. We've seen this many times before where businesses get destroyed because something gets broken somewhere that they have outsourced and have no control over.
In my view this service should not be used, unless there is a local llm or clear manual alternative.
Then the question begs - Why use Claude at all?
Maybe a proof of concept only while you come up with a real solution. Maybe to use claude to get rid of Claude
The people who get dazzled by bright lights are going to be the ones licking their wounds later. There is going to be eggs on faces one day.
Must be nice being able to ruthlessly lie with "this is the future" marketing claims, while hiding behind this term of service.
If you don't actually believe in your product's capabilities, why sell it?
It amazes me that we are going to litigate this like they did with cars over horses, or machines vs human labor. I honestly don't think Claude should be running companies.
Anthropic's response: let's make a nice package out of this, and let's target specifically the businesses that are less likely to be ready to manage such horrible events.
Also, small business contracts likely do not have the same type of language around indemnity/SLAs, so it is easier for the harms of this type of system to go unpunished because those who are harmed are even less knowledgeable.
It's just like getting Google support.
Never in my life would I have thought a business with more than 100 employees could be considered small. In the EU the cutoff is 50.
A couple more thoughts here - the hard part is not just the data side of it, it's replaying/unplaying actions. Many actions are non-reversible. Code is clean in the same way that google docs is clean. But for many business processes, some actions just can't be unwound once started. If claude initiates a wire that it shouldn't, no amount of git technology will undo that wire.
Am I too close to AI that this sounds fucking crazy to me? In no world would I give Claude or any AI agent direct write access to financial operations like payouts/settlements.
does "settling" not mean, "writing", ie moving cash around for real
> https://www.media.mit.edu/publications/your-brain-on-chatgpt...
Reviewing automated output is very different from actually doing the task, and results in skill decay and atrophy.
> https://en.wikipedia.org/wiki/Ironies_of_Automation
The gap between write access and humans just rubber stamping output is not much at all.
I can tell you the drag is between your own tools and the real world (which is very messy and inconsistent): taxes, compliance, payroll, amendments, share structures, etc.
Within my island, my books are in order, invoices and time keeping is fully automated, calendars and sales pipelines are connected.
I'm sure there are many businesses whose inner islands are not as orderly. The zillion tools out there all try to bring equanimity to the chaos and yet here we still are with fresh books, quickbooks, and xero...
I scaled to 30+ people with automated administration. My cost was under $150 a month for everything we needed to run a successful consultancy and product business. Our accountant was blown away by how simple his life was.
I'm constantly amazed at how it has gotten much worse in the resulting decade.
E.g traditional automation + humans handling the drag = $4,000 per month with a couple of known blunder each year
vs traditional automation + AI = $400, with unknown number of blunders.
Of course it depends how much a blunder costs, to solve, or swallow. But I would bet that accounting errors even for a small business would cost the business on the long run. And that's assuming we don't yet have adversarial behavior which we can expect to come from both the inside and the outside.
Coders don't all have those kind of security hygiene instincts either
You might be assuming small businesses have less than ten people. That’s a category of small business called a “micro-business” or microenterprise, depending on funding model.
In EU where I'm from the micro/small/medium business sizes are tied to both employee count AND revenue. Micro is below 10 employees and below 2 million € revenue, Small is below 50 employees and below 10 million € revenue, Medium is below 250 employees and 50 million € revenue.
So if you had 100 million revenue you would be a large business even if you had less than ten people.
A skill cannot provide MCPs and can't provide custom template prompts, each skill is it's own slash command.
A plugin you can define N number of custom slash commands, and you can define MCPs as well as skills. So it bundles like all the things together.
By installing a plugin, you are basically installing a bunch of MCPs, skills and custom slash command prompts.
Feels like they're just using LLMs to produce enormous levels of output, without understanding that quantity ≠ quality.
Payroll/reconciliation is already a couple of clicks and 2 humans sign off. A 'morning brief', well lol.
'Growth', how would you not know your numbers as an SMB? Everything is already in a tool with dashboards and reports for people to act on.
Also, I have zero confidence in the example prompt.
This all seems incredibly uninspired.
Now I have claude hooked up to a dozen projects I used to maintain manually. It is such a pleasure watch it read the complaint and go to town on small problems without dropping any databases or removing home dirs.
My point being, they know they need to make a viable business, and they've clearly seen demand. Meaning there are already a lot of small businesses trying to use Claude to do these things.
Given what they have I wouldn't be surprised if they setup a pipeline of niche toolsets that they can spin up in response to mass user prompting.
Not a pretty future for SaaS and side hustles.
Since the "grand" idea is that all they need is the "god model with infinite parameters requiring infinite energy", the business model will align there.
I know that Google, Atlassian, Microsoft et al have been having access to our emails and online docs for a while… it just strikes me as naive to now sharing everything by default to a single company just like that. They are not just training on internal business data, I would imagine they also have plans to monetise it somehow
Ps.: see http://www.bricklin.com/firstspreadsheetquestion.htm on whether VisiCalc was the first or not.
Just today there are 3 stories on front page about Claude--seems to me someones PR is working overtime
My guess is that they are trying to increase the cost of switching as much as they possibly can before the VC subsidies run out and they have to 10x their prices.
Possibly, could also just mean that they've internalized the bitter lesson. https://www.cs.utexas.edu/~eunsol/courses/data/bitter_lesson...
My experience running a few LTDs is that there is a gap between the accountants and what you need, and running an SME business means you are too busy not to do stupid things and the net effect is lost productivity, less entrepreneurial activity and less growth overall. Dealing with VAT, PAYE, and a million other stupid small things prevents most people from succeeding at running an effective business.
Claude and OpenAI have been surveyed to be most impactful to SMEs, and I think it’s only going to accelerate.
Hopefully this is hugely positive, I see risks, but I don’t see real societal downsides if people get AI to make their basic business operations better, cheaper and most importantly simpler and easier.
These takes are so uninformed. We live in a country completely captured by the multi-million dollar advertising campaigns that are meant to make us behave in whatever way makes the 1500 richest people the most amount of money possible.
That's rich. What public benefit mission? The benefit of extracting money from the public?
Have you ever run a business?
These "people" fundamentally misunderstand how tech illiterate the average person is and don't care about AI outside it appearing in their search results as an occasional convenience. My Mom (in her 50s) heard about ChatGPT for the first time this month and doesn't care about it, nor eager to figure it out.
Small business owners are not going to put their life's work in the hands of AI, they don't even trust the most basic versions of it and they're certainly not going to use "agents", and the ones that do trust it are naively going to overly trust it because the faulty marketing from these companies and very bad things are going to happen.
I know they are trying to get their product to fit-in & justify the massive valuations.
but this ain't it - just like the other Claude for ** -- the market doesn't exist.
if they spoke to small businesses they would know their problems are either around marketing or data.
You wouldn't operate in any startup having a public camera streaming your laptop screens, most intimate tactical conversations and strategy to the open internet - so why give that data to companies who's sole goal is to make money?
eSignatures are probably in the top 10 at risk businesses right now. I did eSignatures before, and even I didn't buy a project just vibe coded the signature piece for a recent product.
Good validation that this is indeed a space the frontier firms are thinking about along similar lines.
Large companies can navigate the waters with teams of lawyers and accountants.
I don't think it will be possible to run a small business without AI in the near future, as the complexity of the law will increase beyond any comprehension.
I don't run a small business myself, but I assume the scope of administrative tasks in such company is well defined and understood.
Turns out Anthropic is pivoting so fast that they're doing all the 'Claude for X' themselves.
Surely 'Claude for Cheese' is soon.
Inspiring quote there.
The funny part is Opus was the one which generated the files in the first place. This was Opus 4.7 High. So no thank you, Anthropic.
If you want to help SMB, stop with the interconnectivity hype of bringing outrageously expensive software together. Try making something that really helps instead of syphens more money and hurts the workforce. Seriously, what's Claude going to do for a landscaper using pen-and-paper anyway? That's the majority of your SMB. The grifting MSPs are your target for this bs.
Remote LLM's should be prohibited ASAP.
It has never been easier to give Claude a list of tools you want in your stack and have it get them up and running on your own server, including audits against exploits.
I want that Claude for small businesses, even though I understand why partnering with these other companies is the better revenue play.
SFDC announces "Headless CRM" and Anthropic is like meh.
Our company supports small teams in Germany with the use of agentic AI. We're guinea pigging this on ourselves. There is a lot of friction taking AI into use right now for people who aren't developers. Most tools are aimed at developers and are useless without a lot of complicated hoops that you need to jump through to connect stuff, deal with permissions, etc.
I'm seeing a wider issue that OpenAI and Anthropic seem to just have a few blindspots when it comes to dealing with UX topics and product management. Anthropic seems a bit ahead but not much on supporting business users. But not by a lot.
I'm more familiar with the OpenAI side. I'm a developer, so I can work around it. But I've been onboarding our non developer CEO and friend to codex so he can actually get shit done and it's not been pretty. He's constantly fighting with trying to wrap his head around repositories, git, having to edit small text files, etc.
Despite all this, it's hugely empowering for him to be using codex. I got him working on our website directly (content and design), he has managed to get his inbox hooked up and our google drive. He's working on presentations, sales offers, CRM topics, accounting topics, and more. Not your typical programmer centric topics (aside from the website). It's OK, he's smart enough. But I'd hate to go through this with junior business interns.
The key challenge I see is company level guardrails and skills and permission hell. I got our CEO on codex because in ChatGPT can't use tools or skills. And you need both to get productive. So Codex is the only option right now (in OpenAI). Claude Cowork and Claude for Small Businesses is a good move.
Skills are where you can express organization specific rules, processes, etc. Simple things like when dealing with gmail, don't send emails and only create drafts. Because we want people approving the final email that gets send, always. We have a growing number of those that are specific to our company and tools.
Another challenge I see is dealing with team collaboration tools and AI. We currently have these weird 1 on 1 tools where you have session with an agent to do stuff. But collaborating with more people requires proper team chat tools. That does not exist currently. I have some internal experimental setup involving Matrix, OpenClaw, and some skills that actually is super useful for this. But I would not recommend that for obvious security reasons.
Another challenge is that most things you'd want to connect seem to be completely unprepared for this. This is an industry wide problem that seems to affect most SAAS products with very few exceptions. Existing data silos are going to be connected to AI tools and this is going to escalate fast. So far, there's a lot of mumbling about APIs, cli tools, and not much else. However, most of these products are completely unprepared for an influx of business users wanting to do productive stuff with these tools and AI. There is going to be a lot of friction there and I think a few SAAS companies seem incapable at this point of adjusting their roadmaps and fighting their reflex to deny access to absolutely everything and protect their walled gardens. I think it's going to be a blood bath in that market with customers and users jumping ship to more AI ready alternatives.
We're only four years in to this revolution but especially with Google their level of preparedness with Google Workspace for this is shockingly poor. Gmail access is essentially all or nothing currently. That's going to cause issues. I don't think MS is much further in their thinking. And these two are some of the more clued in companies in the AI space given that they funded and invented most of it.