It's important to note that in most jurisdictions you can't actually do this legally. Like, you may be able to get away with it, but it is actually illegal to sell financial services by misrepresentation.
"They will have $1B in revenue by year end" is perfectly fine to completely make up.
That domain knowledge is acquired by talking to people, which AI can't do. All kinds of people, since the knowledge isn't written down.
I know this from having dated a girl who did M&A deals for media properties: you know, your big TV shows, movies, etc.
It might be lower stakes, but isn't that still a juicy target for data-exfiltration attacks?
In other words, imagine if one of your direct competitors was watching everything your employee read while making spreadsheets and slideshows.
If AI is really as wondrous as everybody says, why didn't all the employees of all the AI companies simply type "Claude, file my taxes for me" as a prompt and walk away?
Luckily there is still a significant market for the services.
Currently we don't know the risk, so it is kind of hard to absorb.
Why, they can sell user data to other brokers. Experts indeed! But not in insurance or finance, of course.
But there's a process risk here based on their current practices. I'm hoping those practices change so that I can recommend Claude to everyone I know, but as of now, there's existential risk exposure here that's greater than Google's.
Anthropic's automated systems can and will ban you for pretty arbitrary things, and you won't get human support or Claude, even if you are an enterprise paying through the nose. And there's zero redress unless you go viral on social media. Or know someone who knows someone. See: https://x.com/Whizz_ai/status/2051180043355967802 https://x.com/theo/status/2045618854932734260
And I say that as someone who likes how Anthropic has been training Claude and Opus. I just don't think they're prepared to be the trillion dollar company they've become. They are – in a very real way – suffering from success. Which is extremely inconvenient to be on the receiving end of when you're on a deadline.
Code review has become unbearable because, before AI, developers were reviewing code as they wrote it in the first place. Granted, that was never perfect, which is why a second person reviewing code was (is?) a best practice. But effectively there was always some level of code review happening as developers wrote code.
I fear it is way more boring to review financial and medical documents completely written by AI than it is to write (and simultaneously review) them yourself. And it's way more dangerous to ship mistakes than in most software.
But more often than not that developer ends up reviewing far more lines of code due to the typical verbosity of an LLM.
The analysis itself? I'm doing it by hand.
Far too often people think productivity is the point. Maybe the point is that the developers' understanding of the product IS the product?
You're not engineering black boxes, you're engineering legible boxes.
Here are some of the horrible things I've seen. A frontend dashboard with PHI/PII deployed via Vercel/Next.js because AI told them how to get their site online. The login is hardcoded into the frontend, so anyone with inspect can find the password.
Another "fixed" dashboard deployed the same way. This time they added Firebase Auth, so they got Sign in with Google, supposedly restricted to our domain. Wait, how would they be able to create a token for our domain? They didn't; the frontend just blocks other domains from calling firebase.auth, but Firebase doesn't care. So simply calling the function in the console lets me log in with any Gmail account...
They also were showing me their RBAC with Firebase. Again, they don't have access to our Organization/Directory/Groups, so I wondered how they did this. Wouldn't you guess, it's a hardcoded list of approved users. You can literally call firebase.auth and sign in anonymously. Again, only the frontend checks the email addresses. So now that I have a Firebase auth, all the backend Firebase functions just check that you have auth'd, and I can make any request I want to the backend. The frontend simply won't show me the code.
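For what it's worth, the fix has to live server-side, not in frontend checks. A minimal sketch of Firestore security rules that reject anonymous sessions and enforce a domain check at the database layer (example.com stands in for the real domain; a real deployment would more likely use custom claims or a backend verification step):

```
rules_version = '2';
service cloud.firestore {
  match /databases/{database}/documents {
    match /{doc=**} {
      // A frontend-only email check can be bypassed from the browser
      // console; these conditions are evaluated by Firebase itself.
      allow read, write: if request.auth != null
        && request.auth.token.firebase.sign_in_provider != 'anonymous'
        && request.auth.token.email_verified == true
        && request.auth.token.email.matches('.*@example[.]com');
    }
  }
}
```

With rules like these, the anonymous `firebase.auth` session described above gets a permission-denied error no matter what the frontend shows or hides.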
I could go on and on about the stupidity levels I'm facing but I don't feel like crashing out.
All I can say is this tool is only useful if you already know how to correctly implement these things. Does it save me time? Sure, but I have to call out its mistakes and explain why not to do things. Honestly, I feel like Claude is good for people who like to gamble. When it gets it right it feels great, but I don't want to roll the dice 30 times to get it correct.
Sadly this sounds like par for the course when it comes to tech. Too many messages and requests for help depend on knowing someone in the right slack groups.
The templates being: pitch builder, meeting preparer, earnings reviewer, model builder, market researcher, valuation reviewer, general ledger reconciler, month-end closer, statement auditor, KYC (Know Your Customer) screener.
Seems pretty scattershot. Reminds me of GPT Store.
Any idea how they ensure this doesn't happen? As in, how can a user verify that the model did not touch any of the numbers and that it only built pipelines for them?
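One cheap, deterministic spot check (not a full answer to the question): diff the set of numeric literals in the source data against the model's output, so any altered or invented figure is flagged. A rough sketch, with made-up strings standing in for the real documents:

```python
import re

def numbers_in(text):
    # Extract every numeric literal so source and output can be compared.
    return sorted(re.findall(r"-?\d+(?:\.\d+)?", text))

source = "revenue 1250.75; costs 980.10"
output = "Revenue of 1250.75 against costs of 980.10"

# If the model changed or invented a figure, the two sets won't match.
print(numbers_in(source) == numbers_in(output))  # True
```

This obviously doesn't prove the pipeline logic is right, only that no number was silently rewritten on the way through.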
What I've been telling my CFO, who wants to get AI involved in things, is that for a lot of accounting and finance work "trust but verify" doesn't work, because verifying is often the same process as doing the work.
Build a deterministic query set and automate it for monthly or daily reporting reconciliation.
Leave AI out of it.
How do you verify that all the tariffs are properly allocated to the correct GL code without going through the invoices and checking for each tariff on the list? How do you make sure none were accidentally assigned to other GL codes? All you have is PDFs; you don't know what the AI did or didn't do with the info in the PDFs, and there are not many ways to catch its errors without doing the work yourself.
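The deterministic check suggested above can be sketched in a few lines: join extracted invoice tariff lines against ledger postings by key and flag anything not posted to the expected GL code. All the names, codes, and amounts here are illustrative:

```python
# Minimal sketch of a deterministic tariff-to-GL reconciliation.
EXPECTED_TARIFF_GL = "2310"  # hypothetical GL code for tariff expense

invoice_tariffs = [  # (invoice_id, amount) extracted from invoices
    ("INV-001", 120.00),
    ("INV-002", 75.50),
    ("INV-003", 10.00),
]

gl_postings = [  # (invoice_id, gl_code, amount) from the ledger
    ("INV-001", "2310", 120.00),
    ("INV-002", "5100", 75.50),   # misposted to a freight account
    # INV-003 was never posted at all
]

def reconcile(invoice_tariffs, gl_postings, expected_gl=EXPECTED_TARIFF_GL):
    # Index postings by (invoice, gl_code) and flag every tariff line
    # that is missing from, or mismatched in, the expected GL account.
    posted = {(inv, gl): amt for inv, gl, amt in gl_postings}
    exceptions = []
    for inv, amt in invoice_tariffs:
        if posted.get((inv, expected_gl)) != amt:
            exceptions.append(inv)
    return exceptions

print(reconcile(invoice_tariffs, gl_postings))  # ['INV-002', 'INV-003']
```

Run the same query set every close and you get a stable exception list to work through, with no model in the loop.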
If anything, it's going to add a step to these "kids'" work, where they have to use the AI to do the work and then redo 90% of the work just to verify the output, and then the AI is going to get the credit anyway.
Or the overworked people are going to use AI and not verify it, which means not catching any errors or hallucinations, which apparently is fine because someone claims it's a solved problem for the black box of infinite possibility and inconsistent output.
I feel like there’s a metaphor in there... maybe I’ll ask Claude about it.
My money's on that.
I’ve also had some great results with a /reflect skill that asks the agent to look at the work in the broader context of the project. But those are the only two skills I use regularly that aren’t specific to our company, codebase, or tools.
The AI is an expert in both following and generating prompts.
It seems the initial product footprint tries to sidestep this problem by not giving the agents control over whom to lend to or which applications to approve. Even so, I think it's quite an optimistic read on their end. Happy to share reports with anyone who's interested (montana@latentevals.com), especially if you work at a frontier model lab and are interested in plugging my evals into your RL systems!
All I did was upgrade Claude Code and use the new model. It most definitely exhibits misaligned behavior (compared to 4.6).
I assume that 4.6 will become unavailable at some point, but I hope not any time soon. 4.7 hit usage limits faster, didn't do anything obviously better, and had more annoying behaviors in other aspects. I don't know if this is strictly a model issue or if there are also problems with how it's harnessed through Claude Code. I'm not willing to spend more time digging into it until I'm forced to.
This probably killed a thousand startups in this space.
In the early internet you wouldn't see Google creating their own news site or Facebook building their own animal farm. What happened to the platformication of everything?
I have given up on trying to get through to him how bad of an idea this is. He's unemployed and has been working on this for over a year.
Y Combinator is accepting applications for the Summer 2026 Batch funding cycle. Make sure they don't miss out!
Before, some idiot would pitch their stupid idea to dozens of local web-dev companies and banks and get told, dozens of times, that their idea is straight-up stupid and never going to work, and that they are stupid.
Now these LLMs allow them to bypass all of that advice and create what they want without any input, or even knowing how the tech behind it works.
We are so fucked lol
No, why would they if they have the choice?
> what happened to platformication of everything?
Business happened. The web works differently from how it used to. The users are different. LLM inference and AI tools are a different core product from search and ads. That, and we have the benefit of hindsight now. Maybe a Google newsroom would've actually been a good idea in 2006, in hindsight; who knows.
Also realistically you could say the same thing about Google Maps and Street View. That probably also killed some startups. Google isn't running a charity for startups.
They are also fighting for their lives because these insane valuations simply aren’t justified by being dumb pipes. Fortunately, open weights models are widely available and have crossed a threshold of usefulness that cements their place as good substitutes.
The issue with that is obviously that most of the generated value would be captured by that company in the middle, while Anthropic would stay in the cost-conscious inference market.
We're not talking about what is best for the consumer (e.g., more competition to force iteration and improvement), but about what Anthropic thinks is best for Anthropic.
Is this a serious question?
Without the big labs with deep pockets investing to change the consumer mindset, do you think a small company with no funding has any chance of even existing?
I remember when paying $1.99 for a mobile game on iOS was considered too expensive, and now it seems most consumers are primed to spend more on in-app purchases every week. That mind-shift did not happen overnight.
It was not that long ago that $200 for a ChatGPT subscription was considered extravagant, but now even wrappers can charge this price without hesitation, and some of them do.
What Anthropic is doing is priming a market of which they will potentially be one of the main beneficiaries, as long as they can continue existing. But I don't think anyone will go to Anthropic directly to source their financial services agent. They will go to financial service companies that use Anthropic to build the capabilities.
I think someone stated it clearly: they can't take on these kinds of businesses until they build out the risk side and the personnel, all of which is a human problem, not a tech one. A lot of processes still require physical steps and backstops because it's not possible to source all the data needed to act on them in the first place. Then you have audits and reconciliations, and a bunch of strict workflow rules and atomicity requirements to reach the level of software that bigger financial institutions would accept.
My gut reaction to stuff like this is a mix of "oh shit, they could take over my company" and "they're the next script kiddie who thinks software is anywhere near a majority of the work in some software spaces".
Yes they can? They have infinitely more cash to pay off any risk. What do you need personnel for, besides sign-off, if the AI does it right?
Will Anthropic externalize the risk, selling access to agents? Or will it internalize the risk and liability, selling financial services? Maybe both? I guess lots of companies want both: doing some things internally and keeping other things at arm's length by outsourcing to third-party accountants.
Google News was definitely a thing (and actually still exists).
Unfortunately no.
The TAM for Anthropic and OpenAI is anything that runs software or has a screen.
Any software or technology business with high margins that Anthropic and OpenAI are not already in will be a target.
After both their IPOs, Wall Street will mandate that they push for more growth by competing in other technology business areas, or they will get punished in the markets.
It is ROI or bust.
There was an app for OS X that added window snapping, long before Apple added it to their desktop environment. $5 or something for a feature that just makes sense to build into the product from the start. Apple is king at eventually absorbing this sort of paid add-on. AI makes that faster.
Less cynically, you might say that "use AI to do <obvious thing>" is not really a viable startup pitch anymore. That's not necessarily bad.
What's even sadder is it can work for way too long.
The car industry, oil and gas… all could have played out differently if different players had gained wider adoption or if governments used a different economic model.
There isn't going to be any moat for the hosted providers besides hardware scale. They can run your request on shared 1TB memory hardware, or whatever.
But local hardware is going to catch up, the hosted providers are going to become commoditized, and the costs are just going to be compute, whether it's your hardware or theirs.
And your laptop is going to be powerful enough to be good enough for most cases.
Building is the easy part. There is a lot of service-level stuff that I am sure Anthropic will not be able to provide, which is why they are trying to partner with other orgs in that realm.
I am very skeptical about their stuff now.
If you are a builder, I believe you should avoid Anthropic; it can default to monopolistic behavior. I am not saying they are doing it, but they could: they see what you are building, and if you have traction, they position a product in that realm. Just saying.
If you can’t prove PMF and differentiation with $10m, I’m sorry but you’re not a serious enterprise.
And if what you’re building is “pitch deck AI”, I mean, come on.
This is an attempt to inflate token generation to fool people into increasing Anthropic's valuation.
I've really only seen it used for research/exploration thus far, either for economic research slide decks or for exploring trading hypotheses.
Though we've had a few incidents where employees have submitted AI-generated receipts for reimbursement, which is another issue...
I'm in that space so naturally interested in what people are up to :)
Nowhere near self sufficient tools though, just great to answer questions over the data that would usually take a few hours of custom scripting/excel. I wouldn't trust our stakeholders using AI directly either, being frank.
For research and thesis evaluation, we're observing that firms (names we all know) are bullish and even eager to try AI products.
Regarding automated asset management and the like, there is indeed much more apprehension.
> I've really only seen it used for research / exploration thus far
Summaries and translation for sure.
Speaking with devs in the field, I know that AI tools are used to summarize and extract data from... PDFs. Now, thankfully, LLMs got better at answering "How many 'r's in 'strawberry'?" and it looks like they're good enough for summarizing PDFs and extracting key numbers, but I'd still be cautious.
And I've got a friend who's a translator specifically for financial documents. She's a contractor getting about 1/10th of the work (and 1/10th of the pay) she used to have, because now she's only tasked with verifying that the translations are correct. Of course she already had lots of tools, way before the LLM era, automating some of her work, but she was still billing for the use of those tools. Now LLMs are doing nearly all the work, and not "for her": it happens upstream, and she only gets the output of the LLMs and has to verify it. And there aren't that many errors.
https://www.bloomberg.com/professional/insights/press-announ...
It feels like juggling pipe bombs and I have a ton of empathy for the teams being pressured by the business to roll them out with no appreciation for the regulatory rat's nest that ensues.
More industry exposure to well-managed agentic experiences will create oodles of opportunities to reduce premiums for consumers and offset some inflation-driven increases in the cost of coverage.
However, the result (the Excel spreadsheet) looks different each time you run it, which is annoying when you run it at the end of each month.
By the way, this is not surprising when you look at how little detail the skills contain.
Just yesterday I told a colleague that he should buy some of their vests for his company :-D
"ready-to-run agent templates for the most time-consuming work in financial services: building pitchbooks, screening KYC files, and closing the books at month-end"
Ok, maybe you can squeeze a vaguely passable pitchbook out of Claude.
But screening KYC files or closing books at month-end?
"I'll have some of what they're smoking" as the cool kids say.
No regulator or tax office on this planet is going to accept the "but Claude said it was ok" excuse.
The only people who are going to profit out of this are Anthropic, Lawyers and Governments (through increased fines).
LLMs do not change the equation all that much: the human ability to imagine is the scarcest resource on the planet, and LLMs will not help all that much with it.
Better Call Saul when (not if) it does.
https://www.lawnext.com/2025/05/ai-hallucinations-strike-aga...
Why didn’t I think of that.
Is the plan to have an LLM do everything? And do it worse?
"Oh yeah my Claude didn't agree with the pitch from their Claude"
The goal of current tech is to make humanity a gerbil running on a Claude wheel
I don't necessarily disagree with that but doing it through LinkedIn slop companies? Come on man you know better than that
What I predict instead is that we will have a common UI layer plugin and a "protocol" that can speak to UI elements; this might be more composable.
As someone who has been interviewing lately, I think this is the next step after leetcode and whiteboard style interviews.
2. I’m almost certainly talking about health insurance, made obvious by you even mentioning that. There’s a HN guideline about discussing in good faith.
3. I find it humorous you hand-wave away our inhuman healthcare system as “for a variety of reasons”.
4. I see your career is in hedge funds, defense, and big tech. Best of luck ;)
As mentioned the problems with the US healthcare system are numerous, complex, and interrelated. I don't think they have a simple solution, nor do I think they are insurance problems at their core. For example the cost of drugs in the US vs the rest of the world has very little to do with insurance.