SQL patterns I use to catch transaction fraud

(analytics.fixelsmith.com)

315 points · redbell · 1d ago · 127 comments

jstanley1d ago

> Real cardholders almost never buy something for exactly $1.00. Coffee is $4.73, gas is $52.81. The roundness is the signal.

Surely this depends on how the vendor sets their prices? If you're going to buy something from a website to test a stolen credit card you don't just get to make up your own prices.

And I think you may be over-indexing on the US "prices don't include tax" thing. Elsewhere, round-number prices are extremely common.

In fact a lot of the rest of the stuff in the post seems like it wouldn't work very well either. (E.g. you're flagging anyone who has done a transaction in the last 90 days outside the range of hours at which they have 2+ transactions? Wouldn't that be like 50% of people?).

It's unclear to me whether this article is an attempt at breaking down complex expertise into over-simplified SQL queries, or whether it is all speculative and made up.

There is a conflict between "Six SQL patterns I use to catch transaction fraud" and "Nothing here comes from anything I’ve actually worked on or seen".
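The roundness heuristic quoted at the top of this comment can be sketched in a few lines. This is only an illustration under stated assumptions, not the article's query: the function name `is_round_amount`, the sample charges, and the choice to treat "round" as "no fractional currency units" are all made up here. It shows the objection in practice: tax-inclusive or preset prices trip the check just as a card-testing charge does.

```python
from decimal import Decimal

def is_round_amount(amount: Decimal) -> bool:
    """Return True when the amount has no fractional currency units."""
    return amount == amount.to_integral_value()

# Hypothetical sample charges: (description, amount).
charges = [
    ("card test", Decimal("1.00")),
    ("US coffee with tax", Decimal("4.73")),
    ("bar minimum spend", Decimal("5.00")),
    ("prepaid fuel", Decimal("50.00")),
]

# Everything with a whole-unit amount is flagged, legitimate or not.
flagged = [desc for desc, amount in charges if is_round_amount(amount)]
```

Three of the four sample charges are flagged, and two of those are ordinary purchases, so on its own the check says little without merchant and region context.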

nswango1d ago

The "transaction outside usual hour range" seems pretty basic.

I don't usually buy gas, coffee or snacks at 2am. But on the very rare occasion that I do, I'm dealing with some kind of personal emergency and don't also want to have to call my bank.

I get that that's also a time opportunistic thieves, etc, might be operating. But the cost of false positives is also a thing.
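A rough sketch of the "outside usual hour range" rule as the thread describes it: the usual range spans the hours in which a cardholder has two or more past transactions, and anything outside that span is flagged. The function names, the `min_count` parameter, and the sample history are assumptions for illustration, and the simple min/max range ignores wrap-around at midnight.

```python
from collections import Counter

def usual_hour_range(history_hours, min_count=2):
    """Return (earliest, latest) hour seen at least min_count times, or None."""
    counts = Counter(history_hours)
    usual = [hour for hour, n in counts.items() if n >= min_count]
    if not usual:
        return None  # no baseline yet, e.g. a new account
    return min(usual), max(usual)

def is_outside_usual_hours(hour, history_hours):
    """Flag a transaction hour that falls outside the cardholder's usual range."""
    hour_range = usual_hour_range(history_hours)
    if hour_range is None:
        return False
    earliest, latest = hour_range
    return not (earliest <= hour <= latest)

# Hypothetical 90-day history, as hours of the day (0-23).
history = [8, 8, 9, 12, 12, 13, 18, 18, 19]
```

With this history the usual range is 8 to 18, so a 2am purchase is flagged, but so is a 7pm purchase that the cardholder has already made once. That is the false-positive cost raised above.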

normie30001d ago

Worse than that.

Coffee usually _is_ a round number in my experience, and I know of people who aim for round numbers when filling their car, and of fuel stations which require a pre-set value, often 10, 20, 50€ etc

sheept1d ago

Yes, as your parent comment points out, the article centers itself on US transactions, where listed prices seldom include tax and are frequently a cent below a round number. For example, the menu says a dish is $15.00 but the restaurant charges $18.83 after tax and tip. Globally, there's no doubt the US is the exception rather than the norm.

panflute1d ago

That sounds reasonable for some states but 5 states have no sales tax and many states have exclusions to sales tax. Many of those are also likely to have rural areas where small businesses like to use even amounts.

Niksko1d ago

All of that is easy to account for, all of the metadata you need is available. This also applies to the sibling comment about rounding up to charity at the grocery store, the data is all there, even if it's e.g. the fraud analyst at the bank or credit card company instead of the fraud analyst at the grocery store.

normie30001d ago

I don't need to account for it - I'm just stating that this doesn't match my experience:

> Real cardholders almost never buy something for exactly $1.00. Coffee is $4.73, gas is $52.81. The roundness is the signal.

LandR1d ago

Yeah I was in a bar one night and was peckish, so tried to buy a packet of crisps. They said minimum spend on card was £5, so I said just charge me the £5 it's fine.

Card got blocked as they thought it was fraud. Annoying! And not something inebriated me wanted to deal with at 2am.

Ok. Maybe they protected me from myself, but still!

mike_hock1d ago

This is also

a) trivial to bypass by adding dither to the test transactions and

b) trivial to improve upon with proper statistical analysis and

c) shouldn't this kind of heuristic pattern recognition with no expectation of near-100% accuracy be what AI is good at?

themafia1d ago

I'm seeing a few stores here and there which have a "round up to donate" option. I guess I'm a bit of a sucker and I always use that option. My groceries are always a round number as a result.

time4tea1d ago

I've always suspected that this is all of: a tax dodge, a money spinner, and a PR exercise ("we gave xxx to charity" - no, your customers did).

Just set up a direct debit to your favourite charity.

nubg23h ago

The article is LLM-generated.

relevant_stats1d ago

Reading this to the very end uncovers empty and contradictory advice. I'm almost sure it's LLM generated.

We learn simultaneously that 'your team' shouldn't rely on any one of those patterns ('none of them is enough'), but that pattern 1 'alone will surface a useful amount of fraud'.

We also read strange sentences like "Every analyst on your team will use them (ie window functions) once they exist, and adding the next fraud pattern stops being a project. [end of paragraph]"

Or irrelevant discussions about how filtering by "IS NULL" might not be applicable when almost none of the provided examples uses it (and the one which does uses it in a different context).

This is low quality and too long.

Kwpolska1d ago

> Border crossings inside 10 minutes. International rings.

Or normal people living in Europe in border-adjacent areas.

Also, I guess you don't include card-not-present transactions in this, but you incorrectly assume that every merchant has their location set correctly. And that every sale happens in a brick-and-mortar establishment, not from travelling salespeople or whatever. And that all transactions happen online.

compel21601d ago

Even US -> CA was maybe 10 min when I crossed a few weeks ago...

weird-eye-issue1d ago

Is it possible that you are actually in a transnational crime syndicate without knowing it?

reconnecting1d ago

Hacker News, we need to talk!

"Fixel Smith" is an AI-generated person, with an article that has very little to do with fraud analysis. 'This' is also a music artist (1), novelist (2), fraud analyst (3), influencer (4), and whatever else you can imagine.

220+ points and 70 comments, and very few notice it's quite a fake post — and no one that it's an AI generated person?

1. https://www.amazon.it/Forged-Soundtrack-Explicit-Fixel-Smith...

2. https://fixelsmith.com

3. https://analytics.fixelsmith.com/

4. https://www.instagram.com/fixeltales/

relevant_stats1d ago

Hacker News has recently developed a frustrating habit of upvoting such low-quality, sloppy AI submissions.

Makes me wonder if this AI flood uncovers the unflattering truth about this community's acuteness, or whether it's only a failure of existing guardrails and we just need to change them.

Igrom1d ago

I was checking the submission on the phone and only peeked at the comments section. While it's not always easy to judge if something is AI-generated or edited, here it was obvious at first glance from the quotes. Assuming that all of the comments were done in good faith, I think that the low AI literacy even here is really concerning.

geoduck1423h ago

>Assuming that all of the comments were done in good faith

Well, sure. But some people come here just for the comments and don't read the articles.

diatone1d ago

A cursory glance does make it appear like either a prolific individual, or a bot. The fact that the novel bears little relation to the analytics posts, which seem to bear the style of LLM prose, makes the whole thing fishy. Ironic given the subject matter of TFA

tdeck1d ago

I'd be more surprised to hear that most folks made a habit of investigating the people whose articles we read. To be honest, I usually don't even look at the byline, let alone the rest of the website.

reconnecting1d ago

I'm one of the creators of an open-source security framework (1). I've been eating online fraud for breakfast for 8 years. The article is delusional enough that I had to visit the top page of the domain (2).

1. https://github.com/tirrenotechnologies/tirreno

2. https://fixelsmith.com

hju22_-31d ago

I could imagine a person having or doing all of these over time, people do have many interests, but a cursory glance does give an impression of AI. The Instagram account uses a lot of it at least, and the top domain was likely made in conjunction with AI, given the style.

Kind of fascinating, though it could still be a person doing this using AI as opposed to an entirely generated persona. Thanks for bringing it up.

reconnecting1d ago

If it looks like an AI, writes about fraud like an AI, and sings like an AI, then it's probably not a duck.

reconnecting1d ago

We develop tirreno (1), an open-source security framework.

I question the described approaches. For example, while impossible travel is a legitimate and widely used technique, it's related to online user behaviour based on IP address. Moreover, tirreno, for example, has separate rules for cases where the IP clearly comes from Apple Relay or VPN/Tor — those are separate flags. I assume some or all examples are LLM-generated, as the context is mixed up and no one actually collects GPS location in bulk for card swipes.

1. https://github.com/tirrenotechnologies/tirreno

jwr23h ago

After 35 years of building software systems I've learned to temper my hubris. These days I rarely assume things to be "definitely true".

For example "Impossible travel": these days you can add your credit card to your phone and use Apple Pay. Well, this is useful for many things, one of them being adding your credit card to your kid's (teenager) phone, so that your kid can use your card in case of need/emergency when they are away from you. I did exactly that recently and actually worried about fraud control systems when my child paid using my card in Boston while I was in Europe.

Many things which you think are true might not be.

Anecdotally, US banks are terrible at building fraud control systems. It seems US banks assume any transaction that is charged by an entity outside the US is fraud. In my 10-year history of running a SaaS, the US banks and their "fraud control" systems have been one of the biggest billing problems.

VladVladikoff23h ago

Apple Pay & Google Wallet are actually considered lower risk by the card brands than other card transactions because Apple and Google have so much tracking and biometrics on you: the phone must be unlocked and a PIN entered to pay. My company gets lower rates on these types of transactions than on regular card transactions. Fraud is paid for by the fees, so less fraud for a transaction type means lower merchant rates. So the transactions your kid does on their phone are likely flying way below the threshold needed to trigger fraud controls, even if they hit one trigger like “impossible travel distance”.

VladVladikoff23h ago

>Anecdotally, US banks are terrible at building fraud control systems. It seems US banks assume any transaction that is charged by an entity outside the US is fraud. In my 10-year history of running a SaaS, the US banks and their "fraud control" systems have been one of the biggest billing problems.

This rings so true. As a Canadian company I am SO TIRED of US banks flagging our transactions as fraud. We have done so much to try to prevent it too. We have a mail forwarding office address in the US, a bank account in USD in the US registered to that address, and the merchant account tied to that charging in USD, and still we get these fraud flags. And we’re over the 10 year mark now, I think almost 15. You would think we would have built up some trust at these banks, but nope.

My next biggest hassle lately is that we are a “tokenize and bill later” type service, and we don’t charge the exact same amount every month; it depends on the user’s incurred charges in that period. And lately it seems most Americans leave their cards permanently locked, and only unlock them to allow a charge. This means most of our charges decline initially until the user unlocks their card and retries the payment. A real support headache. If anyone has a fix to either of these problems I would pay good money for it.

gosub10022h ago

The card processor collects a lot of data, presumably they would have a flag whether the card was used via a phone or real plastic. I suspect the "card used in 2 locations" thing is pretty old. Cards are supposed to have switched to chip for many years now. AFAIK magstripes are the only ones that can be cloned.

enoent1d ago

> Fraud detection in transaction data is mostly SQL. Not machine learning, not graph databases, not whatever Gartner is hyping this year. SQL, run against the right tables, with the right joins, looking for the right shapes.

It's also not all program-integrity, which is the only work that could justify such blanket statements. Worse is better as long as it addresses the problem domain.

Fintech clients are generally interested in knowing whether a transaction happening _right now_ is fraud. They want to know that in a few milliseconds, for high-dimensional data. It's work done at a scale where relational databases cannot meet these real-time constraints, and instead find other uses like historical data loading. That's how you end up with in-memory databases, stream-processing engines, and yes, even machine learning.

Having said that, some of the author's points are valid, and I'm looking forward to their next writings; in particular, dealing with noisy alerts is a general problem beyond performance engineering.

beamy1d ago

In my experience, what you're describing would more specifically be called Fraud Prevention rather than Fraud Detection. Both tend to coexist and are complementary in a mature setup.

For Prevention, you're always going to be constrained by latency requirements, available data and an incomplete picture of user behaviour. You make a quick decision using ML and rules that deals with the majority of cases. But those constraints make it impossible to precisely prevent all fraud.

Detection deals with the downstream consequences of this. A team of analysts will typically analyse the accepted transactions for signs of fraud. This is particularly important for fraud types where you don't get an external signal like a chargeback or customer complaint. Platform integrity is one such example. But Fintechs will also see this building anti-money laundering systems - you need to go looking for the fraud. This is the process the article is describing.

I say they're complementary because the detected transactions become the labels for training and evaluating the next iteration of prevention models.

0cf8612b2e1e1d ago

> If a card swipes in Chicago and seven minutes later swipes in Los Angeles, one of those swipes is fake.

How does this work with online shopping? When I am sitting on the couch and buy from Amazon, where does the address get registered?

Can also imagine an edge case: couple shares an online account, one is traveling and purchases with the saved card details.

teraflop1d ago

Swiping a card (or inserting, or tapping) is a "card present" transaction. Online shopping, where you type in the card number, is a "card not present" transaction. Retailers and banks can tell the difference.
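A sketch of how the impossible-travel check could respect that distinction: compute the implied speed only when both transactions are card-present, and skip card-not-present ones entirely. The `Swipe` record, the field names, the coordinates, and the 1000 km/h threshold are all assumptions chosen for illustration, not details from the article.

```python
import math
from dataclasses import dataclass

EARTH_RADIUS_KM = 6371.0

@dataclass
class Swipe:
    lat: float          # merchant latitude in degrees
    lon: float          # merchant longitude in degrees
    minute: float       # minutes since an arbitrary reference time
    card_present: bool  # False for online (card-not-present) purchases

def haversine_km(a: Swipe, b: Swipe) -> float:
    """Great-circle distance between two merchant locations."""
    phi1, phi2 = math.radians(a.lat), math.radians(b.lat)
    dphi = phi2 - phi1
    dlmb = math.radians(b.lon - a.lon)
    h = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(h))

def implied_speed_kmh(a: Swipe, b: Swipe):
    """Speed needed to get from a to b, or None if either is card-not-present."""
    if not (a.card_present and b.card_present):
        return None
    distance = haversine_km(a, b)
    hours = abs(b.minute - a.minute) / 60
    if hours == 0:
        return math.inf if distance > 0 else 0.0
    return distance / hours

def is_impossible_travel(a: Swipe, b: Swipe, max_kmh: float = 1000.0) -> bool:
    speed = implied_speed_kmh(a, b)
    return speed is not None and speed > max_kmh

chicago = Swipe(41.8781, -87.6298, minute=0, card_present=True)
los_angeles = Swipe(34.0522, -118.2437, minute=7, card_present=True)
couch_order = Swipe(34.0522, -118.2437, minute=7, card_present=False)
```

Here `is_impossible_travel(chicago, los_angeles)` is true, while the same pair with an online order is never scored. The sketch still depends on merchant locations being recorded correctly, which other comments in the thread question.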

thedebuglife1d ago

They can tell based on transaction metadata. Source: I worked at a cc company

rootusrootus1d ago

I believe the system distinguishes between card present and card not present.

crmd1d ago

> Drawback: this doesn’t work until you have history. New accounts have no baseline.

This is an underrated CX factor: if my card gets denied when I’m a new customer or exhibiting a new pattern, I’m impressed with their software.

However if they deny a transaction where there is any previous history of me authenticating, then I’m frustrated by their naive paranoid algorithm.

chii1d ago

The incentive of the bank is to cut fraud.

Fraudulent transactions will eventually cost the bank (when they would have to reverse/reimburse it and eat the loss). A denied transaction only results in an angry customer who will quickly forget after they complained - so the customer bears the brunt of the externalized cost.

Therefore, the bank's incentive is to err on the side of more caution, and deny transactions when finding false positives.

kikimora1d ago

When someone disputes a charge and wins, the bank charges the processor and the original merchant. The bank won’t lose a penny; the merchant will.

fny1d ago

Isn't the point of ML that you learn these rules from the data? The right approach to me would be to use ML models to detect patterns that correspond with fraud and then evaluate them to see if any make sense. This way you might discover new hypotheses.

bob10291d ago

Anything that can't be explained and iterated deterministically is too risky for the business of declining financial transactions.

Human analysts need to be able to explain to compliance in a single 5 minute email why a specific transaction was declined, and most importantly, what could have been done differently to avoid the adverse decision.

Fixing one problem with ML often creates two new problems that aren't quite obvious yet. SQL tends to have fewer surprises with regard to regressions and unexpected side effects as things change over time.

epgui1d ago

In my experience, Visa support can’t tell me why my legitimate transaction was tagged as fraudulent, other than to say it triggered an AI thing. They also can’t tweak the settings like they used to do, but they can manually allow specific transactions one by one on an ad hoc basis.

hansvm1d ago

Recently, they've stopped even being able to allow specific transactions through for me. They can tag the flagged transaction as legitimate and hope the AI picks up on that, but that hasn't worked once in the last ~15 calls for me. I've just stopped trying to use Visa as my primary card online, a habit that bled into in-person purchases as well.

saagarjha1d ago

I assume that this option is unlocked once you enter litigation against the company.

janpeuker21h ago

I agree the post looks a little AI written, but generally this kind of analysis is quite common. Leaving aside human heuristics that are generally too well known to catch real scammers and actually have low precision (like time travel or "7 days", which is bad because weekly patterns are often important, so at the very least look at 10 days), what I find odd is that all results just return a user ID.

So this is really just surfacing cases, but with not enough context to be useful to prioritise. I would expect a score to be included.

Apart from that it misses a lot of signals like refunds, declines, disputes etc [1].

1) https://stripe.com/gb/guides/improve-fraud-management-with-r...

daneel_w23h ago

In reality, most banks perform a lot of these transaction checks in real time to block fraudulent txes up-front, instead of validating tx legitimacy retroactively at a point where the money is already gone. Some 15 years ago a security rep with Nordea (a large Nordic bank) called me late at night asking if I was currently in South Korea and had just a minute ago used my card in a shop. Someone had initiated a "card present" purchase with my card for 1337 SEK (I'm certain this amount was intentional), which Nordea automatically blocked as it was near the edge of possibility relative to my previous card swipe in Sweden earlier in the day, and they wanted to make sure they weren't about to mistakenly strand me abroad by blocking the card.

maciekkmrk1d ago

What if I go on a roadtrip and suddenly get gas at 2am?

vesrah1d ago

I had this happen once - I flew to a city about 8 hours of driving time away to buy a motorcycle and landed late in the evening. My card was declined when I got gas a little after midnight and I had no cash or other card with me so I called the 24 hour support line. I had a quick conversation with a support agent explaining that I was traveling and the card needed to be reactivated right away. Within five minutes the card was working and I was back to working my way down a long chain of mistakes.

masklinn1d ago

As the tail end of the article explains these are independent pieces of evidence not independent proofs: most of them can be legitimate operations (even the speed one, airliners cruise at that speed but if you get to ride a long-range business plane they can cruise faster).

layer823h ago

> Views are mine, not my employer’s.

What about the tables?

dnnddidiej1d ago

2 can be genuine use. I let my partner use my card and I use it on my phone as rfid. Maybe ignore phone usages since they are secured pretty well.

masklinn1d ago

All of them can be genuine use, these are fraud signals not fraud proofs, and the article does cover this:

> What works is running them all and scoring each transaction across the signals. A transaction that fails on three or four of them is almost always fraud. A transaction that fails on one might be your grandma being weird with her debit card on vacation.
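The scoring idea in that quote can be sketched as a simple count of fired signals. The signal names, the `triage` function, and the thresholds (three or more treated as likely fraud, two sent to review) are assumptions for illustration; the quoted passage only says that three or four hits is almost always fraud and one hit may be innocent.

```python
def count_signals(signals):
    """Count how many independent checks fired for one transaction."""
    return sum(1 for fired in signals.values() if fired)

def triage(signals, block_at=3, review_at=2):
    """Map a signal count to a coarse action. Thresholds are hypothetical."""
    hits = count_signals(signals)
    if hits >= block_at:
        return "likely fraud"
    if hits >= review_at:
        return "manual review"
    return "allow"

# Hypothetical signal names; each value is True when that check fired.
grandma_on_vacation = {
    "round_amount": False,
    "odd_hour": True,
    "impossible_travel": False,
    "rapid_repeat": False,
}
card_tester = {
    "round_amount": True,
    "odd_hour": True,
    "impossible_travel": False,
    "rapid_repeat": True,
}
```

A single odd-hour purchase is allowed, while three simultaneous hits are escalated. Returning the count alongside the user ID would also give reviewers something to prioritise by.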

dnnddidiej1d ago

> If a card swipes in Chicago and seven minutes later swipes in Los Angeles, one of those swipes is fake. The card is cloned. This is the most uncontroversial fraud signal you’ll find — there’s almost no legitimate reason a single card is in two distant places in seven minutes.

lmz1d ago

The question is whether they would treat that as a single card (physical vs digital).

rswail1d ago

The Apple/Google Pay cards have a DPAN (device account number) that is different to the CPAN of the physical card. It keeps the same issuer (first 6 digits) and the same "last 4" digits, but the others are different.

The DPAN is translated into the CPAN by software at the issuing bank, so it's not identifiable by the merchants.

Merchants get the "last 4" digits, but that's not enough to identify specific CPANs.

WhyIsItAlwaysHN1d ago

Exactly, hopefully this is not an autoblock in the future.

masklinn1d ago

> A transaction that fails on three or four of them is almost always fraud. A transaction that fails on one might be your grandma being weird with her debit card on vacation.

WhyIsItAlwaysHN1d ago

The article states that the particular item is a clear sign of fraud. If that was true, then it should be treated in a special manner. A more paranoid bank could enforce it without adhering to this guidance of multi-factor detection.

It isn't though, so balancing it with other rules is fine.

nujabe1d ago

They can distinguish a physical card vs Apple Pay

dogscatstrees1d ago

The main problem with these SQL calculations is that they are deterministic shortcuts for a probabilistic problem. Fraud is not usually a “true because rule X matched.” It is more like "what is the probability this is fraudulent"? SQL patterns are useful, but they are blunt instruments. I really don't think banks use deterministic heuristics but more data science stuff.
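One way to read "what is the probability this is fraudulent" is to treat each rule hit as evidence that shifts the odds, rather than as a verdict. The sketch below combines a prior with per-signal likelihood ratios in the naive Bayes style, which assumes the signals are independent. The function name, the prior of 0.1%, and the ratios 20 and 15 are invented numbers for illustration, not measured values.

```python
import math

def fraud_probability(prior, likelihood_ratios):
    """Combine a prior fraud rate with likelihood ratios of the signals that fired.

    Assumes the signals are conditionally independent (naive Bayes).
    """
    log_odds = math.log(prior / (1 - prior))
    log_odds += sum(math.log(ratio) for ratio in likelihood_ratios)
    return 1 / (1 + math.exp(-log_odds))

# Hypothetical: 0.1% base fraud rate, two signals fired that are
# 20x and 15x more common among fraudulent transactions.
p_no_signals = fraud_probability(0.001, [])
p_two_signals = fraud_probability(0.001, [20.0, 15.0])
```

Under these made-up numbers, two strong signals raise the estimate from 0.1% to roughly 23%, which is a reason to review rather than a proof of fraud.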

tdeck1d ago

I have a fair amount of experience in this industry, albeit a couple of years old now. I worked at Square on their payment risk team in 2015 and 2016, and at Plaid on their ACH fraud API product called Signal from 2021 to 2024. At Plaid I was involved in client meetings and learned how many companies were already handling risk, and I've interviewed at a handful of other companies' risk teams when I was looking for a new role.

Basically it's not just banks and formal financial institutions doing this, and how they do it depends on the company size. Size tends to correlate not only with how many resources you have for a risk team, but also with whether fraud rings are targeting you.

Usually what I've seen is that companies start with some kind of batch SQL/simple logic process that runs daily and tends to flag accounts for manual review and block automatic events like settlement or trading (or whatever the platform does) until that review has been done. Then over time the company will transition to an ML-based approach that still mostly flags things for manual review. The goal of the ML is to improve the precision of the flagging without hurting dollar recall or fraud event recall too much. Depending on the payment system companies may be sensitive to both (for example, in ACH if you get too many returns, even very low dollar payment returns, you're going to get a hard time from your partner bank and you risk not being able to use ACH anymore).
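The three quantities mentioned here (precision of the flagging, fraud event recall, and dollar recall) can be computed from labelled review outcomes. A minimal sketch, assuming each case is a `(flagged, is_fraud, amount)` tuple; that layout and the sample numbers are hypothetical.

```python
def flagging_metrics(cases):
    """Return precision, event recall, and dollar recall for a flagging rule.

    cases: iterable of (flagged, is_fraud, amount) tuples.
    """
    cases = list(cases)
    flagged = [c for c in cases if c[0]]
    fraud = [c for c in cases if c[1]]
    caught = [c for c in cases if c[0] and c[1]]

    fraud_dollars = sum(amount for _, _, amount in fraud)
    caught_dollars = sum(amount for _, _, amount in caught)

    return {
        "precision": len(caught) / len(flagged) if flagged else 0.0,
        "event_recall": len(caught) / len(fraud) if fraud else 0.0,
        "dollar_recall": caught_dollars / fraud_dollars if fraud_dollars else 0.0,
    }

# Hypothetical outcomes: one large fraud caught, one small fraud missed,
# one legitimate payment flagged, one legitimate payment passed.
metrics = flagging_metrics([
    (True, True, 900.0),
    (True, False, 40.0),
    (False, True, 100.0),
    (False, False, 25.0),
])
```

In this example precision and event recall are both 50% while dollar recall is 90%. That gap is why a system sensitive to return counts, as with ACH, cannot look at dollars alone.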

datsci_est_201513h ago

Or, “rules-based logic encoded in SQL queries without any backing data”.

Bunch of thresholds, no data proving those thresholds are meaningful.

inheritedwisdom1d ago

This takes me back to fighting telephone fraud, back when folks used to accept cc over the phone. We used similar patterns but only had phone numbers and the white pages: crossing state boundaries inside similar time frames, and categorizing similar merchant types. It’s fun to see these same patterns still in use 20 years later for the same purpose.

vladiat0r23h ago

These seem like reasonable interview questions. Otherwise, these seem very basic and naive.

noduerme1d ago

This is very cool to read. Although I've never truly worked in fraud prevention, I stumbled into automating a lot of similar pattern checks to catch collusion and fraud when I wrote and ran a poker site / casino. Window functions were not available then so the queries were LONG. One way I'd deal with it was to assign uuids to every pair of players who'd ever shared a poker table, and then run nightly analysis of how much their betting deviated from expected norms and their own baseline on each stage of the game if they were in the same hand as each other. This could actually be done in one or two magnificent 100+ line SQL queries on the history table, on a read replica.

Lagging window functions and/or lateral joins probably would have reduced it to 1/4 the size but definitely increased the cost versus just narrowing the sets into smaller tables first.
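The pairing approach described above can be sketched outside SQL. This is a loose illustration, not the original queries: `pair_id` derives one stable identifier per unordered pair of players, and `deviation_z` gives a rough measure of how far a player's betting in shared hands sits from their own baseline. Both function names, the use of `uuid5`, and the sample bet sizes are assumptions.

```python
import statistics
import uuid

def pair_id(player_a: str, player_b: str) -> uuid.UUID:
    """Stable identifier for an unordered pair of players."""
    first, second = sorted((player_a, player_b))
    return uuid.uuid5(uuid.NAMESPACE_OID, f"{first}|{second}")

def deviation_z(baseline_bets, bets_together):
    """Rough z-score of mean bet size in shared hands against the player's baseline."""
    mean = statistics.mean(baseline_bets)
    stdev = statistics.stdev(baseline_bets)
    if stdev == 0:
        return 0.0
    return (statistics.mean(bets_together) - mean) / stdev

# Hypothetical bet sizes: the player bets far less when the partner is in the hand.
baseline = [10, 12, 8, 10, 10]
with_partner = [2, 3, 2]
z = deviation_z(baseline, with_partner)
```

A strongly negative score for the same pair night after night would be the kind of deviation worth a closer look, though a real analysis would split it by stage of the game as the comment describes.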

sincerely1d ago

This is quite interesting, but the blatantly AI generated explanations are like an anti-signal for quality

skeeter202022h ago

These all seem pretty elementary TBH. They focus on identifying fraudulent transactions, vs (IME) the more valuable deciding if a transaction is fraudulent. This is totally doable today. Example: instead of some sort of "outside of normal transaction" you can confidently determine that "coffee at 2am" is likely fine, if they also bought gas 10 minutes earlier from the same merchant, dinner 300 miles away at 7pm, and again gas 8 hours ago in their home town.

achierius1d ago

This seems interesting, but has so many signs of AI writing that I worry it's not just edited but generated from whole cloth. Probably still a lot of truth in there but it does give me pause!

> The roundness is the signal.

> Slight pain, same result.

to point at a few.

jorisnoo1d ago

> Three filters. That’s it.

And my favourite most hated pattern, the no no no:

> Not machine learning, not graph databases, not whatever Gartner is hyping this year.

gwerbin1d ago

I suspect the entire concept was vibe coded. There's a reason fraud detection uses machine learning.

zapkyeskrill1d ago

Oh shyte, I use (and have used) these for a long time. Guess everything is classed as AI nowadays, so just yield and use it (everyone thinks you do anyway).

arcfour1d ago

Nah, human writing will bleed through as imperfect (in a quaint way).

jorisnoo1d ago

Not everything of course, not because of a few phrases, not your comment. But the recent omnipresence of these is just hard to ignore.

achierius1d ago

This comment certainly does not scan as AI! Look, this isn't perfect, but it's the best we've got, and so long as AI writing is meaningfully worse than human writing, people are going to try to tell the difference.

ande-mnoc1d ago

> The “two or more in that hour” filter on the inner query is doing important work.

This is Claude talking isn’t it.

AussieWog931d ago

Wait, you can clone a credit card? Why don't they use a public-private key pair?

f311a1d ago

You can only clone the magnetic stripe. In a lot of places, you can’t make large transactions with it.

TheOrange1d ago

Fascinating stuff. More please :/)

How do you deal with vacations and online shopping? You could be in another country or two in a few hours and purchase from across the world.

Beestie21h ago

This is gold, mate. Much obliged.

1a527dd523h ago

I feel like if you've done any kind of investigation work then this is the normal baseline.

jason1cho22h ago

People here provide counterexamples to show this article is bullshit. Don't forget, fraud detection is about statistics. Outliers always exist.

Machine learning systems also learn your pattern. The article gives simple SQL rules. Don't dismiss this article as worthless.

sltr1d ago

Obligatory in any discussion of money fraud: https://www.bitsaboutmoney.com/archive/optimal-amount-of-fra...

atombender1d ago

This is AI slop, as has been pointed out by several other commenters. Flagged.

nubg23h ago

> If their card does, it’s either being used by someone else or they’re traveling — and travel produces other signals you can check.

Signals he can check? So some random dude is looking at my credit card purchase history while playing around with his SQL queries?

mattmanser1d ago

This is the sort of thing I used to love doing and I often gaze at raw data analysis and sometimes wish my career had pivoted towards working with data like this.

But I must admit there was a point where I suddenly lost my love for SQL and it was pretty much when the OVER PARTITION BY syntax appeared.

It never clicks. I always have to look up how it works, I always find it unintuitive. I've never understood why I hate it so much.
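For anyone in the same position, one way to read `OVER (PARTITION BY ... ORDER BY ...)` is "for each row, look at the other rows of the same group in this order, without collapsing them the way GROUP BY does". The sketch below runs a small query through Python's built-in `sqlite3` module (window functions need SQLite 3.25 or newer). The `txn` table, its columns, and the sample rows are made up for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE txn (card_id TEXT, ts INTEGER, amount REAL)")
conn.executemany(
    "INSERT INTO txn VALUES (?, ?, ?)",
    [
        ("A", 100, 4.73),
        ("A", 160, 52.81),
        ("B", 120, 1.00),
        ("A", 520, 9.99),
        ("B", 130, 1.00),
    ],
)

# For each card separately, subtract the previous transaction's timestamp.
# LAG() returns NULL for the first row of each partition, so the gap is NULL there.
rows = conn.execute(
    """
    SELECT card_id,
           ts,
           ts - LAG(ts) OVER (PARTITION BY card_id ORDER BY ts) AS gap
    FROM txn
    ORDER BY card_id, ts
    """
).fetchall()
```

Every input row survives in the output, and each one carries the gap to the previous transaction on the same card. Card B's second row has a gap of 10, and the first row for each card has no gap.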

nubg23h ago

Article was written by AI (dozens of giveaways), so take the content with a grain of salt! The author could not be bothered to write it by hand, yet demands your attention to read it. I'm not even sure which parts are based on his prompt and which were hallucinated... How to butcher what could have been an interesting article... sigh

themafia1d ago

> If a card swipes in Chicago and seven minutes later swipes in Los Angeles, one of those swipes is fake. The card is cloned.

Or, the cardholder is trying to do the cannonball run:

https://www.youtube.com/shorts/Dx5WPNIEwiE

Hackbraten1d ago

Some of these heuristics are dystopian to no end.

> Most people are creatures of habit when they spend money. A nine-to-fiver doesn’t suddenly start buying gas at 3am.

Breaking out of a habit once in a while is what keeps one's mind sharp.

A big "fuck you" to financial analysts with those groundhog-day mindsets for making my life much more miserable than it needs to be and for adding a chilling effect to those little getaways that make life interesting and worthwhile. I despise you for this.

yieldcrv1d ago

the real question is whether you would point your agentic system at this blog post and create

chargeback-mcp

or would you turn it all into a markdown file and call it a skill?

j / k navigate · click thread line to collapse

127 comments

jstanley1d ago

> Real cardholders almost never buy something for exactly $1.00. Coffee is $4.73, gas is $52.81. The roundness is the signal.

Surely this depends on how the vendor sets their prices? If you're going to buy something from a website to test a stolen credit card you don't just get to make up your own prices.

And I think you may be over-indexing on the US "prices don't include tax" thing. Elsewhere, round-number prices are extremely common.

It's unclear to me whether this article is an attempt at breaking down complex expertise into over-simplified SQL queries, or whether it is all speculative and made up.

There is a conflict between "Six SQL patterns I use to catch transaction fraud" and "Nothing here comes from anything I’ve actually worked on or seen".

nswango1d ago

The "transaction outside usual hour range" seems pretty basic.

I don't usually buy gas, coffee or snacks at 2am. But on the very rare occasion that I do, I'm dealing with some kind of personal emergency and don't also want to have to call my bank.

I get that that's also a time opportunistic thieves, etc, might be operating. But the cost of false positives is also a thing.

normie30001d ago

Worse than that.

Coffee usually _is_ a round number in my experience, and I know of people who aim for round numbers when filling their car, and of fuel stations which require a pre-set value, often 10, 20, 50€ etc

sheept1d ago

panflute1d ago

Niksko1d ago

normie30001d ago

I don't need to account for it - I'm just stating that this doesn't match my experience:

> Real cardholders almost never buy something for exactly $1.00. Coffee is $4.73, gas is $52.81. The roundness is the signal.


LandR1d ago

Yeah I was in a bar one night and was peckish, so tried to buy a packet of crisps. They said minimum spend on card was £5, so I said just charge me the £5 it's fine.

Card got blocked as they thought it was fraud. Annoying! And not something inebriated me wanted to deal with at 2am.

Ok. Maybe they protected me from myself, but still!

mike_hock1d ago

This is also

a) trivial to bypass by adding dither to the test transactions and

b) trivial to improve upon with proper statistical analysis and

c) shouldn't this kind of heuristic pattern recognition with no expectation of near-100% accuracy be what AI is good at?
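To make point (a) concrete, here is a minimal sketch of a whole-dollar check run through Python's built-in `sqlite3`. The `txns` table, the `amount_cents` column, and the sample rows are all made up for illustration; this is not the article's query. It shows the rule missing a card test padded with a few cents of dither while still catching a legitimate round charge.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE txns (txn_id TEXT, amount_cents INTEGER);  -- hypothetical schema
INSERT INTO txns VALUES
    ('t1', 100),   -- 1.00 card test
    ('t2', 103),   -- the same test with 3 cents of dither
    ('t3', 473),   -- 4.73 coffee
    ('t4', 500);   -- 5.00 minimum card spend at a bar
""")

# Flag amounts that are an exact multiple of 100 cents.
ROUND_AMOUNTS = """
SELECT txn_id FROM txns
WHERE amount_cents % 100 = 0
ORDER BY txn_id
"""
flagged = [row[0] for row in conn.execute(ROUND_AMOUNTS)]
print(flagged)
```

With these rows the dithered test `t2` slips through and the bar tab `t4` is flagged, which is the false-negative and false-positive pair people in this thread are describing.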

themafia1d ago

I'm seeing a few stores here and there which have a "round up to donate" option. I guess I'm a bit of a sucker and I always use that option. My groceries are always a round number as a result.

time4tea1d ago

I've always suspected that this is all of a tax dodge, a money spinner, and a PR exercise: "we gave xxx to charity". No, your customers did.

Just set up a direct debit to your favourite charity.

nubg23h ago

The article is LLM-generated.

relevant_stats1d ago

Reading this to the very end uncovers empty and contradictory advice. I'm almost sure it's LLM generated.

We learn simultaneously that 'your team' shouldn't rely on any one of those patterns ('none of them is enough'), but that pattern 1 'alone will surface a useful amount of fraud'.

We also read strange sentences like "Every analyst on your team will use them (ie window functions) once they exist, and adding the next fraud pattern stops being a project. [end of paragraph]"

Or irrelevant discussions about how filtering by "IS NULL" might be not applicable when almost none of the provided examples uses it (and the one which does uses it in different context).

This is low quality and too long.

Kwpolska1d ago

> Border crossings inside 10 minutes. International rings.

Or normal people living in Europe in border-adjacent areas.

compel21601d ago

Even US -> CA was maybe 10 min when I crossed a few weeks ago...

weird-eye-issue1d ago

Is it possible that you are actually in a transnational crime syndicate without knowing it?

reconnecting1d ago

Hacker News, we need to talk!

220+ points and 70 comments, and very few notice it's quite a fake post, and no one notices that the author is an AI-generated person?

1. https://www.amazon.it/Forged-Soundtrack-Explicit-Fixel-Smith...

2. https://fixelsmith.com

3. https://analytics.fixelsmith.com/

4. https://www.instagram.com/fixeltales/

relevant_stats1d ago

Hacker News has recently developed a frustrating habit of upvoting such low-quality, sloppy AI submissions.

Makes me wonder if this AI flood uncovers the unflattering truth about this community's acuteness, or whether it's only a failure of existing guardrails and we just need to change them.

Igrom1d ago

geoduck1423h ago

>Assuming that all of the comments were done in good faith

Well, sure. But some people come here just for the comments and don't read the articles.

diatone1d ago

tdeck1d ago

reconnecting1d ago

1. https://github.com/tirrenotechnologies/tirreno

2. https://fixelsmith.com

hju22_-31d ago

Kind of fascinating, though it could still be a person doing this using AI as opposed to an entirely generated persona. Thanks for bringing it up.

reconnecting1d ago

If it looks like an AI, writes about fraud like an AI, and sings like an AI, then it's probably not a duck.

reconnecting1d ago

We develop tirreno (1), an open-source security framework.

1. https://github.com/tirrenotechnologies/tirreno

jwr23h ago

After 35 years of building software systems I've learned to temper my hubris. These days I rarely assume things to be "definitely true".

Many things which you think are true might not be.

VladVladikoff23h ago

gosub10022h ago

enoent1d ago

It's also not all program-integrity, which is the only work that could justify such blanket statements. Worse is better as long as it addresses the problem domain.

Having said that, some of the author's points are valid, and I'm looking forward to their next writings; in particular, dealing with noisy alerts is a general problem beyond performance engineering.

beamy1d ago

In my experience, what you're describing would more specifically be called Fraud Prevention rather than Fraud Detection. Both tend to coexist and are complementary in a mature setup.

I say they're complementary because the detected transactions become the labels for training and evaluating the next iteration of prevention models.

0cf8612b2e1e1d ago

> If a card swipes in Chicago and seven minutes later swipes in Los Angeles, one of those swipes is fake.

How does this work with online shopping? When I am sitting on the couch and buy from Amazon, where does the address get registered?

Can also imagine an edge case: couple shares an online account, one is traveling and purchases with the saved card details.

teraflop1d ago

thedebuglife1d ago

They can tell based on transaction metadata. Source: I worked at a cc company

rootusrootus1d ago

I believe the system distinguishes between card present and card not present.
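A rough sketch of how that distinction could keep online purchases out of the "impossible travel" check, using Python's built-in `sqlite3`. Everything here is an assumption for illustration: the `txns` table, the `card_present` flag, the 600-second threshold, and using a city mismatch as a stand-in for real distance. It is not the article's query or any issuer's actual logic.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE txns (                      -- hypothetical schema
    card_id TEXT, ts INTEGER, city TEXT, card_present INTEGER
);
INSERT INTO txns VALUES
    ('c1', 0,    'Chicago',     1),
    ('c1', 300,  'Online',      0),      -- couch purchase, card not present
    ('c1', 420,  'Los Angeles', 1),
    ('c2', 0,    'Chicago',     1),
    ('c2', 7200, 'Chicago',     1);
""")

# Compare each in-person swipe with the previous in-person swipe on the
# same card; WHERE runs before the window function, so online rows are
# dropped before LAG() looks backwards.
IMPOSSIBLE_TRAVEL = """
WITH swipes AS (
    SELECT card_id, ts, city,
           LAG(city) OVER (PARTITION BY card_id ORDER BY ts) AS prev_city,
           LAG(ts)   OVER (PARTITION BY card_id ORDER BY ts) AS prev_ts
    FROM txns
    WHERE card_present = 1
)
SELECT card_id, prev_city, city, ts - prev_ts AS gap_seconds
FROM swipes
WHERE prev_city IS NOT NULL
  AND prev_city <> city
  AND ts - prev_ts < 600
"""
rows = conn.execute(IMPOSSIBLE_TRAVEL).fetchall()
print(rows)
```

Only the Chicago to Los Angeles pair on `c1` comes back; the online purchase in between never enters the comparison. The shared-account edge case would need more than this, since a saved card used remotely is card-not-present anyway.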

crmd1d ago

> Drawback: this doesn’t work until you have history. New accounts have no baseline.

This is an underrated CX factor: if my card gets denied when I’m a new customer or exhibiting a new pattern, I’m impressed with their software.

However if they deny a transaction where there is any previous history of me authenticating, then I’m frustrated by their naive paranoid algorithm.

chii1d ago

The incentive of the bank is to cut fraud.

Therefore, the bank's incentive is to err on the side of more caution and deny transactions, even at the cost of false positives.

kikimora1d ago

When someone disputes a charge and wins, the bank charges the processor and the original merchant. The bank won’t lose a penny; the merchant will.

fny1d ago

bob10291d ago

Anything that can't be explained and iterated deterministically is too risky for the business of declining financial transactions.

epgui1d ago

hansvm1d ago

saagarjha1d ago

I assume that this option is unlocked once you enter litigation against the company.


janpeuker21h ago

So this is really just surfacing cases, but without enough context to be useful for prioritising. I would expect a score to be included.

Apart from that it misses a lot of signals like refunds, declines, disputes etc [1].

1) https://stripe.com/gb/guides/improve-fraud-management-with-r...
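One simple way to attach a score is to count how many rules a transaction trips and sort the review queue by that count. The sketch below uses Python's built-in `sqlite3`; the `flags` table and its four 0/1 columns are hypothetical names, and the unweighted sum is an assumption rather than anything the article specifies. Sensible weights would need labelled outcomes such as the refunds, declines and disputes mentioned above.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE flags (                     -- hypothetical: one 0/1 column per rule
    txn_id TEXT, round_amount INTEGER, odd_hour INTEGER,
    impossible_travel INTEGER, new_merchant INTEGER
);
INSERT INTO flags VALUES
    ('t1', 1, 1, 1, 0),
    ('t2', 0, 1, 0, 0),
    ('t3', 0, 0, 0, 0);
""")

# Unweighted score: the number of rules tripped. Untouched transactions
# are left out of the queue; the rest are ordered worst first.
SCORED = """
SELECT txn_id,
       round_amount + odd_hour + impossible_travel + new_merchant AS score
FROM flags
WHERE round_amount + odd_hour + impossible_travel + new_merchant > 0
ORDER BY score DESC, txn_id
"""
queue = conn.execute(SCORED).fetchall()
print(queue)
```

Here `t1` lands at the top with three signals and `t2` sits below it with one, so an analyst sees the multi-signal case first instead of an unordered list of hits.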


daneel_w23h ago

maciekkmrk1d ago

What if I go on a roadtrip and suddenly get gas at 2am?

vesrah1d ago

masklinn1d ago

layer823h ago

> Views are mine, not my employer’s.

What about the tables?

dnnddidiej1d ago

2 can be genuine use. I let my partner use my card and I use it on my phone as rfid. Maybe ignore phone usages since they are secured pretty well.

masklinn1d ago

All of them can be genuine use, these are fraud signals not fraud proofs, and the article does cover this:

dnnddidiej1d ago

lmz1d ago

The question is whether they would treat that as a single card (physical vs digital).


rswail1d ago

The DPAN is translated into the CPAN by software at the issuing bank, so it's not identifiable by the merchants.

Merchants get the "last 4" digits, but that's not enough to identify specific CPANs.

WhyIsItAlwaysHN1d ago

Exactly, hopefully this is not an autoblock in the future.

masklinn1d ago

> A transaction that fails on three or four of them is almost always fraud. A transaction that fails on one might be your grandma being weird with her debit card on vacation.

WhyIsItAlwaysHN1d ago

It isn't though, so balancing it with other rules is fine.

nujabe1d ago

They can distinguish a physical card vs Apple Pay

dogscatstrees1d ago

tdeck1d ago

datsci_est_201513h ago

Or, “rules-based logic encoded in SQL queries without any backing data”.

Bunch of thresholds, no data proving those thresholds are meaningful.

inheritedwisdom1d ago

vladiat0r23h ago

These seem like reasonable interview questions. Otherwise, these seem very basic and naive.

noduerme1d ago

Lagging window functions and/or lateral joins probably would have reduced it to 1/4 the size but definitely increased the cost versus just narrowing the sets into smaller tables first.

sincerely1d ago

This is quite interesting, but the blatantly AI generated explanations are like an anti-signal for quality

skeeter202022h ago

achierius1d ago

This seems interesting, but has so many signs of AI writing that I worry it's not just edited but generated from whole cloth. Probably still a lot of truth in there but it does give me pause!

> The roundness is the signal.

> Slight pain, same result.

to point at a few.

jorisnoo1d ago

> Three filters. That’s it.

And my favourite most hated pattern, the no no no:

> Not machine learning, not graph databases, not whatever Gartner is hyping this year.

gwerbin1d ago

I suspect the entire concept was vibe coded. There's a reason fraud detection uses machine learning.

zapkyeskrill1d ago

Oh shyte, I use (and have used) these for a long time. Guess everything is classed as AI nowadays; just yield and use it (everyone thinks you do anyway).

arcfour1d ago

Nah, human writing will bleed through as imperfect (in a quaint way).

jorisnoo1d ago

Not everything of course, not because of a few phrases, not your comment. But the recent omnipresence of these is just hard to ignore.

achierius1d ago

ande-mnoc1d ago

> The “two or more in that hour” filter on the inner query is doing important work.

This is Claude talking, isn’t it?

AussieWog931d ago

Wait, you can clone a credit card? Why don't they use a public-private key pair?

f311a1d ago

You can only clone magnetic stripe. In a lot of places, you can’t make large transactions with it

TheOrange1d ago

Fascinating stuff. More please :/)

How do you deal with vacations and online shopping? You could be in another country or two in a few hours and purchase from across the world.

Beestie21h ago

This is gold, mate. Much obliged.

1a527dd523h ago

I feel like if you've done any kind of investigation work then this is the normal baseline.

jason1cho22h ago

People here provide counterexamples to show this article is bullshit. Don't forget, fraud detection is about statistics. Outliers always exist.

Machine learning systems also learn your pattern. The article gives simple SQL rules. Don't dismiss this article as worthless.

sltr1d ago

Obligatory in any discussion of money fraud: https://www.bitsaboutmoney.com/archive/optimal-amount-of-fra...

atombender1d ago

This is AI slop, as has been pointed out by several other commenters. Flagged.


nubg23h ago

> If their card does, it’s either being used by someone else or they’re traveling — and travel produces other signals you can check.

Signals he can check? So some random dude is looking at my credit card purchase history while playing around with his SQL queries?

mattmanser1d ago

This is the sort of thing I used to love doing and I often gaze at raw data analysis and sometimes wish my career had pivoted towards working with data like this.

But I must admit there was a point where I suddenly lost my love for SQL and it was pretty much when the OVER PARTITION BY syntax appeared.

It never clicks. I always have to look up how it works, I always find it unintuitive. I've never understood why I hate it so much.
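For what it's worth, one way to read `OVER (PARTITION BY ...)` is "GROUP BY that keeps every row": the aggregate is computed per partition and then written next to each original row instead of collapsing them. A small sketch with Python's built-in `sqlite3` (window functions need SQLite 3.25 or newer); the `txns` table, the `hour_of_day` column, and the "fewer than two in that hour" cut-off are illustrative assumptions loosely modelled on the usual-hours pattern discussed in this thread.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE txns (txn_id TEXT, card_id TEXT, hour_of_day INTEGER);  -- hypothetical
INSERT INTO txns VALUES
    ('t1', 'c1', 9), ('t2', 'c1', 9), ('t3', 'c1', 9), ('t4', 'c1', 3),
    ('t5', 'c2', 14), ('t6', 'c2', 14);
""")

# COUNT(*) is evaluated once per (card_id, hour_of_day) partition, but all
# six rows survive, each carrying its partition's count.
QUERY = """
SELECT txn_id, card_id, hour_of_day,
       COUNT(*) OVER (PARTITION BY card_id, hour_of_day) AS n_in_hour
FROM txns
ORDER BY txn_id
"""
rows = conn.execute(QUERY).fetchall()

# Transactions in an hour the card has used fewer than two times.
unusual = [txn_id for txn_id, _card, _hour, n in rows if n < 2]
print(unusual)
```

The same count with `GROUP BY card_id, hour_of_day` would return three summary rows and lose the `txn_id` values, which is why the window form shows up so often in per-transaction rules.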

nubg23h ago

themafia1d ago

> If a card swipes in Chicago and seven minutes later swipes in Los Angeles, one of those swipes is fake. The card is cloned.

Or, the cardholder is trying to do the cannonball run:

https://www.youtube.com/shorts/Dx5WPNIEwiE

