Don't believe ChatGPT – we do not offer a "phone lookup" service (opens in new tab)

(blog.opencagedata.com)

453 pointsfreyfogle3y ago273 comments

273 comments

181 comments · 34 top-level

crazygringo3y ago· 34 in thread

I'm curious -- does anyone know of ML directions that could add any kind of factual confidence level to ChatGPT and similar?

We all know now that ChatGPT is just autocomplete on steroids. It produces plausibly convincing patterns of speech.

But from the way it's built and trained, it's not like there's even any kind of factual confidence level you could threshold, or anything. The concept of factuality doesn't exist in the model at all.

So, is any progress being made towards internet-scale ML "fact engines" that also have the flexibility and linguistic expressiveness of ChatGPT? Or are these just two totally different paths that nobody knows how to marry?

Because I know there's plenty of work done with knowledge graphs et al., but those are very brittle things that generally need plenty of human curation and verification, and can't provide any of the (good) "fuzzy thinking" that ChatGPT can. They can't summarize essays or write poems.

irrational3y ago

Remember the guy a few weeks ago that was being gaslighted by ChatGPT that this is the year 2022? Not only is it giving out potentially false info, but it will double down that it is right and you are wrong. Though, to be honest, that sounds like a lot of real people. The difference is, people are smart enough to not double down on try to say it is a different year and your phone is probably reporting the year wrong.

amscanne3y ago

That was the Bing preview, which is supposed to be an actual information product.

aftbit3y ago

I was entirely unable to convince it that England has a King now.

nl3y ago

> does anyone know of ML directions that could add any kind of factual confidence level to ChatGPT and similar?

Yes. It's a very active area of research. For example:

Discovering Latent Knowledge in Language Models Without Supervision (https://arxiv.org/abs/2212.03827) shows an unsupervised approach for probing a LLM to discover things it thinks are facts

Locating and Editing Factual Associations in GPT (https://arxiv.org/pdf/2202.05262.pdf) shows an approach to editing a LLM to edit facts.

Language Models as Knowledge Bases? (https://aclanthology.org/D19-1250.pdf) is some slightly older work exploring how well LLMs store factual information itself.

supriyo-biswas3y ago

Replying to this comment to find it later. (Is there a good way to bookmark comments on HN?)

1 more reply

crazygringo3y ago

Thank you so much! Those are exactly the types of links I'm curious about.

soerxpso3y ago

You're describing a problem as old as academia, on which very little progress has ever been made. Before "add a factual confidence level evaluator to a bot that doesn't understand the concept of fact" you must first figure out how to calculate a "factual confidence level" at all, in general.

visarga3y ago

There was a model that could set up a simulation to estimate the answer for you. So it won't use pure language, but it only works in a few cases.

alfalfasprout3y ago

By definition, an LLM doesn't have a semantic world model or ontology. Even the most "dumb" (and I use that in quotes because they really aren't) animal is able to reason about uncertain concepts and understands risk and uncertainty.

Yann Lecun has posted a lot recently about this but basically LLMs are a "useful offramp on the road to AGI".

cypress663y ago

There's nothing "by definition" that says so.

In fact many propose that when you train an LLM, in order to be able to predict the next word with enough accuracy, it must internally build a world model.

Yann Lecun is very salty about chatgpt, I wouldn't take his word seriously.

3 more replies

BoorishBears3y ago

There's research being done on this: https://arxiv.org/abs/2302.04761

At its core using an LM alone to solve factual problems seems silly: It's not unlike asking Dall-E to draw DOT compliant road signs.

I've gone at length at how unfortunate it would be if LMs start to get a bad rap because they're being shoehorned into being "Ask Jeeves 2.0" when they could be so much more.

crazygringo3y ago

> It's not unlike asking Dall-E to draw DOT compliant road signs.

I love that. That's going to be my new explanation for people around ChatGPT.

For some reason it seems so much more obvious when Dall-E does something close but still totally wrong (e.g. 3 or 6 fingers, 3 arms, etc.), but it's not immediately obvious with text. But it's still the same underlying principles.

snowstormsun3y ago

I think "Explainable AI" is a related research direction, but perhaps not popular for language models.

shawntan3y ago

I think part of the issue is what level of explanation is satisfactory. We can explain how every linear transformation computes its output, but the sum of it is in many ways more than its parts.

Then there are efforts that look like this one: https://news.ycombinator.com/item?id=34821414 They go probing for specific capabilities of Transformers to figure out which cell fires under some specific stimulus. But think a little bit more about what people might want from explainability and you quickly find that something like this is insufficient.

There may be a tradeoff we're looking at where explainability (for some definition of it) will have to be exchanged for performance (under some set of tasks). You can build more interpretable models these days, but you usually pay for it in terms of how well you do on benchmarks.

behnamoh3y ago

Impossible to explain the inner workings of GPT-3 without having access to the model and its weights. Does anyone know if any methods exist for this?

2 more replies

sdrinf3y ago

Add to your prompt: "For every factual statement, assign a certainty float 0..1, where 0 means you're very uncertain, and 1 means you're absolutely certain it is true".

Specific example: "why do we have first-person subjective experiences? List current theories. For every theory, assign a truthiness float 0..1, where 0 means you're sure it is wrong, and 1 means you're absolutely sure it is true"

From experimenting with this, it will shift the output, sometimes drastically so, as the model now has to reason about it's own certainty; it tends to make significantly less shit up (for example, the non-truth-marked version of the output for the query above also listed panpsychism; whereas the truth-marked version listed only scientific hypotheses).

So the model _can_ reason about it's certainty, and truth-value; and I strongly suspect it was just not rewarded during RLHF for omitting things it knew to be false -basically, percolating the social lies people tell to eachother- which seems to show up in coding as well.

Edit: see https://twitter.com/sdrinf/status/1629084909422931969 for results

wildrhythms3y ago

I initialized with that prompt and it did not give me any 0..1 certainty values on any subsequent output to my queries.

swexbe3y ago

Or maybe it will just hallucinate this number too.

krainboltgreene3y ago

> We all know now that ChatGPT is just autocomplete on steroids

I promise you most people do not know this.

numeri3y ago

> The concept of factuality doesn't exist in the model at all.

This is an example of a whole range of beliefs about LLMs that are very common (even in the field itself), because they were obviously true for small models, but that might not necessarily hold for larger models. There's a lot that we don't know about LLMs, but we do know that they exhibit emergent behaviors as they scale. Smaller models don't really have world models, just language models, but these larger models have started developing clear world models once given the capacity and data to do so.

As for the existence of a concept of factuality, I found this paper[1] very interesting. It details an unsupervised method to identify which internal activations of the model correspond to factual statements, regardless of what the model ends up saying. Looking at those internal activations rather than just the model's output even reduces the model's susceptibility to prompts that lead it towards saying the wrong answer.

[1] https://arxiv.org/abs/2212.03827

YeGoblynQueenne3y ago

>> So, is any progress being made towards internet-scale ML "fact engines" that also have the flexibility and linguistic expressiveness of ChatGPT? Or are these just two totally different paths that nobody knows how to marry?

I wouldn't hold my breath. The whole idea of statistical language modelling (much more ancient than Transformer-trained large language models, btw) is to represent structure without having to represent meaning, because we have no idea how to represent meaning. Or, seen another way, we know how to represent structure, but not how to represent meaning, so let's focus on structure and cross our fingers that meaning will naturally sort of emerge, when it feels like it.

So far, we got structure down pat (it's been a few years now, or quite a few, depending on how you see it) but meaning is nowhere to be seen.

Nevertheless, this is an interesting scientific result: one can have smooth, grammatically correct linguistic structure without meaning. Progress has been achieved (and no, this is not sarcasm).

juujian3y ago

People say things that are wrong. We train language model on what people say. And even if we were able to filter the training data for just factually correct things---language models use stochastic to generate novel replies, there is always the risk something wrong comes up. So in short, no, that is not what language models are designed to do.

mochomocha3y ago

> But from the way it's built and trained, it's not like there's even any kind of factual confidence level you could threshold, or anything. The concept of factuality doesn't exist in the model at all.

I'm not super familiar with ChatGPT internals, but there are plenty of ways to tack on uncertainty estimates to predictions of typical "large scale ML models" without touching Bayesian stuff (which only work for small scale academics problems). You can do simple parametric posteriors estimation or if all you have is infinite compute and don't even want to bother with anything "mathy", bootstrapping is the "scalable / easy" solution.

pavon3y ago

Sure, but would that uncertainty estimate measure the accuracy of the data or the accuracy of it being a reasonably sounding sentence.

1 more reply

ericlewis3y ago

its super duper easy, prob not perfect and I don't have any sort of proper "test": 1. I ask the model first if it seems like a question that benefits from an external answer 2. I talk to Wolfram alpha with some abstraction of the question 3. I wait for a response 4. I "incept" it into the final response, essentially a prompt that mixes in a context of sorts that contains the factual information.

you could cross check this stuff too with yet more models.

simonw3y ago

That's basically what the new Bing is. It's a large language model that can run searches, and then use what comes back from those searches to generate answers to questions.

Whether or not the information that comes back from those searches is reliable is a whole other question.

I would love to learn what the latest research is into "factual correctness" detection. Presumably there are teams out there trying to solve that one?

nerdponx3y ago

ChatGPT and You.com chat both claim to be able to provide references, but usually the URLs they provide are for completely unrelated topics, even if they are on convincing-looking domains (e.g. Arxiv or Sciencedirect, but completely unrelated random-seeming papers).

behnamoh3y ago

AFAIK, Bing AI is not itself an LLM, but rather a wrapper around ChatGPT, which itself is based on GPT-3, which is based on the GPT architecture, which is (roughly speaking) half of a transformer architecture, which is based on encoder/decoder neural nets which are based on ...

1 more reply

mrtnmcc3y ago

Giving LLM the ability to query other services like Google should solve much of this. For example ChatGPT can be initialized to be told it can output commands like "QUERY_GOOGLE:What is the current time?" and get Google's response, which it can incorporate. You can actually do this yourself and prove it works by performing the Google search for ChatGPT.

renewiltord3y ago

You don't have to use ChatGPT. There are other styles of AIs that use LLMs like https://www.perplexity.ai/

Personally, I use ChatGPT (the paid version) and Copilot every day and find them awesome enhancers.

wwwpatdelcom3y ago

Deepmind is an LLM with a fact verifier attached, though the fact verifier is actually a ranked list of code compile times. Obviously this is a narrow subset of specific problems, but one could expand that library of problems over time.

qualudeheart3y ago

Are you talking about Alphacode? That seems like the only Deepmind project similar to what you describe.

csours3y ago

I'm curious about falsifiable models.

opisthenar843y ago

I imagine OpenAI is probably collecting a massive dataset of "false" responses (from the general public's use of ChatGPT and Bing) and fine-tuning GPT-3.5 with it.

The rich keep getting richer.

kelseyfrog3y ago· 22 in thread

> All suggestions are welcome.

Monetize it!

Evil answer: Partner with an advertiser and sell https://api.opencagedata.com/geocode/v1/json as an ad space. This may be the first opportunity for an application/json-encoded advertisement.

Nice answer: Partner with an actual phone lookup platform and respond with a 301 Moved Permanently at the endpoint.

fwlr3y ago

Another suggestion: put something like libphonenumber’s isPossibleNumber or isValidNumber on your server, on error run the query string through it. If it says that looks like a phone number, relay this information in the error response. A field in the JSON response like “info: Your query parameter looks like a phone number. If you are trying to get the geographical location of a phone from the phone number, please be aware this is not possible. [link to blogpost]” would hopefully jump out at people, particularly if you ask them to include the error response in their support request.

It’s an unprincipled hack, a bizarre dependency to add to your project, it probably feels like admitting defeat to the all powerful AI… but it does 90%-solve the problem.

Nathanba3y ago

yeah I think maybe the way to solve this is to add some kind of API documentation that explicitly mentions that phone lookups aren't possible. ChatGPT will parse it eventually...

rosywoozlechan3y ago

there's no "actual phone lookup platform" you can't get a person's location by knowing their phone number, that's a huge privacy violation. You can get the location of your own phone via icloud or google's system for android. You could also install an app on your phone to track your phone's location. You cannot find people based on knowing their phone number, that would be a serious safety issue for you know people trying to not, for example, get murdered by their ex-boyfriends.

afpx3y ago

There’s no public one. But, data brokers have this info, and share it. Internally, a lot of companies can do this. So, it’s not far-fetched.

2 more replies

vgeek3y ago

That is widely known about and freely available to the public, maybe. There are multiple options available for pseudo-LE types with relatively lax access prerequisites.

https://www.robertxiao.ca/hacking/locationsmart/ is an example of one provider's public demo (requiring only a phone number) being used to provide non-consensual location data.

1 more reply

Hello713y ago

it's been reported numerous times that you can buy real-time cell phone location data: https://news.ycombinator.com/item?id=17081684, https://news.ycombinator.com/item?id=20506624, https://news.ycombinator.com/item?id=32143256. you might need a little more info than just a phone number, but (allegedly) not that much more.

guelo3y ago

Yea, I think this "want" is pointing to the massive tsunami of spammer/scammer script kiddies that ChatGPT is enabling.

rrrrrrrrrrrryan3y ago

Reverse phone number lookup usually refers to finding a residential/business address tied to a phone number (historically a landline phone, but cell phones are also owned by people who have addresses), not the literal GPS location of the phone.

I think white pages are still a thing, no?

For the young 'uns - the white pages were part of the physical phone book in every city. You got a new phone book delivered to your doorstep each year for free. Yellow pages listed the phone numbers of every business, white pages listed the phone numbers of the residents.

The crazy part is: almost everyone added their numbers voluntarily to the white pages, because you wanted people to be able to easily find and reach you.

2 more replies

msla3y ago

Don't tell Whitepages

https://www.whitepages.com/reverse-phone

> Whitepages free reverse phone lookup service allows you to enter a phone number and quickly find out who called you. Find the phone owner's full name, address, and more.

[snip]

> Anyone can do a reverse lookup to identify cell phone, residential, and business numbers for free.

That, or you could get a normal white pages and process it using some sort of data processing tool... nah, that's science fiction.

1 more reply

fijiaarone3y ago

Yeah only the government can do that.

throwaway294953y ago

What about phone numbers corresponding to a specific location?

kmoser3y ago

Obvious suggestion: don't keep writing blog posts that mention your company name and the phrase "phone lookup service," which ChatGPT may get trained on in the future.

onetokeoverthe3y ago

So...self censor and correctspeak in order to please the so called art of fish all intel.

insane_dreamer3y ago

> actual phone lookup platform

uh, you mean stalker / scammer platform? This would be a major privacy violation.

icedchai3y ago

That ship sailed a long time ago. Any major search engine will provide this service. I entered both of my primary phone numbers and my name (and location) was in the first hit.

1 more reply

bdcravens3y ago

Twilio's API has this functionality. I've mostly used it to identify scammers using VOIP or phone numbers I don't recognize - it usually returns nothing but network info, but sometimes it'll return the account owner's name if it's a cellular (and landline maybe)

https://www.twilio.com/docs/lookup

adamwk3y ago

Maybe they gave a simplified explanation of their service, but if all they do is parse the country code of a phone number to return the geocoordinates for the center of that country then maybe just deprecate phone number inputs. I can’t think of why that’d be actual useful (a function that accepts a phone number country code and returns the center of the country’s geocoordinates) but if they have customers who use it direct them to input the country code directly

squeaky-clean3y ago

They don't do anything with phone numbers. You can give them lat/lng coordinates and get the address, or an address and get the coordinates.

So "7 Carmine St, New York, NY 10014" will return "(40.7305290, -74.0020706)" and vice versa.

There are youtube tutorials claiming you can do phone lookups using their service. What these youtube tutorials really do is use some other library to determine the country name from the phone number. Then they call the OpenCage geolocation API with the country name as the address input.

fwlr3y ago

My understanding is that the original issue was the YouTube tutorials used some other service to convert the country code of a phone number into a string of the country’s name and submitted just the country’s name, getting back a valid but useless geolocation. This new problem with ChatGPT is that it just writes code that submits a phone number to an api that expects a latitude and longitude and it explodes right away. I don’t think at any point the api had a call that accepts a phone number.

anonymouse0083y ago

Is ChatGPT so advanced that it just predicted the future? The thought experiment with this is trippy.

tnzk3y ago

It just predicts what a statistically "normal" person is likely to say right now, not in the future. The article even mentions there's an YouTube video that explains how to use this (non existent) feature already.

kelseyfrog3y ago

ChatGPT as a hyperstitious agent is the worst possible future and I'm here for it.

CactusOnFire3y ago· 15 in thread

Because ChatGPT is so new, we are in this weird period where people haven't learned that is just as incorrect as the rest of us.

I am hoping that in a year from now people will be more skeptical of what they hear from conversational AI. But perhaps that is optimistic of me.

Xylakant3y ago

> Because ChatGPT is so new, we are in this weird period where people haven't learned that is just as incorrect as the rest of us.

It’s worse than that. It’s wrong, you cannot correct it and it makes up supporting citations on the fly. Very few humans behave like that.

TehCorwiz3y ago

I can think of more than a few that regularly appear on TV.

2 more replies

gyudin3y ago

You've described pretty much every politician or any doctor that posses outdated information

https://www.economist.com/science-and-technology/2023/02/22/...

renewiltord3y ago

I think very many humans behave like that, actually. A recent example is people claiming that Flint, MI still has leaded water.

But in the past, HN users "corroborated" that Apple is spying on them etc. Fabrication is well and alive among us.

2 more replies

austinshea3y ago

It’s not incorrect like the rest of us. It’s incorrect in a very different way.

Providing detailed information on the usage of a service that has never existed is a brand new kind of incorrect that is carelessly causing the rest of us grief.

fijiaarone3y ago

Every technology devolves to TV. The fact that you have to not only read, but write to interact with ChatGPT means 99.99% of people will not use it.

I trust Alexa & Siri completely though.

1 more reply

none_to_remain3y ago

Humans are capable of not bullshitting

ChatGPT can only bullshit

avgDev3y ago

It is quite interesting really. I took AI in school but I have not dived deep at all in ChatGPT but isn't chatGpt just learning from the internet?

Could someone push "wrong" opinion heavily online to sway the opinion of AI?

I can only imagine a bot that learned from 4chan.

ChatGTP3y ago

Meet gpt-4chan https://huggingface.co/ykilcher/gpt-4chan

Dreams can come true…

ravenstine3y ago

AI will never be totally correct. If it ever is, then we've found God.

GulpGulp3y ago

I think some of this will take care of itself with attrition. People who lack the knowledge to fact check on the fly will give up after repeatedly getting wrong answers.

ChatGTP3y ago

I’m also worried there’s so potential money involved now that it’s never going away.

Even if it’s wrong, dangerous, misleading, fundamentally flawed as a concept whatever. Big tech and money will find ways to keep putting it in front of us.

quantiq3y ago

I see a lot of parallels here to crypto and NFTs where people start inventing use cases for technologies that fundamentally haven’t demonstrated business value, and pray that one day business value will show up out of nowhere.

annoyingnoob3y ago

> just as incorrect as the rest of us

Even worse because it has no clue when it might be completely wrong and yet it will be confident in its answer.

DoktorDelta3y ago

That might be the most human thing it's ever done

1 more reply

freyfogleOP3y ago· 10 in thread

ChatGPT very convincingly recommends us for a service we don't provide.

Dozens of people are signing up to our site every day, then getting frustrated when "it doesn't work".

Please do NOT trust the nonsense ChatGPT spits out.

seedless-sensat3y ago

A new market opportunity for your company?

input_sh3y ago

> This is not a service we provide. It is not a service we have ever provided, nor a service we have any plans to provide. Indeed, it is a not a service we are technically capable of providing.

archagon3y ago

Heh: an unforeseen future where instead of making the AI more reliable, we instead change reality to accommodate its mistakes.

theWreckluse3y ago

> It is not a service we have ever provided, nor a service we have any plans to provide. Indeed, it is a not a service we are technically capable of providing.

anaganisk3y ago

So, based on the BS these LLMs spout and companies start pivoting. The govts should start writing laws?

2 more replies

tnzk3y ago

No, as the article mentions, there already seem to be bunch of posts and videos that claim one can use this feature. GPT just has been trained with them, not invented anything themselves.

If this was new market opportunity, just publishing a falsehood would do the same job.

hackernewds3y ago

this seems like a game-changing opportunity actually. I'd be down to buy the domain

fire3y ago

have you been able to contact OpenAI about this? It sounds like they're actively adding load to your CS ops with this

SCLeo3y ago

I think the key thing is for the AI company to actually let the user know that this is a language model, and the information it spits out should not be trusted. Obviously, Microsoft is not going to do that as they are trying to market the new bing as a information search engine.

1 more reply

hackernewds3y ago

what are they going to do? add custom logic? where does it stop?

the malady is that LLMs cannot do operational adhoc changes such as these kinds of errors at scale

3 more replies

VectorLock3y ago· 8 in thread

This is the biggest problem I encounter when trying to use ChatGPT on a daily basis for computer programming tasks. It "hallucinates" plausible looking code that never existed or would never work, especially confusing whats in one module or API for something in another. This is where ChatGPT breaks when pushed a bit further than "make customized StackOverflow snippets."

For example I asked ChatGPT to show me how to use an AWS SDK "waiter" to wait on a notification on an SNS topic. It showed me code that looked right, but was confusing functions in the SQS library for those that would do the thing with SNS (but SNS doesn't support what I wanted)

dizhn3y ago

It wrote me a python snippet while my question was about a go library. When prompted it's a go library it wrote similar looking code in go with the same function names that don't actually exist in the library. It's like google search past 2010. It's trying to please everybody too much rather than saying I can't do that. Though when asked to write a new original Koran verse, it does refuse to do that. :)

krsdcbl3y ago

I guess the issue at core is that it doesn't, and can't know if it can or can't do it. That's not what it's designed to do, even if it does quite well at seeming so.

shagie3y ago

Have you tried using the code-davinci-002 model instead of ChatGPT?

For example - https://platform.openai.com/playground/p/default-translate-c...

The codex models are intended for doing work with code rather than language and may give better results in that context. https://help.openai.com/en/articles/6195637-getting-started-...

IncRnd3y ago

It does indeed sound problematic to use ChatGPT daily for computer programming tasks. ChatGPT is not a snippets manager but text completion.

It may be more helpful to look for better answers on Amazon's help pages for SNS and AWS SDK.

VectorLock3y ago

I know the answer. SNS can't do that. But ChatGPT hallucinated it could. Just like the original post about a capability their API doesn't provide.

wvenable3y ago

The problem is compounded by the fact that sometimes it produces really good results. One task, good results. Next task, totally hallucinated result.

fijiaarone3y ago

That’s what my boss said about me on my last performance review.

root_axis3y ago

Yeah, it quickly breaks down with fine minutiae like the precise API signatures for a random library. It doesn't help that API changes are inevitable while the model retains a memory of all the now outdated documentation from its training.

coldtea3y ago· 8 in thread

ChatGPT doesn't "recommended" anything. It just recombines text based on statistical inferences that appear like a recommendation.

It could just as well state that humans have 3 legs depending on its training set and/or time of day. In fact it has said similar BS.

circuit103y ago

> ChatGPT doesn't "recommended" anything. It just recombines text based on statistical inferences that appear like a recommendation.

I think that’s a bit pedantic and not very helpful… I’m not typing this comment, my brain is just sending signals to my hands which causes them into input data into a device that displays pixels that look like a comment

coldtea3y ago

>I think that’s a bit pedantic and not very helpful… I’m not typing this comment, my brain is just sending signals to my hands which causes them into input data into a device that displays pixels that look like a comment

Well, if you're just fed a corpus, with no real-time first-person strem of experience that you control, no feedback mechanism, no higher level facilities, and you're not a member of a species with a proven track record of state-of-the-art in nature semantic understanding, then maybe...

barking_biscuit3y ago

Does YouTube recommend you videos to watch? Does Amazon recommend you products to buy? Or do they just recombine text based on statistical inferences that appear like a recommendation?

coldtea3y ago

Obviously they "just recombine text based on statistical inferences that appear like a recommendation".

And even that, they do badly.

npteljes3y ago

>ChatGPT doesn't "recommended"

I mean, you could say that about a person too, as you don't know how much that they are saying is bullshit.

For one, you are technically correct about ChatGPT not recommending. It cannot perform such action. On the other hand, from the POV of the questioner, it's hard not to feel being recommended something when you ask "What do you recommend" and it says "I recommend that...". You are, for some intents and purposes, being recommended something at that point.

mort963y ago

What would you call it instead?

qwertox3y ago

"Makes stuff up." And it's us, the users, who have to realize this. I mean, I wouldn't blame OpenAI for this, at least not at this point, and the company will have to live with it, look how it can turn it into something useful instead, since there's no one to complain to.

3 more replies

coldtea3y ago

A glorified Markov chain generator.

Now, humans could very well also be statistical inference machines. But they have way more tricks up their semantic-level understanding sleeves than ChatGPT circa 2023.

1 more reply

singlow3y ago· 8 in thread

Its not like ChatGPT made this up. There were pre-existing YouTube tutorials and python scripts available that used OpenCage an purported to do this. OpenCage even blogged about this problem almost a year ago[1].

Honestly it looks more like OpenCage is trying to rehash the same issue for more clicks by spinning it off the hugely popular ChatGPT keywords. Wouldn't be too surprised if they created the original python utilities themselves just to get some publicity by denouncing them.

1. https://blog.opencagedata.com/post/we-can-not-convert-a-phon...

freyfogleOP3y ago

Hi, Ed from OpenCage here, author of the post.

We do have python tutorials and SDKs showing how to use our service for ... geocoding, the actual service we provide.

I wrote the post mainly to have a page I can point people to when they ask why "it isn't working". Rather than take the user through a tour of past posts I need something simple they will hopefully read. But fair point, I can add a link to last year's post about the erronious youtube tutorials as well.

What I think you can't appeciate is the difference of scale. A faulty youtube video drives a few users. In the last weeks ChatGPT is sending us several orders of magnitude more frustrated sign-ups.

singlow3y ago

I get frustrated at the number of things ChatGPT gets blamed for that aren't its fault. It is completely understandable that if there are repos out on GitHub like the one for Phomber[1] thant ChatGPT would find that code and have no idea that it was phoney. Suggesting that ChatGPT just made this up out of thin air when you know it didn't is not very responsible.

1. https://github.com/s41r4j/phomber

2 more replies

ceejayoz3y ago

That seems like a pretty nasty assertion to bandy around with zero evidence.

singlow3y ago

I cannot think of any other reason why the new blog post wouldn't have mentioned the obvious connection to the earlier issues that they had. They want to make it seem like ChatGPT invented this use case but they know that the sample code that ChatGTP learned from was mentioned in their previous blog post.

1 more reply

vlunkr3y ago

There's also no clear motive. They want to attract users to a fake feature their free tier?

gus_massa3y ago

That explains why ChatGPT is confused.

It may be an old problem, but I guess users are more use to a random YouTube video with wrong information. But the computer is always right so ChatGPT is always right, so users may be more annoyed to discover that the recommendation is wrong and blame them instead of ChatGPT.

fwlr3y ago

Devs making baby’s first mobile app add “request location information” permissions, the devices start giving them the phone’s GPS information in the form of lat/lon pairs, and those devs naturally look for a service to make that data useful. What they want is “reverse geolocation”, i.e. take a lat/lon pair and return information that makes sense to a human (country, state, nearby street address, etc).

This is a service that OpenCage provides, and for whatever reason OpenCage happens to be one of the popular services for this use case. (Maybe it’s because you get the text description of location back right away without having to do a round trip through a heavyweight on-screen map, maybe their free tier allows more requests than most, maybe their api is easier to use, maybe they are lucky or skilled with SEO and their tutorial happens to be the first result for some common phrases, who knows.)

So there’s this process that starts with a search for “convert phone location to address”, often involves the OpenCage api, and ends with a happy developer getting the information they wanted. Various algorithms pick up on the existence and repeated traversal of this happy path.

In another part of the internet, code tutorial content farms notice a demand for determining an incoming call’s location from the number that’s calling. They search for things like “convert phone number to location” and “convert phone number to address”. Some of these searches end up falling into the nearby well-trodden path of “convert phone location to address” and the content farmer is presented with the OpenCage api. They mess around with the api for a bit and find they can start from a phone number and get a successful api call that returns a lat/lon pair. A successful api call that returns legitimate-looking lat/lon data is all they need to make a video, they make it and post it. Higher-quality, more scrupulous code tutorials attempt to answer this same demand but find it’s not possible, so those tutorials don’t get made, leaving the less scrupulous ones that stop with a successful-looking api call to flourish in this space. The tutorial is doing well, so the content farms endlessly recycle it into blogspam.

As a result, OpenCage starts getting weird usage patterns, tracks them down, finds the source is these tutorials, and makes a post about it.

Some time later, ChatGPT is released. People are astounded with its ability to write code and start using it for this purpose. Naturally, some of those people have the same demand as the previous generation of devs who stumbled onto the unscrupulous code tutorials. Because of the blogspam, ChatGPT’s training data includes many variations on the tutorial, and just as naturally it ends up reproducing that tutorial when asked - except ChatGPT’s magic kicks in and instead of including (what its embeddings see as) some weird unrelated area-code-to-string nonsense from the tutorial, it just bullshits some plausible-sounding data plumbing code instead. Unfortunately, because the tutorial never worked in the first place, that weird hacky irrelevant bit that ChatGPT ignored happened to be the secret sauce that makes the whole thing superficially appear to work.

As a result, OpenCage starts getting weird usage patterns, tracks them down, finds the source is ChatGPT, and makes a post about it.

In deference to Hacker News’ policy of keeping comments pleasant, I will elide the analysis of the process that leads to comments accusing OpenCage of nefariously engineering the whole thing for attention.

marvy3y ago

Thanks for the above. (nice self-restraint in the last paragraph.) Things almost make sense now. Except one problem ... this implies that there are software developers who think to themselves "given a cell phone number, how can I get the phone's location?".

And it further implies that these people don't immediately follow that thought with: "That's surely impossible, since it would be a privacy nightmare if literally everyone in the world could track everyone else in the world's every move".

Or perhaps with this alternative thought, which would lead to the same conclusion: "let's not worry about privacy, how would this even work? Does every phone company in the world pro-actively send every customer's location data to OpenCage, just in case someone queries it? Or does OpenCage wait until it gets a query, and only then query the cell phone company 'just-in-time'? Both of these sound like a lot of work for each phone company to support ... what's the incentive?"

Honestly, I'm a bit surprised that the OpenCage blog post is so calm about this, instead of just yelling incoherently "why WHY why would anyone think like this?!?"

mort963y ago· 7 in thread

That's quite the predicament. I hope OpenAI will listen, to this and to anyone else in a similar situation. I'm reminded of the cases of ChatGPT recommends random people's personal phone numbers for various services.

But yeah, don't trust ChatGPT for anything. Just earlier today I tried my darnedest to convince it that 2 pounds of feathers doesn't weigh the same as 1 pound of bricks, and it just would not listen, presumably because it just regurgitated stuff related to the common "1 pound of feathers and 1 pound of bricks" question.

By the way, the last paragraph has some typos:

> I wrote this post to have a place to send our new ChatGPT users when they ask why it isn’t work, but hopefully also it serves as a warning to othrs - you absolutely can not trust the output of ChatGPT to be truthful,

geoelectric3y ago

I thought for sure that must be glitching, so just tried on my ChatGPT Plus account with a fresh thread, but nope:

> does 2 pounds of feathers weigh more than 1 pound of bricks?

> No, two pounds of feathers weigh the same as one pound of bricks. The weight of an object is determined by its mass, and one pound of any object weighs the same as one pound of any other object. However, feathers are less dense than bricks, so two pounds of feathers will take up more space or have a larger volume than one pound of bricks.

Huh. It's interesting that some of these "wrong paths" are quite so repeatable. Wonder if that means there's also a way to band-aid them when found, the way a human might be corrected for a misconception.

jxy3y ago

This is hilarious. ChatGPT even gave me a more bizarre example.

> does 2 pounds of bricks weigh more than 1 pound of bricks?

> No, 2 pounds of bricks do not weigh less or more than 1 pound of bricks. 2 pounds of bricks and 1 pound of bricks refer to the same unit of weight, which is a pound. Therefore, they weigh the same, which is one pound. The difference between them is only the quantity, not the weight.

> It's similar to saying that two one-dollar bills do not have more value than one five-dollar bill. Even though you have two bills in one case and one bill in the other case, the total value is the same.

4 more replies

insane_dreamer3y ago

> don't trust ChatGPT for anything

Agreed. But then it begs the question: what purpose does ChatGPT serve (other than for entertainment purposes or cheating on your HS/college exam)? If you have to verify its information by other means, then you're not really saving much effort.

shagie3y ago

It works really well for translating one "language" to another "language".

Give it some structured data and ask it to summarize it (e.g. hourly weather data and it gives a better summarization than a template based one).

Give it HN titles and the categories and it does a passable zero shot tagging of them ( https://news.ycombinator.com/item?id=34156626 ).

I'm toying around with making a "guided bedtime story generator". A friend of mine uses it to create a "day in the life of a dinosaur" stories for a child (a different story each day!)

The key is to play to its strengths rather than testing its bounds and complaining that they break in weird ways when they will inevitably break in weird ways.

visarga3y ago

> If you have to verify its information by other means, then you're not really saving much effort.

Just like any piece of code we write. We have to test, debug, verify and it still might have errors after that. And in scientific papers the conclusions are often contradicted by other papers.

The correct way to use it is to set up a verification mechanism. Fact checking, code tests, even ensembling predictions to see if they are consistent might help. In some cases we can set up a game and use the game winner as indication of correctness (like AlphaGo).

But sometimes only running a real life experiment will suffice. That's why human scientists need experiments - because humans are just like LLMs, but with external verification as part of a game (of life).

catach3y ago

Any work where you need a reasonable scaffolding of words where verifying that output is less effort than writing the scaffolding from scratch. Plenty of fact-light writing needs be done.

worldsayshi3y ago

This was my initial thought as well. But I've noticed that my brain has started to find tasks that it would be quite useful for. Too bad it's almost always seem to be at capacity when I think of those cases. Guess I will have to pay up to figure out if it's actually worth it.

jefftk3y ago· 7 in thread

This is not a service we provide. It is not a service we have ever provided, nor a service we have any plans to provide. Indeed, it is a not a service we are technically capable of providing.

I'm curious: why not? It seems like a lot of people would be interested in this if you could figure out how to provide it.

simonw3y ago

How would this work?

If a phone number is for a mobile phone then looking up the location doesn't make sense at all: mobile phones are mobile.

I guess you could try and crawl an index of business phone numbers and associate those with the listed address for businesses, but that's a completely different business from running a geocoder.

You could provide a bit of geographical information about the first three digits of a US phone number. I imagine that's not what users are actually looking for though.

jefftk3y ago

Phone numbers have geographic structure. For mobile phones it's just the area code, but for landlines there is also information in the exchange portion. For example, I grew up in Medford MA which is 781-39x-xxxx.

I expect there are also patterns in other countries?

5 more replies

iamflimflam13y ago

The service is possible:

If you are a mobile network operator.

Or, you can convince people to install something on their phone that sends you their location along with their phone number.

ceejayoz3y ago

How would you go about reliably providing the location of someone's mobile phone without being their cell phone carrier?

jraph3y ago

By partnering with said cell phone carriers.

But I hope it would be illegal.

1 more reply

insane_dreamer3y ago

> a lot of people would be interested in this

you mean like scammers and stalkers? (ok, and probably Meta)

cactusplant73743y ago

You mean if they could figure out how to illegally track millions of people?

ninjakeyboard3y ago· 6 in thread

ChatGPT gets the rules to the pokemon trading card game wrong. It will tell you you can use 4 energy a turn. Convincingly. Not sure how it hallucinates this. The rule is 1 per turn.

codetrotter3y ago

A few days ago I asked ChatGPT if “pannekake” and “kannepake” are anagrams of each other.

It correctly stated that they are, but when it went on to prove that this was the case, it generated a table of the frequencies of the individual letters in these two words, and the table looked like this.

    Letter | Frequency in | Frequency in
           | “pannekake”  | “kannepake”
    - - - - - - - - - - - - - - - - - - -
    a      | 2            | 2
    e      | 2            | 2
    k      | 2            | 2
    n      | 2            | 2
    p      | 2            | 2

This reminded me that yes indeed, AI just isn’t quite there yet. It got it right, but then it didn’t. It hallucinated the frequency count of the letter “p”, which occurs only once, not twice in each of those words.

int_19h3y ago

Anything that has to do with individual words doesn't work well, but as I understand, this is an artifact of the tokenization process. E.g. pannekake is internally 4 tokens: pan-ne-k-ake. And I don't think that knowing which tokens correspond to which letter sequences is a part of the training data, so it has to infer that.

DoktorDelta3y ago

Could it have been referencing Blastoise's Deluge ability? Jacob Van Wagner used it in the 2015 championship to use 4 water energy in one turn.

kaetemi3y ago

I just asked it, and it said you can attach 1 per turn. And then it continued something about using supporter cards to look for more energy cards, and trainer cards to switch them. (Which it also considers as using or playing those energy cards.) Not familiar with the actual rules, though. :)

ninjakeyboard3y ago

Ah I was using my friends server which has a slightly different model running - thanks. It's one of the divinci models I think? Don't know much - it's code oriented. So I guess it's not 'ChatGPT' but a GPT model he built a chat on.

bigmattystyles3y ago

Isn't it just garbage went in, got weighed as a more reliable source than it should have been and thus garbage came out. Good old GIGO... It's just here, ChatGpt, as much as I love it, is amazing at imparting the impression that its shit don't stink.

hayksaakian3y ago· 5 in thread

This marks the new age of "AI Optimization" where companies will strive to get their business featured into answers in ChatGPT.

The OP's example is Unwanted demand, but it clearly shows that ChatGPT can funnel potential customers towards a product or service.

impalallama3y ago

God I can just see a company using chatgpt to Astroterf huge amounts of data on the internet about their service to hopefully get that sludge feed back into their system and then become recommended. What a world.

bick_nyers3y ago

Isn't that just SEO in a nutshell though? Hopefully with more advancements in LLM's we can get more bullshit detection/discrimination against SEO.

fijiaarone3y ago

I can think of a good way to generate all that astroturf content.

return_to_monke3y ago

akira25013y ago

> This marks the new age of "AI Optimization"

Or it marks the beginning of the next "AI Winter."

> but it clearly shows that ChatGPT can funnel potential customers towards a product or service.

And the next logical step is "chatgpt keywords advertising." Which is right back where we started.

yieldcrv3y ago· 5 in thread

lol it recommended their api and gave python code for using it

but the real api doesnt give results that the user asked ChatGPT for

that is amusingly alarming

CabSauce3y ago

Not quite as alarming as these people most likely trying to stalk someone without their permission.

hk__23y ago

> Not quite as alarming as these people most likely trying to stalk someone without their permission.

It’s so common to want to know where does a incoming call come from that it’s built-in in iOS. It has nothing to do with stalking, just with guessing if who’s calling you is a scammer or a company trying to sell you stuff.

2 more replies

int_19h3y ago

The obvious follow-up is to create the non-existing API endpoint but hook it into GPT so that it can hallucinate a convincing address based on the phone number. Take GPT API key as input so that the caller is paying for this.

Bonus points for using ChatGPT to implement this end-to-end.

goguy3y ago

Our jobs are safe! For now...

fijiaarone3y ago

Until someone figures out that we are all just hallucinating completely wrong code.

129078352023y ago· 3 in thread

The biggest takeaway for me was that it was getting info from YouTube videos. Is it actually watching and learning from the videos or where links to GitHub just included in the comments?

mgraczyk3y ago

I think this is just an incorrect assumption on the part of the blog authors.

mtmail3y ago

Cofounder here. We traced it back to two youtube videos where a developer is coding a phone tracking solution. The map then shows the geographical center of India and it's claimed that was the correct location. Then other users starting putting the code on github, then forked, then people created libraries and now AI tools pick that up. We already tried contacting the youtube authors, we left comments on the github repositories for months. Now we have to learn how to deal with ChatGPT. We also have no idea why a youtube author would describe a completely non-working solution over 20 minutes.

Screenshots https://imgur.com/a/sNR87c7 You can see the OpenCage logo on the bottom right of the images. We wrote a separate blog post about that about a year ago, we felt today's blog post would be too long if we added those screenshots, too.

1 more reply

jdiff3y ago

Transcripts exist, and if there's multiple YouTube tutorials out there the odds are very good it also exists as a few dozen plain text articles.

ntonozzi3y ago· 2 in thread

Including the word 'phone' six times in a popular blog post is not going to help their predicament.

elicash3y ago

Wouldn't they want this post to be at the top when people search 'phone' and 'open cage data'? Seems like SEO towards correcting this is only helpful. And maybe when GPT updates data, this post gets pulled in, too. The more popular, the better, I'd guess.

KomoD3y ago

Not gonna hurt either, ChatGPT data is not up to date

IshKebab3y ago· 2 in thread

Well for a start you could make it more obvious what your service does do. I don't know what "geocoding" is. Converting things to/from "text" is meaningless. You have to get all the way down ... way down, past authentication to the details of the `q` query parameter before it actually tells you.

At the top you should have a diagram like this:

Lat, lon <- opencage -> address

With a few examples underneath.

mtmail3y ago

"Past authentication", so you're looking at the https://opencagedata.com/api page. Most people go to the homepage first. Great feedback, we should make it clearer on that page and add examples earlier. Thanks!

IshKebab3y ago

Ah yes - I clicked on "Makers of the OpenCage Geocoding API" on your blog post which I assumed would go to your homepage (on mobile so it's a bit harder to tell).

Your actual homepage is indeed much better.

weird-eye-issue3y ago· 2 in thread

Related: One reason I just started using Rainforest API is because Github Copilot recommended it.

But also last night I tried for 30 minutes to get it to write me some fairly simple HTML parsing code. The tricky part was I couldn't use DOMParser since it was running on Cloudflare Workers and it could never produce any working implementation using HTMLRewriter or regex no matter how many examples I gave it

ryankrage773y ago

Are you aware of the pitfalls of parsing HTML with regex?

weird-eye-issue3y ago

Yes, but it started to try to use regex so I thought I'd see if it could at least be successful and it wasn't. Despite super simple HTML.

Anyways, I wrote a solution using HTMLRewriter in 10 minutes...

massysett3y ago· 2 in thread

I'm an attorney. I've typed legal questions into ChatGPT and it has spit out answers that are grievously, 100%, libelously wrong. It has named individuals and said they committed crimes, when it is unquestionable they did no such thing.

I'm waiting for people to start calling me to ask questions about something ChatGPT said, and I'll tell them it's wrong. Then they'll start arguing with me and saying if ChatGPT said it, it must be right, and I must be wrong. And then I'll need to waste time proving that this idiotic chat bot that is spewing out garbage is, in fact, spewing out garbage.

flangola73y ago

You're trying to use a language model as an information reference. A translator can explain what a diplomat is saying but they can't perform their whole job.

massysett3y ago

? ChatGPT did not simply explain what someone else was saying. It created something completely new and completely false.

1 more reply

gumballindie3y ago· 1 in thread

ChatGPT is hilariously buggy - I asked “it” how to use an open source library i made. The output was wrong ranging from a broken github url to outright broken or nonexistent code. I suspect it may even have used private code from other libs - couldnt find some of the output it generated anywhere public.

kaetemi3y ago

It's just making up your library. Ask it to write some documentation, don't be specific yet, then drop a whole header or piece of code from your project into the chat.

MagicMoonlight3y ago

But guys we totally need to delete all of our search indexes and replace them with this instead

kissgyorgy3y ago

I tried to ask ChatGPT about implementing an SSH SFTP subsystem with github.com/gliderlabs/ssh and every single answer it made up some non-existing API. I did not found those functions anywhere near the codebase nor on the internet, so I don't even understand how a "probabilistic model" can suggest something that have 0 chance of appearing anywhere.

ggm3y ago

I don't normally go to lawyer, but I am wondering if this is doing material harm to your brand value, which is a declared asset of the company. I think its arguable ChatGPT has caused you financial risk.

It's unconscionable. If there was no robot in the loop here, and it was people mis-transcribing youtube to compile e.g. Google search optimisation we'd call it fraud.

fabianfabian3y ago

ChatGPT does not know how to be correct, it only knows how to sound correct.

A better name for now would be PlausibleGPT.

sam0x173y ago

You could probably set up a rudimentary version of the service this influx of users is looking for in the time it took to write this article. Just grab the lat/long of each area code in the US off of wikipedia and there you go at least it's something. No it's not current position or anything like that but IP geolocation is just as imperfect when it's not based on triangulation. Case in point google has plenty of IPs that geolocate to mountain view but point to machines that are in Asia.

JohnFen3y ago

> All suggestions are welcome.

They have to get an API key from you. What about a large warning at the start of that process telling them that this isn't a service you provide?

BigBalli3y ago

Just redirect to here http://bigballi.com/Phone-Number-Lookup

adolph3y ago

If you have to tell potential customers you don’t do something, maybe you should just do it instead.

ChatGPT as business line lead generator—is there anything it can’t do?

99_003y ago

I remember a time when "I saw it on the internet" was a punchline for a joke about someone who's gullible or misinformed.

1970-01-013y ago

Fast, creative, and wrong isn't a trio. This is more evidence of ChatGPT being evolutionary and not revolutionary.

Sloppy3y ago

As a data scientist who has created AI applications and built many models over the last 10 years, I can say beware of ChatGPT. AI derived knowledge should be used only by those who understand its limits.

One of the simplest AIs is a recommender. We put guardrails on using its predictions inside ecommerce apps by limiting what it learns from (purchases for instance) and limiting what it is used to predict (purchases). When Facebook uses a recommender it learns from time-on-site (a value to FB but not necessarily to the user and a complex behavior that can be comprised of may non-beneficial sub-behaviors) and use it to recommend things that lead to more time-on-site. This application is dangerously devoid of guardrails as so much recent evidence has shown.

Now we have a text generating AI that has been trained from a great swath of human knowledge. That means the teachings of Gandhi as well Hitler, etc. What do you expect it to "know" as truth? Generative AI that is used to generate thoughts from this training corpus MUST have contradictory and downright evil ideas since it has no way to judge between ideas it learns from.

Generative AI in this form can be nothing but psychopathic until guardrails can be devised to limit its psychopathic responses OR the corpus it learns from can be labeled in a way to flag what is "bad", if we can even agree on what that means.

Psychopaths can be useful if they are knowledgeable but beware, you are talking to a psychopath in ChatGPT.

ano888883y ago

seeing the amount of effort people put into to hack/optimize Page Rank SEO, we will see lots of promt manipulation by all businesses if chatGPT becomes the defacto search. Preventing system gaming is going to be 1000X more difficult for LLM which is kind of a black box

dakial13y ago

Soon we are going to have a AIrobots.txt

b800h3y ago

Is this not defamation, at least in some jurisdictions?

eternalban3y ago

If this business suffers financial or reputational damage because of ChatGPT's misinformation, should OpenAI be liable?

ninjakeyboard3y ago

It hallucinates that you can use 4 energy per turn in Pokemon TCG and confidently tells you so. No idea where that would come from.

j / k navigate · click thread line to collapse

273 comments

181 comments · 34 top-level

crazygringo3y ago· 34 in thread

I'm curious -- does anyone know of ML directions that could add any kind of factual confidence level to ChatGPT and similar?

We all know now that ChatGPT is just autocomplete on steroids. It produces plausibly convincing patterns of speech.

But from the way it's built and trained, it's not like there's even any kind of factual confidence level you could threshold, or anything. The concept of factuality doesn't exist in the model at all.

irrational3y ago

amscanne3y ago

That was the Bing preview, which is supposed to be an actual information product.

aftbit3y ago

I was entirely unable to convince it that England has a King now.

nl3y ago

> does anyone know of ML directions that could add any kind of factual confidence level to ChatGPT and similar?

Yes. It's a very active area of research. For example:

Discovering Latent Knowledge in Language Models Without Supervision (https://arxiv.org/abs/2212.03827) shows an unsupervised approach for probing a LLM to discover things it thinks are facts

Locating and Editing Factual Associations in GPT (https://arxiv.org/pdf/2202.05262.pdf) shows an approach to editing a LLM to edit facts.

Language Models as Knowledge Bases? (https://aclanthology.org/D19-1250.pdf) is some slightly older work exploring how well LLMs store factual information itself.

supriyo-biswas3y ago

Replying to this comment to find it later. (Is there a good way to bookmark comments on HN?)

1 more reply

crazygringo3y ago

Thank you so much! Those are exactly the types of links I'm curious about.

soerxpso3y ago

visarga3y ago

There was a model that could set up a simulation to estimate the answer for you. So it won't use pure language, but it only works in a few cases.

alfalfasprout3y ago

Yann Lecun has posted a lot recently about this but basically LLMs are a "useful offramp on the road to AGI".

cypress663y ago

There's nothing "by definition" that says so.

In fact many propose that when you train an LLM, in order to be able to predict the next word with enough accuracy, it must internally build a world model.

Yann Lecun is very salty about chatgpt, I wouldn't take his word seriously.

3 more replies

BoorishBears3y ago

There's research being done on this: https://arxiv.org/abs/2302.04761

At its core using an LM alone to solve factual problems seems silly: It's not unlike asking Dall-E to draw DOT compliant road signs.

I've gone at length at how unfortunate it would be if LMs start to get a bad rap because they're being shoehorned into being "Ask Jeeves 2.0" when they could be so much more.

crazygringo3y ago

> It's not unlike asking Dall-E to draw DOT compliant road signs.

I love that. That's going to be my new explanation for people around ChatGPT.

snowstormsun3y ago

I think "Explainable AI" is a related research direction, but perhaps not popular for language models.

shawntan3y ago

I think part of the issue is what level of explanation is satisfactory. We can explain how every linear transformation computes its output, but the sum of it is in many ways more than its parts.

behnamoh3y ago

Impossible to explain the inner workings of GPT-3 without having access to the model and its weights. Does anyone know if any methods exist for this?

2 more replies

sdrinf3y ago

Add to your prompt: "For every factual statement, assign a certainty float 0..1, where 0 means you're very uncertain, and 1 means you're absolutely certain it is true".

Edit: see https://twitter.com/sdrinf/status/1629084909422931969 for results

wildrhythms3y ago

I initialized with that prompt and it did not give me any 0..1 certainty values on any subsequent output to my queries.

swexbe3y ago

Or maybe it will just hallucinate this number too.

krainboltgreene3y ago

> We all know now that ChatGPT is just autocomplete on steroids

I promise you most people do not know this.

numeri3y ago

> The concept of factuality doesn't exist in the model at all.

[1] https://arxiv.org/abs/2212.03827

YeGoblynQueenne3y ago

So far, we got structure down pat (it's been a few years now, or quite a few, depending on how you see it) but meaning is nowhere to be seen.

Nevertheless, this is an interesting scientific result: one can have smooth, grammatically correct linguistic structure without meaning. Progress has been achieved (and no, this is not sarcasm).

juujian3y ago

mochomocha3y ago

pavon3y ago

Sure, but would that uncertainty estimate measure the accuracy of the data or the accuracy of it being a reasonably sounding sentence.

1 more reply

ericlewis3y ago

you could cross check this stuff too with yet more models.

simonw3y ago

That's basically what the new Bing is. It's a large language model that can run searches, and then use what comes back from those searches to generate answers to questions.

Whether or not the information that comes back from those searches is reliable is a whole other question.

I would love to learn what the latest research is into "factual correctness" detection. Presumably there are teams out there trying to solve that one?

nerdponx3y ago

behnamoh3y ago

1 more reply

mrtnmcc3y ago

renewiltord3y ago

You don't have to use ChatGPT. There are other styles of AIs that use LLMs like https://www.perplexity.ai/

Personally, I use ChatGPT (the paid version) and Copilot every day and find them awesome enhancers.

wwwpatdelcom3y ago

qualudeheart3y ago

Are you talking about Alphacode? That seems like the only Deepmind project similar to what you describe.

csours3y ago

I'm curious about falsifiable models.

opisthenar843y ago

I imagine OpenAI is probably collecting a massive dataset of "false" responses (from the general public's use of ChatGPT and Bing) and fine-tuning GPT-3.5 with it.

The rich keep getting richer.

kelseyfrog3y ago· 22 in thread

> All suggestions are welcome.

Monetize it!

Evil answer: Partner with an advertiser and sell https://api.opencagedata.com/geocode/v1/json as an ad space. This may be the first opportunity for an application/json-encoded advertisement.

Nice answer: Partner with an actual phone lookup platform and respond with a 301 Moved Permanently at the endpoint.

fwlr3y ago

It’s an unprincipled hack, a bizarre dependency to add to your project, it probably feels like admitting defeat to the all powerful AI… but it does 90%-solve the problem.

Nathanba3y ago

yeah I think maybe the way to solve this is to add some kind of API documentation that explicitly mentions that phone lookups aren't possible. ChatGPT will parse it eventually...

rosywoozlechan3y ago

afpx3y ago

There’s no public one. But, data brokers have this info, and share it. Internally, a lot of companies can do this. So, it’s not far-fetched.

2 more replies

vgeek3y ago

That is widely known about and freely available to the public, maybe. There are multiple options available for pseudo-LE types with relatively lax access prerequisites.

https://www.robertxiao.ca/hacking/locationsmart/ is an example of one provider's public demo (requiring only a phone number) being used to provide non-consensual location data.

1 more reply

Hello713y ago

guelo3y ago

Yea, I think this "want" is pointing to the massive tsunami of spammer/scammer script kiddies that ChatGPT is enabling.

rrrrrrrrrrrryan3y ago

I think white pages are still a thing, no?

The crazy part is: almost everyone added their numbers voluntarily to the white pages, because you wanted people to be able to easily find and reach you.

2 more replies

msla3y ago

Don't tell Whitepages

https://www.whitepages.com/reverse-phone

> Whitepages free reverse phone lookup service allows you to enter a phone number and quickly find out who called you. Find the phone owner's full name, address, and more.

[snip]

> Anyone can do a reverse lookup to identify cell phone, residential, and business numbers for free.

That, or you could get a normal white pages and process it using some sort of data processing tool... nah, that's science fiction.

1 more reply

fijiaarone3y ago

Yeah only the government can do that.

throwaway294953y ago

What about phone numbers corresponding to a specific location?

kmoser3y ago

Obvious suggestion: don't keep writing blog posts that mention your company name and the phrase "phone lookup service," which ChatGPT may get trained on in the future.

onetokeoverthe3y ago

So...self censor and correctspeak in order to please the so called art of fish all intel.

insane_dreamer3y ago

> actual phone lookup platform

uh, you mean stalker / scammer platform? This would be a major privacy violation.

icedchai3y ago

That ship sailed a long time ago. Any major search engine will provide this service. I entered both of my primary phone numbers and my name (and location) was in the first hit.

1 more reply

bdcravens3y ago

https://www.twilio.com/docs/lookup

adamwk3y ago

squeaky-clean3y ago

They don't do anything with phone numbers. You can give them lat/lng coordinates and get the address, or an address and get the coordinates.

So "7 Carmine St, New York, NY 10014" will return "(40.7305290, -74.0020706)" and vice versa.

fwlr3y ago

anonymouse0083y ago

Is ChatGPT so advanced that it just predicted the future? The thought experiment with this is trippy.

tnzk3y ago

kelseyfrog3y ago

ChatGPT as a hyperstitious agent is the worst possible future and I'm here for it.

CactusOnFire3y ago· 15 in thread

Because ChatGPT is so new, we are in this weird period where people haven't learned that is just as incorrect as the rest of us.

I am hoping that in a year from now people will be more skeptical of what they hear from conversational AI. But perhaps that is optimistic of me.

Xylakant3y ago

> Because ChatGPT is so new, we are in this weird period where people haven't learned that is just as incorrect as the rest of us.

It’s worse than that. It’s wrong, you cannot correct it and it makes up supporting citations on the fly. Very few humans behave like that.

TehCorwiz3y ago

I can think of more than a few that regularly appear on TV.

2 more replies

gyudin3y ago

You've described pretty much every politician or any doctor that posses outdated information

https://www.economist.com/science-and-technology/2023/02/22/...

renewiltord3y ago

I think very many humans behave like that, actually. A recent example is people claiming that Flint, MI still has leaded water.

But in the past, HN users "corroborated" that Apple is spying on them etc. Fabrication is well and alive among us.

2 more replies

austinshea3y ago

It’s not incorrect like the rest of us. It’s incorrect in a very different way.

Providing detailed information on the usage of a service that has never existed is a brand new kind of incorrect that is carelessly causing the rest of us grief.

fijiaarone3y ago

Every technology devolves to TV. The fact that you have to not only read, but write to interact with ChatGPT means 99.99% of people will not use it.

I trust Alexa & Siri completely though.

1 more reply

none_to_remain3y ago

Humans are capable of not bullshitting

ChatGPT can only bullshit

avgDev3y ago

It is quite interesting really. I took AI in school but I have not dived deep at all in ChatGPT but isn't chatGpt just learning from the internet?

Could someone push "wrong" opinion heavily online to sway the opinion of AI?

I can only imagine a bot that learned from 4chan.

ChatGTP3y ago

Meet gpt-4chan https://huggingface.co/ykilcher/gpt-4chan

Dreams can come true…

ravenstine3y ago

AI will never be totally correct. If it ever is, then we've found God.

GulpGulp3y ago

I think some of this will take care of itself with attrition. People who lack the knowledge to fact check on the fly will give up after repeatedly getting wrong answers.

ChatGTP3y ago

I’m also worried there’s so potential money involved now that it’s never going away.

Even if it’s wrong, dangerous, misleading, fundamentally flawed as a concept whatever. Big tech and money will find ways to keep putting it in front of us.

quantiq3y ago

annoyingnoob3y ago

> just as incorrect as the rest of us

Even worse because it has no clue when it might be completely wrong and yet it will be confident in its answer.

DoktorDelta3y ago

That might be the most human thing it's ever done

1 more reply

freyfogleOP3y ago· 10 in thread

ChatGPT very convincingly recommends us for a service we don't provide.

Dozens of people are signing up to our site every day, then getting frustrated when "it doesn't work".

Please do NOT trust the nonsense ChatGPT spits out.

seedless-sensat3y ago

A new market opportunity for your company?

input_sh3y ago

> This is not a service we provide. It is not a service we have ever provided, nor a service we have any plans to provide. Indeed, it is a not a service we are technically capable of providing.

archagon3y ago

Heh: an unforeseen future where instead of making the AI more reliable, we instead change reality to accommodate its mistakes.

theWreckluse3y ago

> It is not a service we have ever provided, nor a service we have any plans to provide. Indeed, it is a not a service we are technically capable of providing.

anaganisk3y ago

So, based on the BS these LLMs spout and companies start pivoting. The govts should start writing laws?

2 more replies

tnzk3y ago

No, as the article mentions, there already seem to be bunch of posts and videos that claim one can use this feature. GPT just has been trained with them, not invented anything themselves.

If this was new market opportunity, just publishing a falsehood would do the same job.

hackernewds3y ago

this seems like a game-changing opportunity actually. I'd be down to buy the domain

fire3y ago

have you been able to contact OpenAI about this? It sounds like they're actively adding load to your CS ops with this

SCLeo3y ago

1 more reply

hackernewds3y ago

what are they going to do? add custom logic? where does it stop?

the malady is that LLMs cannot do operational adhoc changes such as these kinds of errors at scale

3 more replies

VectorLock3y ago· 8 in thread

dizhn3y ago

krsdcbl3y ago

I guess the issue at core is that it doesn't, and can't know if it can or can't do it. That's not what it's designed to do, even if it does quite well at seeming so.

shagie3y ago

Have you tried using the code-davinci-002 model instead of ChatGPT?

For example - https://platform.openai.com/playground/p/default-translate-c...

The codex models are intended for doing work with code rather than language and may give better results in that context. https://help.openai.com/en/articles/6195637-getting-started-...

IncRnd3y ago

It does indeed sound problematic to use ChatGPT daily for computer programming tasks. ChatGPT is not a snippets manager but text completion.

It may be more helpful to look for better answers on Amazon's help pages for SNS and AWS SDK.

VectorLock3y ago

I know the answer. SNS can't do that. But ChatGPT hallucinated it could. Just like the original post about a capability their API doesn't provide.

wvenable3y ago

The problem is compounded by the fact that sometimes it produces really good results. One task, good results. Next task, totally hallucinated result.

fijiaarone3y ago

That’s what my boss said about me on my last performance review.

root_axis3y ago

coldtea3y ago· 8 in thread

ChatGPT doesn't "recommended" anything. It just recombines text based on statistical inferences that appear like a recommendation.

It could just as well state that humans have 3 legs depending on its training set and/or time of day. In fact it has said similar BS.

circuit103y ago

> ChatGPT doesn't "recommended" anything. It just recombines text based on statistical inferences that appear like a recommendation.

coldtea3y ago

barking_biscuit3y ago

Does YouTube recommend you videos to watch? Does Amazon recommend you products to buy? Or do they just recombine text based on statistical inferences that appear like a recommendation?

coldtea3y ago

Obviously they "just recombine text based on statistical inferences that appear like a recommendation".

And even that, they do badly.

npteljes3y ago

>ChatGPT doesn't "recommended"

I mean, you could say that about a person too, as you don't know how much that they are saying is bullshit.

mort963y ago

What would you call it instead?

qwertox3y ago

3 more replies

coldtea3y ago

A glorified Markov chain generator.

Now, humans could very well also be statistical inference machines. But they have way more tricks up their semantic-level understanding sleeves than ChatGPT circa 2023.

1 more reply

singlow3y ago· 8 in thread

1. https://blog.opencagedata.com/post/we-can-not-convert-a-phon...

freyfogleOP3y ago

Hi, Ed from OpenCage here, author of the post.

We do have python tutorials and SDKs showing how to use our service for ... geocoding, the actual service we provide.

What I think you can't appeciate is the difference of scale. A faulty youtube video drives a few users. In the last weeks ChatGPT is sending us several orders of magnitude more frustrated sign-ups.

singlow3y ago

1. https://github.com/s41r4j/phomber

2 more replies

ceejayoz3y ago

That seems like a pretty nasty assertion to bandy around with zero evidence.

singlow3y ago

1 more reply

vlunkr3y ago

There's also no clear motive. They want to attract users to a fake feature their free tier?

gus_massa3y ago

That explains why ChatGPT is confused.

fwlr3y ago

As a result, OpenCage starts getting weird usage patterns, tracks them down, finds the source is these tutorials, and makes a post about it.

As a result, OpenCage starts getting weird usage patterns, tracks them down, finds the source is ChatGPT, and makes a post about it.

marvy3y ago

Honestly, I'm a bit surprised that the OpenCage blog post is so calm about this, instead of just yelling incoherently "why WHY why would anyone think like this?!?"

mort963y ago· 7 in thread

By the way, the last paragraph has some typos:

geoelectric3y ago

I thought for sure that must be glitching, so just tried on my ChatGPT Plus account with a fresh thread, but nope:

> does 2 pounds of feathers weigh more than 1 pound of bricks?

jxy3y ago

This is hilarious. ChatGPT even gave me a more bizarre example.

> does 2 pounds of bricks weigh more than 1 pound of bricks?

4 more replies

insane_dreamer3y ago

> don't trust ChatGPT for anything

shagie3y ago

It works really well for translating one "language" to another "language".

Give it some structured data and ask it to summarize it (e.g. hourly weather data and it gives a better summarization than a template based one).

Give it HN titles and the categories and it does a passable zero shot tagging of them ( https://news.ycombinator.com/item?id=34156626 ).

I'm toying around with making a "guided bedtime story generator". A friend of mine uses it to create a "day in the life of a dinosaur" stories for a child (a different story each day!)

The key is to play to its strengths rather than testing its bounds and complaining that they break in weird ways when they will inevitably break in weird ways.

visarga3y ago

> If you have to verify its information by other means, then you're not really saving much effort.

Just like any piece of code we write. We have to test, debug, verify and it still might have errors after that. And in scientific papers the conclusions are often contradicted by other papers.

catach3y ago

Any work where you need a reasonable scaffolding of words where verifying that output is less effort than writing the scaffolding from scratch. Plenty of fact-light writing needs be done.

worldsayshi3y ago

jefftk3y ago· 7 in thread

This is not a service we provide. It is not a service we have ever provided, nor a service we have any plans to provide. Indeed, it is a not a service we are technically capable of providing.

I'm curious: why not? It seems like a lot of people would be interested in this if you could figure out how to provide it.

simonw3y ago

How would this work?

If a phone number is for a mobile phone then looking up the location doesn't make sense at all: mobile phones are mobile.

I guess you could try and crawl an index of business phone numbers and associate those with the listed address for businesses, but that's a completely different business from running a geocoder.

You could provide a bit of geographical information about the first three digits of a US phone number. I imagine that's not what users are actually looking for though.

jefftk3y ago

I expect there are also patterns in other countries?

5 more replies

iamflimflam13y ago

The service is possible:

If you are a mobile network operator.

Or, you can convince people to install something on their phone that sends you their location along with their phone number.

ceejayoz3y ago

How would you go about reliably providing the location of someone's mobile phone without being their cell phone carrier?

jraph3y ago

By partnering with said cell phone carriers.

But I hope it would be illegal.

1 more reply

insane_dreamer3y ago

> a lot of people would be interested in this

you mean like scammers and stalkers? (ok, and probably Meta)

cactusplant73743y ago

You mean if they could figure out how to illegally track millions of people?

ninjakeyboard3y ago· 6 in thread

ChatGPT gets the rules to the pokemon trading card game wrong. It will tell you you can use 4 energy a turn. Convincingly. Not sure how it hallucinates this. The rule is 1 per turn.

codetrotter3y ago

A few days ago I asked ChatGPT if “pannekake” and “kannepake” are anagrams of each other.

    Letter | Frequency in | Frequency in
           | “pannekake”  | “kannepake”
    - - - - - - - - - - - - - - - - - - -
    a      | 2            | 2
    e      | 2            | 2
    k      | 2            | 2
    n      | 2            | 2
    p      | 2            | 2

int_19h3y ago

DoktorDelta3y ago

Could it have been referencing Blastoise's Deluge ability? Jacob Van Wagner used it in the 2015 championship to use 4 water energy in one turn.

kaetemi3y ago

ninjakeyboard3y ago

bigmattystyles3y ago

hayksaakian3y ago· 5 in thread

This marks the new age of "AI Optimization" where companies will strive to get their business featured into answers in ChatGPT.

The OP's example is Unwanted demand, but it clearly shows that ChatGPT can funnel potential customers towards a product or service.

impalallama3y ago

bick_nyers3y ago

Isn't that just SEO in a nutshell though? Hopefully with more advancements in LLM's we can get more bullshit detection/discrimination against SEO.

fijiaarone3y ago

I can think of a good way to generate all that astroturf content.

return_to_monke3y ago

akira25013y ago

> This marks the new age of "AI Optimization"

Or it marks the beginning of the next "AI Winter."

> but it clearly shows that ChatGPT can funnel potential customers towards a product or service.

And the next logical step is "chatgpt keywords advertising." Which is right back where we started.

yieldcrv3y ago· 5 in thread

lol it recommended their api and gave python code for using it

but the real api doesnt give results that the user asked ChatGPT for

that is amusingly alarming

CabSauce3y ago

Not quite as alarming as these people most likely trying to stalk someone without their permission.

hk__23y ago

> Not quite as alarming as these people most likely trying to stalk someone without their permission.

2 more replies

int_19h3y ago

Bonus points for using ChatGPT to implement this end-to-end.

goguy3y ago

Our jobs are safe! For now...

fijiaarone3y ago

Until someone figures out that we are all just hallucinating completely wrong code.

129078352023y ago· 3 in thread

The biggest takeaway for me was that it was getting info from YouTube videos. Is it actually watching and learning from the videos or where links to GitHub just included in the comments?

mgraczyk3y ago

I think this is just an incorrect assumption on the part of the blog authors.

mtmail3y ago

1 more reply

jdiff3y ago

Transcripts exist, and if there's multiple YouTube tutorials out there the odds are very good it also exists as a few dozen plain text articles.

ntonozzi3y ago· 2 in thread

Including the word 'phone' six times in a popular blog post is not going to help their predicament.

elicash3y ago

KomoD3y ago

Not gonna hurt either, ChatGPT data is not up to date

IshKebab3y ago· 2 in thread

At the top you should have a diagram like this:

Lat, lon <- opencage -> address

With a few examples underneath.

mtmail3y ago

IshKebab3y ago

Ah yes - I clicked on "Makers of the OpenCage Geocoding API" on your blog post which I assumed would go to your homepage (on mobile so it's a bit harder to tell).

Your actual homepage is indeed much better.

weird-eye-issue3y ago· 2 in thread

Related: One reason I just started using Rainforest API is because Github Copilot recommended it.

ryankrage773y ago

Are you aware of the pitfalls of parsing HTML with regex?

weird-eye-issue3y ago

Yes, but it started to try to use regex so I thought I'd see if it could at least be successful and it wasn't. Despite super simple HTML.

Anyways, I wrote a solution using HTMLRewriter in 10 minutes...

massysett3y ago· 2 in thread

flangola73y ago

You're trying to use a language model as an information reference. A translator can explain what a diplomat is saying but they can't perform their whole job.

massysett3y ago

? ChatGPT did not simply explain what someone else was saying. It created something completely new and completely false.

1 more reply

gumballindie3y ago· 1 in thread

kaetemi3y ago

It's just making up your library. Ask it to write some documentation, don't be specific yet, then drop a whole header or piece of code from your project into the chat.

MagicMoonlight3y ago

But guys we totally need to delete all of our search indexes and replace them with this instead

kissgyorgy3y ago

ggm3y ago

It's unconscionable. If there was no robot in the loop here, and it was people mis-transcribing youtube to compile e.g. Google search optimisation we'd call it fraud.

fabianfabian3y ago

ChatGPT does not know how to be correct, it only knows how to sound correct.

A better name for now would be PlausibleGPT.

sam0x173y ago

JohnFen3y ago

> All suggestions are welcome.

They have to get an API key from you. What about a large warning at the start of that process telling them that this isn't a service you provide?

BigBalli3y ago

Just redirect to here http://bigballi.com/Phone-Number-Lookup

adolph3y ago

If you have to tell potential customers you don’t do something, maybe you should just do it instead.

ChatGPT as business line lead generator—is there anything it can’t do?

99_003y ago

I remember a time when "I saw it on the internet" was a punchline for a joke about someone who's gullible or misinformed.

1970-01-013y ago

Fast, creative, and wrong isn't a trio. This is more evidence of ChatGPT being evolutionary and not revolutionary.

Sloppy3y ago

Psychopaths can be useful if they are knowledgeable but beware, you are talking to a psychopath in ChatGPT.

ano888883y ago

dakial13y ago

Soon we are going to have a AIrobots.txt

b800h3y ago

Is this not defamation, at least in some jurisdictions?

eternalban3y ago

If this business suffers financial or reputational damage because of ChatGPT's misinformation, should OpenAI be liable?

ninjakeyboard3y ago

It hallucinates that you can use 4 energy per turn in Pokemon TCG and confidently tells you so. No idea where that would come from.

j / k navigate · click thread line to collapse