Working with The Associated Press to provide fresh results for the Gemini app (opens in new tab)

(blog.google)

87 pointsalexrustic1y ago64 comments

64 comments

38 comments · 10 top-level

itsibitzi1y ago· 20 in thread

As someone who works in the news industry I find it pretty sad that we've just capitulated to big tech on this one. There are countless examples of AI summaries getting things catastrophically wrong, but I guess Google has long since decided that pushing AI was more important than accurate or relevant results, as can also be seen with their search results that simply omit parts of your query.

I can only hope this data is being incorporated in some way that makes hallucinations less likely.

nerdjon1y ago

Unfortunately this has just been the reality over the last couple years. People just ignore the hallucination problem (or try to say it isn't a big deal). And yet we have seen time and time again examples of these models being given something, told to summarize it, and still hallucinate important details. So you can't even make the argument that its data is flawed or something.

These models will interject information from their training whether or it is relevant or not. This is just due to the nature of how these models work.

Anyone trying to argue that it doesn't happen that often or anything is missing the key problem. Sure it may be right most of the time, but all that does is build a false sense of security and eventually you stop double checking or clicking through to a source. Whether it is a search result, manipulating data, or whatever.

This is made infinitely worse when these summaries are one and done, a single user is going to see the output and no one else will see it to fact check. It isn't like an article being wrong that everyone reading it is reading the same article, can then comment that something is wrong, it get updated, and so on and so forth. That feedback loop is non-existent with these models

umvi1y ago

> Anyone trying to argue that it doesn't happen that often or anything is missing the key problem. Sure it may be right most of the time, but all that does is build a false sense of security and eventually you stop double checking or clicking through to a source. Whether it is a search result, manipulating data, or whatever.

Same problem existed before AI summaries.

"Briefly stated, the Gell-Mann Amnesia effect is as follows. You open the newspaper to an article on some subject you know well. In Murray's case, physics. In mine, show business. You read the article and see the journalist has absolutely no understanding of either the facts or the issues. Often, the article is so wrong it actually presents the story backward—reversing cause and effect. I call these the "wet streets cause rain" stories. Paper's full of them.

In any case, you read with exasperation or amusement the multiple errors in a story, and then turn the page to national or international affairs, and read as if the rest of the newspaper was somehow more accurate about Palestine than the baloney you just read. You turn the page, and forget what you know."

– Michael Crichton (1942-2008)

extr1y ago

Hallucinations are not a big problem with SOTA models at this point, especially grounded against an actual news article.

dismalaf1y ago

> I can only hope this data is being incorporated in some way that makes hallucinations less likely.

The key word is "real-time". LLMs can't be trained in realtime, so it's obviously going to call an API that pulls up and reads from AP news, just like their search engine.

notatoad1y ago

i don't think you can assume that - "real time" in this context could just mean they feed every article into their training system as soon as it's published.

4 more replies

scarface_741y ago

The examples that have made news were with iOS. iOS doesn’t really do a summary of the content. It just tries to do a summary of the headline.

The on device model that it uses is also literally 1% the size of the large models like Gemini

paxys1y ago

The news industry capitulated to big tech the moment it got reliant on big tech for the majority of its revenue. The entire media landscape today is the direct result of that.

asdff1y ago

Take it a step back further, and you will see that the media landscape capitulated to Big Anything a long time ago. For probably generations now, if we consider people like william randolf hearst and other newspaper men.

micromacrofoot1y ago

It's responsibility laundering — AI can say whatever they want and they can shrug it off by saying bots are sometimes unreliable

asdff1y ago

Uhh, has your head been in the sand? Look at the average output of your industry without ai. It gets things wrong. It misleads. It hallucinates. It has incentives that fundamentally differ from what the readership seeks in news. The fact that your industry took so readily to the technology to output ever more garbage says it all about the state of the industry vs any condemnation of the fundamental technology.

onlyrealcuzzo1y ago

Gemini is the leading model with the lowest hallucination rate: https://www.visualcapitalist.com/ranked-ai-models-with-the-l...

I would expect that number to go down from 1.3% to below 1% over the course of the year.

There's always a chance what you're reading is wrong - due to purposeful deception, negligence, or accident.

Realistically, hardly anything is 100% accurate besides math.

itsibitzi1y ago

I think people really don't understand the effort, care and risk that goes into producing quality reporting.

I work with investigative reporters on stories that take many months to produce. Every time we receive a leak there is an extensive process of proving public interest before we can even start looking at the material. Once we can see it in we have to be extremely careful with everything we note down to make sure that our work isn't seen as prejudiced if legal discovery happens. We're constantly going back and forth with our editorial legal team to make sure what we're saying is fair and accurate. And in the end, the people we're reporting are given a chance to refute any of the facts we're about to present. Any mistakes can result in legal action that can ruin the lives of reporters and shut down companies.

Now, imagine I were to go to a reporter who has spent 6 months working on a story about, for example, a high profile celebrity sexually assaulted multiple women, how the royal family hides their wealth and are exempt from laws, or how multinational corporations use legal loopholes to avoid paying taxes, and said, "oh, 1% of people reading this will likely be given some totally made up details".

Given that stories often have more than a million impressions, this would lead tens of thousands of people with potentially libellous "hallucinations".

It simply should not be allowed.

LLMs have their place, for sure, but presenting the news is not it.

1 more reply

simonw1y ago

The Gemini models themselves may score well on this, but Google's feature implementations are a whole other thing. AI Overviews frequently take untrustworthy search results (like a fan fiction plot outline for Encanto 2) and turn those into confidently incorrect answers. https://simonwillison.net/2024/Dec/29/encanto-2/

1 more reply

stusmall1y ago

1.3% isn't great. I'd rather just go, and pay, directly to trusted news sources. Everyone has different tolerance for falsehoods and priorities I guess.

3 more replies

peanutz4541y ago

> There's always a chance what you're reading is wrong - due to purposeful deception, negligence, or accident.

I am quite certain my personal hallucinations level is more than 1.3%, obviously we want our machines to be better than us, but my doctor once said folic acid is not a vitamin.

sandspar1y ago

Journalism is dying, killed by its own excesses and by the internet. Google is offering it life support. The other option is death.

contagiousflow1y ago

And what are we going to feed into the models without the journalism?

2 more replies

great_tankard1y ago

You might want to look into why "journalism is dying" and whether Google (and Facebook) had anything to do with it.

Mainsail1y ago

Theoretical question. What is replacing it? Is it this? Is it something else? Nothing? Curious to peoples thoughts on this.

2 more replies

spankalee1y ago

"killed by its own excesses"?

1 more reply

331c8c711y ago· 3 in thread

We've gone the whole way from decentralization and rebelliousness of the early internet and the landscape is becoming suffocatingly sterile (=lifeless).

I'm much more excited about eventual emergence of underground homebrew models without any guardrails...

umvi1y ago

> I'm much more excited about eventual emergence of underground homebrew models without any guardrails

Not if AI gatekeepers and interest groups have anything to say about it. AI without guardrails could be classified as a "weapon" and made illegal such that we are only allowed to use models produced by regulated entities and meet certain "safety standards" (like how medical software has to be approved by FDA).

Edit: oh, I guess "underground" could be interpreted in a way that these models are still produced and distributed (but secretly, illegally, etc)

srid1y ago

Yes. What do you think Grok AI's real-time information being sourced on posts of individual users (similar to early internet) as opposed to established media (like AP)?

kennyloginz1y ago

No, Twitter posts are not even close to the “old internet”. No one was paid to create ragebait.

We still argued, but we did it from a place of passion, not commission.

heavyarms1y ago· 2 in thread

There's not a lot of detail in the announcement but I assume this is some kind of RAG system. I wonder if it will cover some short time period (past week, past month?) or if they are trying to cover the whole time period since the knowledge cutoff of the current model.

urbandw311er1y ago

My guess is that they’ll just stuff a few daily headlines into the prompt so that queries about current affairs have some context, rather than re-training the model. Total guess obviously.

PhilippGille1y ago

RAG isn't re-training. You can have vector embeddings of all AP news in a vector DB, then when prompted, find related news via similarity search, and add the most similar (and thus related) ones to the context.

Here's some simple example code in Go, for RAG with 5000 arXiv paper abstracts: https://github.com/philippgille/chromem-go/tree/v0.7.0/examp... (full disclosure it's using a simple vector DB I wrote)

1 more reply

sharpshadow1y ago· 2 in thread

Is there the option to get the news then as they fly in immediately in a feed?

Mr-Frog1y ago

https://developer.ap.org/ap-media-api/agent/Feeds_and_Linked...

nxobject1y ago

I'm surprised I'd never asked that question before, since the AP and other syndicates began as teletype wire feeds. What do modern newsrooms use as the modern replacement of the AP "wire"?

smithcoin1y ago· 1 in thread

FYI If you want to turn this off in workspace you'll need to go here https://admin.google.com/ac/managedsettings/47208553126 and here https://admin.google.com/ac/managedsettings/793154499678.

sebmellen1y ago

The hero we needed

xnx1y ago

The timing of this announcement is surely to contrast to open OpenAI which is currently in court being sued by The New York Times.

throw71y ago

I wonder what the byline will look like. I'm sure their current crop of beat reporters are enthusiastic with developments.

eichi1y ago

One of the CEO was really competitive and has been the few legecy asset which are contributing current Google: other legecy assets are pools of competitive people who hadn't found the best place to show the ability. Current google is just the target of the good profile.

bluSCALE41y ago

Google, what we really want are ads.

bangaroo1y ago

wow! this sure is great! gemini has worked so great up until this point - for example, i learned that a man who died in 1850 is one of three private owners of the airbus a340-600 last week! i'm so glad gemini exists and i absolutely cannot wait to experience a world wherein people get news from it.

j / k navigate · click thread line to collapse

64 comments

38 comments · 10 top-level

itsibitzi1y ago· 20 in thread

I can only hope this data is being incorporated in some way that makes hallucinations less likely.

nerdjon1y ago

These models will interject information from their training whether or it is relevant or not. This is just due to the nature of how these models work.

umvi1y ago

Same problem existed before AI summaries.

– Michael Crichton (1942-2008)

extr1y ago

Hallucinations are not a big problem with SOTA models at this point, especially grounded against an actual news article.

dismalaf1y ago

> I can only hope this data is being incorporated in some way that makes hallucinations less likely.

The key word is "real-time". LLMs can't be trained in realtime, so it's obviously going to call an API that pulls up and reads from AP news, just like their search engine.

notatoad1y ago

i don't think you can assume that - "real time" in this context could just mean they feed every article into their training system as soon as it's published.

4 more replies

scarface_741y ago

The examples that have made news were with iOS. iOS doesn’t really do a summary of the content. It just tries to do a summary of the headline.

The on device model that it uses is also literally 1% the size of the large models like Gemini

paxys1y ago

The news industry capitulated to big tech the moment it got reliant on big tech for the majority of its revenue. The entire media landscape today is the direct result of that.

asdff1y ago

micromacrofoot1y ago

It's responsibility laundering — AI can say whatever they want and they can shrug it off by saying bots are sometimes unreliable

asdff1y ago

onlyrealcuzzo1y ago

Gemini is the leading model with the lowest hallucination rate: https://www.visualcapitalist.com/ranked-ai-models-with-the-l...

I would expect that number to go down from 1.3% to below 1% over the course of the year.

There's always a chance what you're reading is wrong - due to purposeful deception, negligence, or accident.

Realistically, hardly anything is 100% accurate besides math.

itsibitzi1y ago

I think people really don't understand the effort, care and risk that goes into producing quality reporting.

Given that stories often have more than a million impressions, this would lead tens of thousands of people with potentially libellous "hallucinations".

It simply should not be allowed.

LLMs have their place, for sure, but presenting the news is not it.

1 more reply

simonw1y ago

1 more reply

stusmall1y ago

1.3% isn't great. I'd rather just go, and pay, directly to trusted news sources. Everyone has different tolerance for falsehoods and priorities I guess.

3 more replies

peanutz4541y ago

> There's always a chance what you're reading is wrong - due to purposeful deception, negligence, or accident.

I am quite certain my personal hallucinations level is more than 1.3%, obviously we want our machines to be better than us, but my doctor once said folic acid is not a vitamin.

sandspar1y ago

Journalism is dying, killed by its own excesses and by the internet. Google is offering it life support. The other option is death.

contagiousflow1y ago

And what are we going to feed into the models without the journalism?

2 more replies

great_tankard1y ago

You might want to look into why "journalism is dying" and whether Google (and Facebook) had anything to do with it.

Mainsail1y ago

Theoretical question. What is replacing it? Is it this? Is it something else? Nothing? Curious to peoples thoughts on this.

2 more replies

spankalee1y ago

"killed by its own excesses"?

1 more reply

331c8c711y ago· 3 in thread

We've gone the whole way from decentralization and rebelliousness of the early internet and the landscape is becoming suffocatingly sterile (=lifeless).

I'm much more excited about eventual emergence of underground homebrew models without any guardrails...

umvi1y ago

> I'm much more excited about eventual emergence of underground homebrew models without any guardrails

Edit: oh, I guess "underground" could be interpreted in a way that these models are still produced and distributed (but secretly, illegally, etc)

srid1y ago

Yes. What do you think Grok AI's real-time information being sourced on posts of individual users (similar to early internet) as opposed to established media (like AP)?

kennyloginz1y ago

No, Twitter posts are not even close to the “old internet”. No one was paid to create ragebait.

We still argued, but we did it from a place of passion, not commission.

heavyarms1y ago· 2 in thread

urbandw311er1y ago

My guess is that they’ll just stuff a few daily headlines into the prompt so that queries about current affairs have some context, rather than re-training the model. Total guess obviously.

PhilippGille1y ago

Here's some simple example code in Go, for RAG with 5000 arXiv paper abstracts: https://github.com/philippgille/chromem-go/tree/v0.7.0/examp... (full disclosure it's using a simple vector DB I wrote)

1 more reply

sharpshadow1y ago· 2 in thread

Is there the option to get the news then as they fly in immediately in a feed?

Mr-Frog1y ago

https://developer.ap.org/ap-media-api/agent/Feeds_and_Linked...

nxobject1y ago

I'm surprised I'd never asked that question before, since the AP and other syndicates began as teletype wire feeds. What do modern newsrooms use as the modern replacement of the AP "wire"?

smithcoin1y ago· 1 in thread

FYI If you want to turn this off in workspace you'll need to go here https://admin.google.com/ac/managedsettings/47208553126 and here https://admin.google.com/ac/managedsettings/793154499678.

sebmellen1y ago

The hero we needed

xnx1y ago

The timing of this announcement is surely to contrast to open OpenAI which is currently in court being sued by The New York Times.

throw71y ago

I wonder what the byline will look like. I'm sure their current crop of beat reporters are enthusiastic with developments.

eichi1y ago

bluSCALE41y ago

Google, what we really want are ads.

bangaroo1y ago

j / k navigate · click thread line to collapse