The project was born out of personal pain — I wanted a way to read only significant news, like major humanity milestones, or historical political events, filtering out all the celebrity gossip and smartphone releases. But I couldn't find a way to do that — everywhere I looked, the news was ranked by popularity, coverage, or relevance, not significance.
I first tried to solve the problem in the beginning of 2023 with GPT-3 (the top model at that time) by asking it to estimate the significance of some news stories. The results were painfully bad — for some reason, the model preferred tragic, personal stories, completely missing the essence of what makes the news significant. No amount of prompt engineering could fix that.
But it all changed in March 2023 when GPT-4 came out. The scores it gave made much more sense. After a month of work, the first version was ready. News Minimalist had its first successful Hacker News post (https://news.ycombinator.com/item?id=35795388), and I realized that a lot of people had the same problem I had.
I've been working on improving the project ever since. As probably most tech founders, I spent too much time on technical improvements, completely ignoring marketing. But I think that work paid off, and I'm finally satisfied with the scores it gives.
The results are posted on the site: https://www.newsminimalist.com/
Let me know what you think!
Vadim
A lot of the most signficant stories are political, for example, which someone may have no interest in.
I have had this same idea in the past, tuning to my personal interests.
I separate significance from importance (or relevance).
Importance is subjective. News about the health of my family members is important to me, but it is not significant to the world.
Significance is objective. It's about how much the event affects humanity as a whole.
I don’t agree with that, at least not in the present. We only know what’s truly significant when we reflect on history. There are very few things we can be certain are significant in the present. Climate change is likely one, but the US debt ceiling and the war in Ukraine don’t seem as likely to me, at least not in the human scale. There are also events that happen that don’t appear significant in the present but will be hugely significant in the future.
https://en.wikipedia.org/wiki/Portal:Current_events
Although that's perhaps way too minimalist?
I was disagreeing a lot with their selection of news, for example one of their recent entries is:
"Two people are killed and eleven others are injured when a bus flips on its side on a highway near Prenzlau, northeast of Berlin, Germany."
My system gave it a significance score of 1.8, so similar news should never get to the main page: https://www.newsminimalist.com/articles/two-dead-and-four-in...
Does your service do a good job at thinking longer term? Would you have an example of this?
Not trying to have an edgy take here, it comes from a genuine attempt to use it for this exact purpose, but it didn't work at all. A train crash killing 20, as unfortunate as it is, can't possibly be one of the 5 most significant things that happened over the last week, but Wikipedia current events considers it one.
$ wget -qO - https://rss.beehiiv.com/feeds/4aF2pGVAEN.xml | grep pubDate
<pubDate>Thu, 16 Jan 2025 17:56:47 +0000</pubDate>
<pubDate>Thu, 16 Jan 2025 17:56:47 +0000</pubDate>
<pubDate>Mon, 13 Jan 2025 17:22:34 +0000</pubDate>
<pubDate>Sat, 11 Jan 2025 16:42:01 +0000</pubDate>
<pubDate>Thu, 09 Jan 2025 17:52:38 +0000</pubDate>
<pubDate>Fri, 03 Jan 2025 16:19:25 +0000</pubDate>
<pubDate>Sat, 28 Dec 2024 17:44:35 +0000</pubDate>
<pubDate>Wed, 18 Dec 2024 16:44:00 +0000</pubDate>
<pubDate>Mon, 16 Dec 2024 18:01:56 +0000</pubDate>
<pubDate>Tue, 10 Dec 2024 18:00:55 +0000</pubDate>
<pubDate>Fri, 06 Dec 2024 17:43:37 +0000</pubDate>
<pubDate>Wed, 04 Dec 2024 16:38:38 +0000</pubDate>
<pubDate>Sun, 01 Dec 2024 18:57:43 +0000</pubDate>
<pubDate>Fri, 29 Nov 2024 18:07:14 +0000</pubDate>
<pubDate>Sun, 24 Nov 2024 15:26:59 +0000</pubDate>
<pubDate>Thu, 21 Nov 2024 17:25:22 +0000</pubDate>
<pubDate>Tue, 19 Nov 2024 17:47:19 +0000</pubDate>
<pubDate>Fri, 15 Nov 2024 17:28:00 +0000</pubDate>
<pubDate>Tue, 12 Nov 2024 17:44:08 +0000</pubDate>
<pubDate>Fri, 08 Nov 2024 17:22:01 +0000</pubDate>
<pubDate>Wed, 06 Nov 2024 16:43:10 +0000</pubDate>
So since 6th november there were only 21 articles. Longest streak was 10 days and common is 3 days without any news whatsoever.Also: It'd be great if you had a feed tag in your HTML head, so RSS readers could pick it up straight out of your homepage URL instead of needing to manually hunt for the right RSS link.
It's available via RSS too: https://rss.beehiiv.com/feeds/4aF2pGVAEN.xml
I attempted to make a weekly version, but quickly dropped the idea. Over the course of the week articles often became outdated (not just old, but plain wrong).
I found that an optimal newsletter schedule is sending it about every 48-72 hours, depending on how eventful that period was. With this frequency, the articles rarely become outdated, and at the same time it's not too frequent to get tired of.
I think LLMs are really underutilized as a "judgement tool". A couple similar ideas people reached out to me with were: evaluating which pull requests are more significant in a big repo, or which grant applications have more merit.
The LLMs will always make mistakes, but they could work great as the first filter.
As in every other engineering endeavor, the raw data you start off with has a lot to do with what you end up with, no matter what transforms happen. :)
Wikipedia current events page was actually one of the reasons for creating this project! I was disagreeing a lot with their selection of news, for example one of the recent entries is:
"Two people are killed and eleven others are injured when a bus flips on its side on a highway near Prenzlau, northeast of Berlin, Germany."
My system gave it a significance score of 1.8: https://www.newsminimalist.com/articles/two-dead-and-four-in...
Most other aggregators show news based on 1) relevance, 2) upvotes or 3) coverage.
Relevance-based algorithms tend to put reader into a bubble, where the more they read on a certain topic, the more news they see on that topic.
Upvotes-based algorithms usually bring up a lot of clickbait and drama.
Sorting by coverage doesn't really work either, media often just follows people's interests and churns articles on what is "hot".
For example, last summer, a fight between Zuckerberg and Musk was at the top of most feeds based both on upvotes and coverage. Significance-based algorithm didn't even put it in the top 50.
rank by significance:
NOT politics; NOT Trump; NOT republican; NOT democratOr this tip about how storing milk in the fridge door makes it go bad sooner: https://www.mirror.co.uk/news/uk-news/warning-anyone-storing...
Or this Dear Abby piece responding to a woman whose husband might have undiagnosed depression: https://www.mlive.com/advice/2025/01/dear-abby-trying-to-fig...
Three men occupy table without eating at busy food court
A video taken by a Singaporean Stomper shows three men playing games on their mobile phones while seated at a dining table in a crowded food court. The group did not have any food or drinks on the table, and despite the lunch crowd, they did not give up their seats to those waiting for a table. The incident occurred at the newly-opened Sinfoodie food court at Tai Seng Street, which is known to be very busy during lunch.
https://www.newsminimalist.com/articles/d8a34b64-3486-4a45-8...
I had a very similar idea a while back. I wanted to rank news by "impact" which might be more concrete than "significance."
For an LLM prompt, it would be something like:
"estimate the number of people who's lives that will be materially changed by this news." and "estimate the average degree of change for those impacted."
Then impact is roughly the product of those two.
Additionally, I want a version that is tailored to me specifically "estimate the degree of change this will have on my life." + context of my life.
Tangentially, I've found that getting ratings out LLMs works better when I can give all options and request relative ratings. If I ask for rankings individually I get different and less good results. Not enough context length to rate all news from all time in one go though. Any thoughts on that? Maybe providing some benchmark ratings with each request could help? Something I'm exploring.
In the beginning I had 3 parameters: scale (number of people), magnitude (degree of change for those impacted) and additionally potential (how likely is this event to trigger downstream significant events).
The point behind including potential was to separate these two events:
1) A 80 year old dies from cancer 2) An 80 year old dies from a new virus called COVID
This worked roughly well but I kept adding parameters to improve the system: novelty, credibility, etc... The current system works on 7 parameters.
---
I never attempted to give LLM all options and rank them against each other.
1) as you said, for me 20k articles is just too much to fit into context window. Maybe some modern LLMs can handle it, but it wasn't the case for a long time, and I settled on current approach.
2) I don't want the "neighbors" to affect individual article ratings. With the current system I am able to compare news spread over months, because they were all rated using the same prompt.
3) I intentionally avoided giving AI examples, like "evaluate event X given that event Y is 7/10". I want it to give scores with a "clear mind" and not be "primed" to my arbitrary examples.
I'm working on something similar but am thinking of using AI differently. Great job Vadim.
How can I filter out (block) subjects/words?
"World" includes everything that talks about two or more countries.
"Nation" is a category with news that touch only a single country, not necessarily US. The current feed is very US-heavy because practically everyone (even non-US sources) is talking about Trump.
Keyword blocking is available on premium: https://www.newsminimalist.com/premium#block-topics
The default feed sorting is done for regular visitors (new first), for evaluating the output you might like the "significant first" more: https://www.newsminimalist.com/?sort=significance
On that list, the ceasefire article is on the second place out of the ~40k articles analyzed.
---
Having more variables is an good idea. I don't have an immediate vision on how to use it in the UI (I want to keep it minimal), but will think more about it.
---
I've been really torn on free trial. I currently offer a refund guarantee, but will add a trial as well soon.
But from what I've seen it's pretty indifferent to "sides", it's more focused on raw numbers of people affected and magnitude of the event.
Thank you for the inspiration!
Might take a stab at ranking them as well.
"ChatGPT, set a daily/weekly task to give me the most significant news. Use this ranking criteria: <input criteria>"