Show HN: I made an Ollama summarizer for Firefox (opens in new tab)

(addons.mozilla.org)

132 pointstcsenpai1y ago33 comments

Source: https://github.com/tcsenpai/spacellama

33 comments

25 comments · 6 top-level

RicoElectrico1y ago· 7 in thread

I've found that for the most part the articles that I want summarized are those which only fit the largest context models such as Claude. Because otherwise I can skim-read the article possibly in reader mode for legibility.

Is llama 2 a good fit considering its small context window?

tcsenpaiOP1y ago

Personally I use llama3.1:8b or mistral-nemo:latest which have a decent contex window (even if it is less than the commercial ones usually). I am working on a token calculator / division of the content method too but is very early

garyfirestorm1y ago

why not llama3.2:3B? it has fairly large context window too

1 more reply

reissbaker1y ago

I don't think this is intended for Llama 2? The Llama 3.1 and 3.2 series have very long context windows (128k tokens).

tempodox1y ago

What about using a Modelfile for ollama that tweaks the context window size? I seem to remember parameters for that in the ollama GitHub docs.

tcsenpaiOP1y ago

I applied (for now) a pre-filled table with a 4096 default limit. Users can also specify an upper or lower limit from the UI directly now. Added chunk and recursive summarization too.

htrp1y ago

do multi stage summarization?

tcsenpaiOP1y ago

Hi! This was a good suggestion! I implemented it in v 1.1 which is already out :)

chx1y ago· 6 in thread

Help me understand why people are using these.

I presume you want information of some value to you otherwise you wouldn't bother reading an article. Then you feed it to a probabilistic algorithm and so you can not have any idea what the output has to do with the input. Like https://i.imgur.com/n6hFwVv.png you can somewhat decipher what this slop wants to be but what if the summary leaves out or invents or inverts some crucial piece of info?

InsideOutSanta1y ago

"Then you feed it to a probabilistic algorithm and so you can not have any idea what the output has to do with the input"

This is theoretically true, but to me at least, practically irrelevant. In all cases, for most values of the word "all", the summary does tell you what the article contains.

For me at least, the usefulness is not that the summary replaces reading the article. Instead, it's a signal telling me whether I should read it in the first place.

andrewmcwatters1y ago

People write too much. Get to the point.

ranger_danger1y ago

I think you just insulted every journalist on Earth.

2 more replies

throwup2381y ago

Even if I want to read the entirety of a piece of long form writing I'll often summarize it (with Kagi key points mode) so that I know what the overall points are and can follow the writing better. Too much long form writing is written like some mystery thriller where the writer has to unpack an entire storyline before they'll state their main thesis, so it helps my reading comprehension to know what the point is going in. The personal interest stories that precede the main content always land better that way.

chx1y ago

any point? regardless of what's written? does that work for you?

2 more replies

KaiMagnus1y ago

At least for me it’s less about the individual article, in that case I agree with you, but more about the case where you have 25 articles.

Now you can’t possibly get through all of them and have to decide which of those could be worth your time. And in that case, the tradeoff makes sense.

donclark1y ago· 3 in thread

If we can get this as the default for all the newly posted HN articles please and thank you?

ukuina1y ago

This is why I built https://hackyournews.com

It summarizes via Puter (free).

iJohnDoe1y ago

So cool! Thanks. Bookmarked.

totallymike1y ago

I sincerely hope this never happens

asdev1y ago· 2 in thread

I built a chrome version of this for summarizing HN comments: https://github.com/built-by-as/FastDigest

larodi1y ago

Thank you been thinking about this for long time while copying lots of conversations back and forth Claude.

asdev1y ago

no problem! hope it works out for you. currently only supports Ollama and OpenAI but should be pretty easily extended to Claude and other APIs

oneshtein1y ago· 1 in thread

I use PageAssist with Ollama for two months, but I never called "Summarise" option in menu. :-/

tcsenpaiOP1y ago

TIL, I am experimenting with PageAssist right now

tcsenpaiOP1y ago

Update: v 1.1 is out!

- # Changelog

## [1.1] - 2024-03-19

### Added - New `model_tokens.json` file containing token limits for various Ollama models. - Dynamic token limit updating based on selected model in options. - Automatic loading of model-specific token limits from `model_tokens.json`. - Chunking and recursive summary for long pages - Better handling of markdown returns

### Changed - Updated `manifest.json` to include `model_tokens.json` as a web accessible resource. - Modified `options.js` to handle dynamic token limit updates: - Added `loadModelTokens()` function to fetch model token data. - Added `updateTokenLimit()` function to update token limit based on selected model. - Updated `restoreOptions()` function to incorporate dynamic token limit updating. - Added event listener for model selection changes.

### Improved - User experience in options page with automatic token limit updates. - Flexibility in handling different models and their respective token limits.

### Fixed - Potential issues with incorrect token limits for different models.

j / k navigate · click thread line to collapse

33 comments

25 comments · 6 top-level

RicoElectrico1y ago· 7 in thread

Is llama 2 a good fit considering its small context window?

tcsenpaiOP1y ago

garyfirestorm1y ago

why not llama3.2:3B? it has fairly large context window too

1 more reply

reissbaker1y ago

I don't think this is intended for Llama 2? The Llama 3.1 and 3.2 series have very long context windows (128k tokens).

tempodox1y ago

What about using a Modelfile for ollama that tweaks the context window size? I seem to remember parameters for that in the ollama GitHub docs.

tcsenpaiOP1y ago

I applied (for now) a pre-filled table with a 4096 default limit. Users can also specify an upper or lower limit from the UI directly now. Added chunk and recursive summarization too.

htrp1y ago

do multi stage summarization?

tcsenpaiOP1y ago

Hi! This was a good suggestion! I implemented it in v 1.1 which is already out :)

chx1y ago· 6 in thread

Help me understand why people are using these.

InsideOutSanta1y ago

"Then you feed it to a probabilistic algorithm and so you can not have any idea what the output has to do with the input"

This is theoretically true, but to me at least, practically irrelevant. In all cases, for most values of the word "all", the summary does tell you what the article contains.

For me at least, the usefulness is not that the summary replaces reading the article. Instead, it's a signal telling me whether I should read it in the first place.

andrewmcwatters1y ago

People write too much. Get to the point.

ranger_danger1y ago

I think you just insulted every journalist on Earth.

2 more replies

throwup2381y ago

chx1y ago

any point? regardless of what's written? does that work for you?

2 more replies

KaiMagnus1y ago

At least for me it’s less about the individual article, in that case I agree with you, but more about the case where you have 25 articles.

Now you can’t possibly get through all of them and have to decide which of those could be worth your time. And in that case, the tradeoff makes sense.

donclark1y ago· 3 in thread

If we can get this as the default for all the newly posted HN articles please and thank you?

ukuina1y ago

This is why I built https://hackyournews.com

It summarizes via Puter (free).

iJohnDoe1y ago

So cool! Thanks. Bookmarked.

totallymike1y ago

I sincerely hope this never happens

asdev1y ago· 2 in thread

I built a chrome version of this for summarizing HN comments: https://github.com/built-by-as/FastDigest

larodi1y ago

Thank you been thinking about this for long time while copying lots of conversations back and forth Claude.

asdev1y ago

no problem! hope it works out for you. currently only supports Ollama and OpenAI but should be pretty easily extended to Claude and other APIs

oneshtein1y ago· 1 in thread

I use PageAssist with Ollama for two months, but I never called "Summarise" option in menu. :-/

tcsenpaiOP1y ago

TIL, I am experimenting with PageAssist right now

tcsenpaiOP1y ago

Update: v 1.1 is out!

- # Changelog

## [1.1] - 2024-03-19

### Improved - User experience in options page with automatic token limit updates. - Flexibility in handling different models and their respective token limits.

### Fixed - Potential issues with incorrect token limits for different models.

j / k navigate · click thread line to collapse