Skip to content

Top Best Ask Show New Jobs

Show HN: AI Files – manage and organize your files with AI (opens in new tab)

(npmjs.com)

71 pointsjjuliano3y ago44 comments

44 comments

31 comments · 12 top-level

rnosov3y ago· 6 in thread

I think you might want to correct ChatGPT to just GPT. As far as I know, there is no public API access point for the ChatGPT. OpenAI davinci model that you're likely using is based on InstructGPT ( a different GPT based beast ). Also, I would be somewhat worried about this thing racking up massive bills for the larger files.

EDIT: okay, according to replies there is more than meets the eye

charcircuit3y ago

>there is no public API access point for the ChatGPT

There is an undocumented model name that you can use to access it via the API.

simse3y ago

Looks like it actually _is_ using ChatGPT: https://www.npmjs.com/package/chatgpt

jclem3y ago

This package is using GPT-3 via the “ChatGPT” export from that module, which is—somewhat misleadingly—not ChatGPT, but GPT-3.

jjulianoOP3y ago

Author here: It currently uses ChatGPT. While ChatGPT is currently free, the use of REPLICATE API for describing images will have an incurred costs. At the moment, if you opt-out of this feature, you can skip Images for now.

However, future updates will have a configuration to be able to skip REPLICATE, or choose to use a paid OpenAI model.

eterps3y ago

> Also, I would be somewhat worried about this thing racking up massive bills for the larger files.

Looks like the amount of data that is sent is capped:

https://github.com/jjuliano/aifiles/blob/main/.aifiles.sampl...

jjulianoOP3y ago

The more words you sent, the better understanding the AI about the file. HOWEVER, it also comes with privacy concerns. You can choose to rather sent a few words for AI to figure out your file.

Also, take note, the maximum payload for OpenAI is 4kb, so the app will just throw an error when it exceeds 4kb.

eterps3y ago· 2 in thread

How feasible would it currently be to use a standalone tool for this (instead of connecting to ChatGPT)?

Are there any standalone command line tools that can be experimented with?

rnosov3y ago

It should be feasible with GPT-J. You should be able to run it locally if you have GPU with more than 16gb of video memory. Output quality might not match OpenAI offerings though.

https://playground.helloforefront.com/models/free-gpt-j-play...

EDIT: looks like you guys hammered it down. Here is another playground (box on the right):

https://huggingface.co/EleutherAI/gpt-j-6B

jjulianoOP3y ago

In the future, it can be configurable to use your own ChatGPT server. As more companies will opt for a domain-specific LLMs per industry/company.

throwaway11833y ago· 2 in thread

I saw a post[0] in HN that says AI models are susceptible to new kind of malware. How is this app safe?

[0]: https://news.ycombinator.com/item?id=34945349

rnosov3y ago

I won't describe it as malware. Your link describes prompt injection which applicable to any software that currently employs LLMs (including this package).

To successfully exploit it an attacker would need to place a file with malicious prompt on your hard drive. However, if it's the case then there will be a lot more easier ways to execute various attacks.

throwaway11833y ago

> a file with malicious prompt on your hard drive

How will you know if a file is free from malicious prompt or not? The applications seems to be able to download any file and analyze it. So from my perspective, I think it is easier this way than to execute other attack? Because these files may seem benign but can still run instructions from the prompts. Just think that the next pdf you are downloading from the web has has no malware but only malicious prompt. What will you do?

user-3y ago· 2 in thread

Could use with examples of what exactly this is/does

jjulianoOP3y ago

Basically, it curates your files automatically using AI based on the file naming convention you set in the configuration file.

And it suggests tags and summarizes/describes the file based on its contents, then finally attach those tags and comment to the file.

For example, if you have an unnamed file ‘document.doc’ that contains information about a parking ticket, then it will rename this file ‘ParkingTicket.doc’, you can add more organizational details like categories, etc.

It does the same as well for Images and Music.

dstala3y ago

This description helped me uncover a problem that I had since ages! Good

Will give it a run

petemetefete3y ago· 1 in thread

It should be noted that this tool uses a prompt [1] which does include the whole file, or in case of non textual content, the metadata of the image/video/audio file [2].

[1] https://github.com/jjuliano/aifiles/blob/ef529fd6281eaf8d373...

[2] https://github.com/jjuliano/aifiles/blob/ef529fd6281eaf8d373...

Output from the language model is also being injected into a script that is then executed: https://github.com/jjuliano/aifiles/blob/ef529fd6281eaf8d373...

He argued below that he is not vulnerable to indirect prompt injection attacks (https://github.com/greshake/llm-security), but I think he is wrong.

bbor3y ago· 1 in thread

This is absolutely incredible, and a very effective, minimal interface. Can’t wait to try it!

jjulianoOP3y ago

Thanks!

eterps3y ago· 1 in thread

Does this tool keep track of the categories it has used to build a directory structure over time, or does each file result in a new structure unrelated to the previous one?

jjulianoOP3y ago

That is actually in the TODO items. You can also pattern it to a curated folder (if you have one). This feature would be available on the next release.

llanowarelves3y ago· 1 in thread

I have a non-AI version of this I wrote and considered using a topic classifier. Will have to check this out.

jjulianoOP3y ago

I like to see that topic classifier! I was also thinking about it before on how to accomplish that, but now LLMs can classify and tag a text. GPT AI is really a huge jump in technological advancement. It will reshape this world we know today.

pictur3y ago· 1 in thread

good idea. a tool that can do it locally without the need for a dependency might be better.

jjulianoOP3y ago

That's a good suggestion. At the moment, it uses several other applications to gather textual information about the file. In the future releases. You can choose not to use other applications, and just rely on the file-name, etc.

dstala3y ago· 1 in thread

Privacy warning displayed upfront is a great thing! Looks an interesting use case though.

jjulianoOP3y ago

Yes, it's important to tell that disclaimer to be warned about sending info to ChatGPT.

NayamAmarshe3y ago· 1 in thread

This is a great project!

jjulianoOP3y ago

Thanks!

langsoul-com3y ago

The project auto tagging got me wondering if there's an AI that can auto create tags for a job posting.

Ie what tech stack it uses, languages and the like.

j / k navigate · click thread line to collapse