EDIT: okay, according to replies there is more than meets the eye
There is an undocumented model name that you can use to access it via the API.
However, future updates will have a configuration to be able to skip REPLICATE, or choose to use a paid OpenAI model.
Looks like the amount of data that is sent is capped:
https://github.com/jjuliano/aifiles/blob/main/.aifiles.sampl...
Also, take note, the maximum payload for OpenAI is 4kb, so the app will just throw an error when it exceeds 4kb.
Are there any standalone command line tools that can be experimented with?
https://playground.helloforefront.com/models/free-gpt-j-play...
EDIT: looks like you guys hammered it down. Here is another playground (box on the right):
To successfully exploit it an attacker would need to place a file with malicious prompt on your hard drive. However, if it's the case then there will be a lot more easier ways to execute various attacks.
How will you know if a file is free from malicious prompt or not? The applications seems to be able to download any file and analyze it. So from my perspective, I think it is easier this way than to execute other attack? Because these files may seem benign but can still run instructions from the prompts. Just think that the next pdf you are downloading from the web has has no malware but only malicious prompt. What will you do?
And it suggests tags and summarizes/describes the file based on its contents, then finally attach those tags and comment to the file.
For example, if you have an unnamed file ‘document.doc’ that contains information about a parking ticket, then it will rename this file ‘ParkingTicket.doc’, you can add more organizational details like categories, etc.
It does the same as well for Images and Music.
Will give it a run
[1] https://github.com/jjuliano/aifiles/blob/ef529fd6281eaf8d373...
[2] https://github.com/jjuliano/aifiles/blob/ef529fd6281eaf8d373...
He argued below that he is not vulnerable to indirect prompt injection attacks (https://github.com/greshake/llm-security), but I think he is wrong.
Ie what tech stack it uses, languages and the like.