Amongst all the WhatsApp media on my phone I would like to get a list of all the videos and photos with my family in it and then delete the rest.
Is something like this possible with immich?
1. Encode faces, there is a library called face_recognition, that can grab faces from pictures and encode them 2. Group the faces data using `pairwise_distances(encodings, metric='euclidean')`, you only need sklearn library for this
I run it on my iPhone.
Native app. Doesn't require a network connection (great for privacy).
> Queryable is a Core ML model that runs locally on your device. Leveraging OpenAI CLIP's model encoding technology to connect images and text, you can search your iPhone photo album using any natural language input. Most importantly, it is completely offline, so your album privacy will not be revealed to anyone. And, it is open-source: GitHub
After creating Queryable, I also developed an app called MemeSearch, which searches for memes on Reddit (https://apps.apple.com/us/app/memesearch-reddit-meme-finder/...). Although it's completely free, it hasn't been downloaded by many users. I thought nobody wanted it, so I'm glad to see there are still some people who share a similar taste.
Also, I think you should use another icon from this app because it looks like a goofy side project. It probably is but people would probably not download iPhone apps if the icon doesn’t look professional. (My two cents)
https://findthatmeme.com/blog/2023/01/08/image-stacks-and-ip...
[1] https://www.reddit.com/r/AskReddit/comments/jooo5/reddit_ori...
We built and open sourced Indexify https://github.com/tensorlakeai/indexify to make it easy to build resilient pipelines to combine data with many different models and transformations to build applications that relies on embedding or any other metadata extracted by models from Videos, Photos and any documents!
I didn’t know about SigClip, the author mentioned on the blog, need to add this to our library :) I also found it incredible that he generated the crawler with Claude! This is the type of boilerplate I hope we don’t have to write in the future
On my previous job ML department created internal tool, where you could search through city panoramas (like google street view) using text.
It could find you in a second all road pits, overfilled dumpsters and other ugly (and beautiful) things you wanted.
Fwiw, my recent blog is me trying to do this more
We can do this very easily for you. We can provide Search output with context that you can further feed to an LLM for processing to extract events. Let me know if you are interested.
You can get in touch with me at sagar at joyspace dot ai.
If few convert, and most just tell their friends to try your cool demo, you can suddenly have 100,000 reddit users making 200+ requests on average every day cause your free demo's so cool. And suddenly you're mostly trying to figure out how to scrounge up server costs to cover the free parts.
Frankly, seems like the entire industry's probably going to have a lot of the same optimizations pretty soon. "How do we stop delivering such enormous JPGs with every Amazon/eBay click?" and similar.
[1] Slighly old article, so I lower the $/request on compute a bit from $0.0014 to $0.001. https://a16z.com/navigating-the-high-cost-of-ai-compute/
Yeah, Google's and Apple's Photos both can search for pictures given a description of what you're looking for. In my experience both work very well (e.g. search for "cars" in your pics, and it'll find all your cars over the years if you, like me, take pictures with your cars a lot :) ).
I clicked through to your sites 'cause I dig your angle and I saw the bit about the kindle. Ouch, dude. Money sure ain't everything but holy crap.
You have my condolences. Keep building awesome shit, please.
edit: followup question - do you still have it?