1Building a robust ball tracking system for sports with SAM 2 (opens in new tab)(sievedata.com)1mvoodarla1y ago0Save
3Guide to pure-audio and audiovisual speaker recognition techniques (opens in new tab)(sievedata.com)1mvoodarla1y ago0Save
4SieveSync: Realistic, zero-shot lipsync pipeline using MuseTalk and LivePortrait (opens in new tab)(github.com)GitHub1mvoodarla1y ago0Save
5SieveSync: High-quality, zero-shot lipsync built with MuseTalk and LivePortrait (opens in new tab)(sievedata.com)4mvoodarla1y ago0Save
8Finding highlights in long-form video automatically with custom search terms (opens in new tab)(sievedata.com)1mvoodarla2y ago0Save
9Describe Beta: The most descriptive audiovisual summaries for videos (opens in new tab)(github.com)GitHub2mvoodarla2y ago1Save
10AI-generated sound effects for stock videos using CogVLM and AudioLDM (opens in new tab)(sievedata.com)1mvoodarla2y ago0Save
11AI active speaker detection on video with a 90% speedup (opens in new tab)(sievedata.com)3mvoodarla2y ago0Save
12Masked Audio Generation Using a Single Non-Autoregressive Transformer (opens in new tab)(pages.cs.huji.ac.il)1mvoodarla2y ago0Save
13Masked Audio Generation Using a Single Non-Autoregressive Transformer (opens in new tab)(arxiv.org)arXiv1mvoodarla2y ago0Save
14The most cost-effective audio transcription API (opens in new tab)(sievedata.com)3mvoodarla2y ago0Save
15Audiobox Demo: Where anyone can make a sound with an idea (opens in new tab)(audiobox.metademolab.com)3mvoodarla2y ago0Save