I'm very interested in working with video inputs, is it possible to do that with...

oezi · 2025-05-16T18:49:15 1747421355

I have only tested Qwen2.5-Omni for audio and it was hit and miss for my use case of tagging audio.

tough · 2025-05-17T00:24:12 1747441452

machinelearning · 2025-05-16T20:35:24 1747427724

What's a use case are you interested in re: video?

prettyblocks · 2025-05-17T03:57:39 1747454259

I'm curious how effective these models would be at recognizing if the input video was ai generated or heavily manipulated. Also various things around face/object segmentation.