Clips AI
The resizing feature in ClipsAI dynamically adjusts a video to focus on the current speaker. It utilizes speaker diarization with Pyannote, scene change detection with PySceneDetect, and face detectio
The resizing feature in ClipsAI dynamically adjusts a video to focus on the current speaker. It utilizes speaker diarization with Pyannote, scene change detection with PySceneDetect, and face detectio
Speaker diarization
identifies and tracks who is speaking throughout the video using Pyannote technology
Scene change detection
recognises when scenes shift to avoid awkward crops during transitions
Face detection
locates faces in frames using MTCNN and MediaPipe to keep subjects centred
Aspect ratio customisation
resizes videos to fit different platform requirements, from vertical mobile formats to horizontal widescreen
Batch processing capability
processes video files through an API, supporting both video-only and audio-video files
Converting long-form podcast or interview footage into short vertical clips for TikTok, Instagram Reels, or YouTube Shorts
Automatically adapting webinar or conference recordings for multiple social platforms
Creating focus-adjusted versions of multi-speaker videos for accessibility or emphasis
Batch processing recorded content to maintain consistent framing across a series of videos
Repurposing widescreen content for mobile-first distribution without manual editing