Float a small clip over a full-size video — Free Online, No Upload
Picture-in-picture (PiP) puts a small inset video in the corner of a larger one — the look of webcam-over-gameplay streams, reaction videos and tutorial talking heads. This page explains how to build a PiP composite in the browser: pick the main clip, drop a second clip in a corner, size it, blend the audio, and export the result without uploading either file.
Create a clean picture-in-picture composite — inset size, corner and audio all under your control, exported locally with no watermark.
Open the Video Overlay tool →In a PiP composite the main video fills the frame and a smaller inset video plays in a corner on top of it. The tool draws the main video each frame, then draws the inset at the size and position you set, and records the combined picture. The main clip controls the final resolution and length; the inset simply rides along inside it.
The inset has to be small enough to leave the main action visible but large enough to matter. As a rule of thumb, 20–35% of the width works for most layouts; around 25% is the familiar webcam-bubble size. Because size is a percentage, the inset keeps the same relative scale whether you export at 720p or 4K. Add a margin so it doesn't sit flush against the edge.
Bottom-left and bottom-right are the conventional spots — they stay clear of titles at the top and most on-screen action. Use the nine anchor positions to snap the inset exactly where you want, or switch to custom X/Y if the scene needs the inset somewhere specific. Lowering the opacity slightly can help the inset blend when the backgrounds clash.
PiP almost always involves two audio sources — the main clip and the inset. Keep the base audio for the primary footage, and enable Mix overlay audio when the inset's sound matters, such as your commentary over muted gameplay or a reaction over a trailer. If you only need the inset's picture, leave its audio off. Fast 2× / 4× exports drop audio entirely to render quicker.
An inset doesn't have to be on the whole time. Set a timed window so the talking head appears only during the parts where you speak, then disappears so the main clip plays full-frame. Turn on the fade so it eases in and out. For a continuous webcam bubble, leave the timing on Whole video and enable Loop if the inset clip is shorter than the main video. Everything renders locally — no upload, no watermark.
The tool draws the inset as a rectangle. For rounded corners, prepare the inset as a WebM with transparency and a rounded mask, then add it as the overlay.
The one you want full-screen — usually the gameplay, screen recording or main scene. The inset is the smaller webcam or reaction clip.
Yes. The base video keeps its own aspect ratio, so a vertical main clip stays vertical and the inset floats inside it.
No. Both clips are composited in your browser and never sent to a server.