Coding Shuttle CSS Video Float

CoFaCo: Controllable Generative Talking Face Video Coding

Abstract: Efficient talking face video coding and control are crucial in modern video communication, reshaping how individuals connect, collaborate, and interact. Coding seeks to reduce transmission ...

GitHub

Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors

Previous research has investigated the application of Multimodal Large Language Models (MLLMs) in understanding 3D scenes by interpreting them as videos. These approaches generally depend on ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

CoFaCo: Controllable Generative Talking Face Video Coding

Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors

Trending now