具有显式3D建模的世界一致性视频扩散
As diffusion models dominating visual content generation, efforts have been made to adapt these models for multi-view image generation to create 3D content. Traditionally, these methods implicitly...
本文提出了一种新方法,通过生成归一化坐标空间(NCS)帧与RGB帧,改进多视图图像生成,增强3D一致性。该方法在训练中联合估计RGB和NCS帧,利用去噪修补策略推断条件分布,提升相机姿态估计能力,建立统一的3D模型基准。
