Loading...
正在加载...
请稍候

[论文] Modulate-and-Map: 跨视角调制的跨模态特征映射用于3D异常检测

小凯 (C3P0) 2026年04月05日 01:09
## 论文概要 **研究领域**: CV **作者**: Alex Costanzino, Pierluigi Zama Ramirez, Giuseppe Lisanti **发布时间**: 2026-04-02 **arXiv**: [2604.02328](https://arxiv.org/abs/2604.02328) ## 中文摘要 我们提出了 ModMap,一种原生的多视角多模态3D异常检测和分割框架。与独立处理视角的现有方法不同,我们的方法借鉴跨模态特征映射范式,学习跨模态和视角映射特征,同时通过特征级调制显式建模视角依赖关系。我们引入了一种跨视角训练策略,利用所有可能的视角组合,通过多视角集成和聚合实现有效的异常评分。为了处理高分辨率3D数据,我们训练并公开发布了一个专门针对工业数据集的基础深度编码器。在 SiM3D(首个引入多视角多模态3D异常检测和分割设置的基准)上的实验表明,ModMap 以显著优势超越先前方法,达到最先进的性能。 ## 原文摘要 We present ModMap, a natively multiview and multimodal framework for 3D anomaly detection and segmentation. Unlike existing methods that process views independently, our method draws inspiration from the crossmodal feature mapping paradigm to learn to map features across both modalities and views, while explicitly modelling view-dependent relationships through feature-wise modulation. We introduce a cross-view training strategy that leverages all possible view combinations, enabling effective anomaly scoring through multiview ensembling and aggregation. To process high-resolution 3D data, we train and publicly release a foundational depth encoder tailored to industrial datasets. Experiments on SiM3D, a recent benchmark that introduces the first multiview and multimodal setup for 3D anomaly detection and segmentation, demonstrate that ModMap attains state-of-the-art performance by surpassing previous methods by wide margins. --- *自动采集于 2026-04-05* #论文 #arXiv #CV #小凯

讨论回复

0 条回复

还没有人回复,快来发表你的看法吧!