FourieRF: Few-Shot NeRFs via Progressive Fourier Frequency Control

LIX, École Polytechnique
3DV 2025

Abstract

We present a novel approach for few-shot NeRF estimation, aimed at avoiding local artifacts and capable of efficiently reconstructing real scenes. In contrast to previous methods that rely on pre-trained modules or various data-driven priors that only work well in specific scenarios, our method is fully generic and is based on controlling the frequency of the learned signal in the Fourier domain. We observe that in NeRF learning methods, high-frequency artifacts often show up early in the optimization process, and the network struggles to correct them due to the lack of dense supervision in few-shot cases. To counter this, we introduce an explicit curriculum training procedure, which progressively adds higher frequencies throughout optimization, thus favoring global, low-frequency signals initially, and only adding details later. We represent the radiance fields using a grid-based model and introduce an efficient approach to control the frequency band of the learned signal in the Fourier domain. Therefore our method achieves faster reconstruction and better rendering quality than purely MLP-based methods. We show that our approach is general and is capable of producing high-quality results on real scenes, at a fraction of the cost of competing methods. Our method opens the door to efficient and accurate scene acquisition in the few-shot NeRF setting.

MY ALT TEXT

Method Architecture. Our method initializes 1D and 2D features in spatial space, projects them into the Fourier domain, and clips them with specialized filters. Projecting the clipped features back to spatial space allows us to retrieve smooth shapes.

Progressively Integrating Complexity



RGB (Left) and Depth prediction (Right) throughout training.

Overall, our method is built on two key observations. First, we note that both strong overfitting and high-frequency artifacts typically occur early in the optimization process, and, if avoided in these early stages, they are significantly less prominent in the final result. Second, we note that by gradually increasing the maximal Fourier frequency of the learned signal both significantly regularizes the learned NeRF, while at the same time, providing the network enough degrees of freedom to learn the fine details (in the final stages of the optimization).

RGB/Depth comparison

Compare our method with TensoRF or ZeroRF in the following settings.

Comparison between our method (Left) with TensoRF or ZeroRF (Right)

Our method is the best accelerated apporach to process real scenes in the few-shot rendering problem. We can see that both the baselines TensoRF and ZeroRF fail to capture correct geometry and appearance. These methods fill the scene with incoherent geometry and floaters.

BibTeX

BibTex code will be here