Biophysics-Enhanced Neural Representations for Patient-Specific Respiratory Motion Modeling

cs.CV Jan Boysen, Hristina Uzunova, Heinz Handels, Jan Ehrhardt · Mar 23, 2026

What it does

Why it matters

This paper proposes PRISM-RM, a trajectory-aware implicit neural representation (INR) that models lung motion as a continuous diffeomorphic flow driven by external surrogate signals. By integrating neohookean hyperelastic constraints with...

Main concern

Community signal

0 up · 0 down

AI Review AI reviewed

Plain-language introduction

Accurate respiratory motion modeling is critical for radiotherapy precision, yet patient-specific breathing patterns are difficult to predict outside observed ranges. This paper proposes PRISM-RM, a trajectory-aware implicit neural representation (INR) that models lung motion as a continuous diffeomorphic flow driven by external surrogate signals. By integrating neohookean hyperelastic constraints with temporal total-variation regularization, the method eliminates the need for fixed reference breathing states and aims to improve extrapolation to unseen respiratory phases.

Critical review

Verdict

Bottom line

PRISM-RM represents a methodologically sound extension of the authors' prior INR-based motion model, but its clinical utility remains uncertain. While the trajectory-aware formulation significantly improves extrapolation over their previous approach (TRE reduced from 3.66mm to 2.83mm in EI→EE scenarios, p<0.05), it still underperforms the conventional sequential two-stage baseline (1.86mm TRE) that it seeks to replace. The biophysical constraints produce smoother, more plausible displacement fields, but the fundamental limitation of INRs in generalizing beyond training data persists, suggesting that further architectural innovations or hybrid approaches are necessary before INRs can supersede established registration methods in radiotherapy applications.

“Mean TRE: Seq 1.86mm, Integ 3.66mm, PRISM-RM 2.83mm”

Boysen et al., Table 1 · Table 1, rows EI→EE

“PRISM-RM performs comparable to the integrated INR-based respiratory motion model, but outperforms it in extrapolation, although it still performs worse than the sequential two-stage approach”

Boysen et al., Sec. 4 · Section 4

What holds up

The theoretical framework is well-conceived: the diffeomorphic flow formulation using time-dependent velocity fields $v^{\bm{\theta}}$ integrated via Euler schemes provides a continuous representation of respiratory cycles without fixed reference states. The ablation study convincingly demonstrates that temporal regularization $\mathcal{R}_{t}$ is crucial for extrapolation performance, while spatial regularization $\mathcal{R}_{ph}$ alone provides limited benefits outside the interpolation regime. The neohookean hyperelastic regularizer $\mathcal{R}_{ph}(\varphi^{\bm{\theta}})=\mathrm{trace}(\mathbf{C})-3-\mathrm{log}(|\mathbf{J}|^{2})+\lambda(|\mathbf{J}|-1)^{2}$ is physically motivated for lung tissue and produces visibly smoother displacement fields compared to the sequential baseline.

“$\mathcal{R}_{ph}(\varphi^{\bm{\theta}})=\mathrm{trace}(\mathbf{C})-3-\mathrm{log}(|\mathbf{J}|^{2})+\lambda(|\mathbf{J}|-1)^{2}$”

Boysen et al., Sec. 2.3 · Section 2.3, Eq. 9

“for the extrapolation experiment (EI $\rightarrow$ EE) ... the integration of regularization both in space and time improves the performance significantly”

Boysen et al., Sec. 4.1 · Section 4.1

Main concerns

The most significant concern is the performance gap in the critical extrapolation scenario: PRISM-RM achieves 2.83mm mean TRE versus 1.86mm for the sequential approach, representing a 52% larger error that could translate to clinically meaningful dose delivery inaccuracies. The dataset is limited to only 11 patients with 10-14 breathing states each, raising questions about statistical power and generalization across diverse breathing patterns. The improvement over the prior INR method (3.66mm to 2.83mm), while statistically significant, is modest in absolute terms and comes at substantial computational cost (40s per epoch versus 11-29s for ablated variants). Regarding the biophysical constraints, the neohookean model assumes homogeneous tissue properties, whereas lung regions contain heterogeneous structures (tumors, vessels, airways) with varying elastic moduli that a single $\lambda$ parameter cannot capture.

“Mean TRE: Seq 1.86mm, Integ 3.66mm, PRISM-RM 2.83mm”

Boysen et al., Table 1 · Table 1, EI→EE Mean row

“PRISM-RM 40.1s, S 27.8s, T 21.2s (EE experiment)”

Boysen et al., Table 3 · Table 3

“eleven lung cancer patients ... either 10 or 14 images are available”

Boysen et al., Sec. 3.1 · Section 3.1

Evidence and comparison

The comparison to the sequential two-stage approach of Wilms et al. (2014) is appropriate and serves as a rigorous baseline, though the authors' claim that PRISM-RM 'improves the extrapolation ability' requires qualification—it improves upon their prior INR method but not upon the clinical standard. The evidence supports that trajectory-aware sampling with $n(n-1)$ image pairs versus $n-1$ provides more robust training, but the advantage is insufficient to overcome the inherent generalization limitations of coordinate-based neural networks when predicting motion at surrogate signal values outside the training distribution $\mathcal{S}$. The landmark-based evaluation using manually placed anatomical points is appropriate for respiratory motion assessment, though the exclusion of neighboring breathing states in the extrapolation experiment represents a relatively severe test that may not fully represent incremental extrapolation scenarios seen in clinical practice.

“the neighboring breathing states are omitted during training in addition to the EE breathing state”

Boysen et al., Sec. 3.2 · Section 3.2

“we can use $n(n-1)$ registration pairs to learn the network parameters instead of only $n-1$”

Boysen et al., Sec. 2.2 · Section 2.2

Reproducibility

Reproducibility is partially supported but incomplete. The architectural details are thoroughly specified: a sinusoidal activated INR with three hidden layers of 256 neurons, learning rate $1\times10^{-5}$, and hyperparameters $\alpha_{ph}=0.001$, $\alpha_{t}=0.1$, $\lambda=1$. The training procedure describes sampling 10,000 points per epoch with a 40/60 split between observed and interpolated surrogate signals. However, critical implementation details for the Euler integration are underspecified—the exact number of integration steps between arbitrary time points $t_j$ and $t_k$ is not stated, and the network initialization scheme for the SIREN architecture is not confirmed. The code is not publicly available, though the dataset (Wilms et al., 2014) can be obtained from the corresponding author upon request.

“three hidden layers with 256 neurons each ... learning rate is set to $1\times 10^{-5}$ ... $\alpha_{ph}=0.001$ and $\alpha_{t}=0.1$”

Boysen et al., Sec. 3.3 · Section 3.3

“For details on the data availability for third-parties the interested reader is kindly referred to the corresponding author”

Boysen et al., Data statement · Data Availability

Abstract

A precise spatial delivery of the radiation dose is crucial for the treatment success in radiotherapy. In the lung and upper abdominal region, respiratory motion introduces significant treatment uncertainties, requiring special motion management techniques. To address this, respiratory motion models are commonly used to infer the patient-specific respiratory motion and target the dose more efficiently. In this work, we investigate the possibility of using implicit neural representations (INR) for surrogate-based motion modeling. Therefore, we propose physics-regularized implicit surrogate-based modeling for respiratory motion (PRISM-RM). Our new integrated respiratory motion model is free of a fixed reference breathing state. Unlike conventional pairwise registration techniques, our approach provides a trajectory-aware spatio-temporally continuous and diffeomorphic motion representation, improving generalization to extrapolation scenarios. We introduce biophysical constraints, ensuring physiologically plausible motion estimation across time beyond the training data. Our results show that our trajectory-aware approach performs on par in interpolation and improves the extrapolation ability compared to our initially proposed INR-based approach. Compared to sequential registration-based approaches both our approaches perform equally well in interpolation, but underperform in extrapolation scenarios. However, the methodical features of INRs make them particularly effective for respiratory motion modeling, and with their performance steadily improving, they demonstrate strong potential for advancing this field.

Challenge the Review

Pick a starting point or write your own. Challenges run in the background, so you can keep reading while the AI investigates.

Challenges are public to read, but only signed-in members can post them. Your challenge text is stored with your account for moderation, but usernames are not shown in the public thread.

No challenges yet. Disagree with the review? Ask the AI to revisit a specific claim.