TrustFed: Enabling Trustworthy Medical AI under Data Privacy Constraints

cs.LG cs.CY Vagish Kumar, Syed Bahauddin Alam, Souvik Chakraborty · Mar 23, 2026
Local to this browser
What it does
Federated learning enables privacy-preserving medical AI but struggles with unreliable uncertainty estimates when clinical data is heterogeneous and imbalanced across sites. TrustFed addresses this by introducing representation-aware...
Why it matters
TrustFed addresses this by introducing representation-aware conformal prediction, which assigns test samples to calibration clients based on feature-space similarity and aggregates local thresholds via a soft-nearest strategy to provide...
Main concern
TrustFed presents a compelling empirical advance as the first federated conformal prediction framework to demonstrate robust coverage across severe heterogeneity and class imbalance in large-scale medical imaging. However, the theoretical...
Community signal
0
0 up · 0 down
Sign in to vote with arrows
AI Review AI reviewed
Plain-language introduction

Federated learning enables privacy-preserving medical AI but struggles with unreliable uncertainty estimates when clinical data is heterogeneous and imbalanced across sites. TrustFed addresses this by introducing representation-aware conformal prediction, which assigns test samples to calibration clients based on feature-space similarity and aggregates local thresholds via a soft-nearest strategy to provide finite-sample coverage guarantees without centralizing raw data. Validated on over 430,000 images across six distinct imaging modalities, the work advances federated learning from privacy-preserving training toward clinically trustworthy deployment with statistically calibrated uncertainty.

Critical review
Verdict
Bottom line

TrustFed presents a compelling empirical advance as the first federated conformal prediction framework to demonstrate robust coverage across severe heterogeneity and class imbalance in large-scale medical imaging. However, the theoretical validity rests on heuristic mitigation of selection bias via the max-operator, and the framework requires sharing test sample embeddings with all clients—an unquantified privacy leakage that contradicts the strict privacy claims. While the empirical results are strong, the lack of formal coverage proofs under adaptive calibration neighborhood selection and the empirical (rather than adaptive) choice of neighbor count $\mathcal{N}_k$ limit the rigor of the guarantees claimed.

“While such adaptive selection could, in principle, induce selection bias, TrustFed mitigates this risk by taking the maximum threshold across the selected neighborhood”
paper · Soft-nearest threshold aggregation
“In practice, we select $\mathcal{N}_k$ empirically by choosing the smallest value that achieves the target coverage level $1-\alpha$ with a small safety margin”
paper · Soft-nearest threshold aggregation
What holds up

The extensive empirical evaluation across six clinically distinct modalities (blood cells, retina, CT, dermatology, histopathology) totaling 430,000 images strongly supports the method's robustness. The representation-aware client assignment consistently outperforms raw-pixel baselines, and the soft-nearest aggregation successfully stabilizes coverage without inflating prediction set sizes, demonstrating that localized exchangeability assumptions can substitute for global ones in heterogeneous federated settings. The choice of CortiNet as the backbone architecture is well-justified for medical imaging, and the privacy-preserving aggregation of scalar thresholds (rather than scores) is a practical design.

“Across all datasets, feature-space similarity significantly outperforms raw-pixel assignment in top-$k$ client assignment accuracy and computational efficiency”
paper · Representation-aware client assignment
“TrustFed consistently maintains empirical coverage near nominal levels for both majority and minority classes, demonstrating robustness to extreme distributional skew”
paper · Performance under extreme class imbalance
Main concerns

The primary methodological concern is that the adaptive selection of calibration neighborhoods based on test embeddings $f_i = \phi(x_{test})$ violates the standard exchangeability assumption of conformal prediction. While the authors propose $\tau(x_{test}) = \max_{j \in \mathcal{N}_k} \tau_j$ as a conservative correction, they provide no formal proof that this preserves marginal coverage under the induced selection bias. Additionally, the requirement to broadcast test embeddings to all clients for distance computation $d_{i,k} = \min_{1\leq j\leq m_k} \|f_i - b_{k,j}\|_2$ introduces a privacy channel not analyzed in the threat model, and the framework is limited to classification tasks with a fixed, empirically-tuned neighborhood size $\mathcal{N}_k$ rather than adaptive selection.

“This smoothing mechanism introduces a mild form of data-dependent conditioning, since the effective calibration neighborhood depends on the test representation, departing from the classical split conformal setting where the calibration set is fixed a priori”
paper · Soft-nearest threshold aggregation
“Each client computes $d_{i,k}$ locally and returns only this scalar distance to the central server”
paper · Representation-aware client assignment
Evidence and comparison

The evidence supports the claim that TrustFed outperforms global pooling baselines (FCP) and purely local calibration under heterogeneity, with systematic improvements in coverage stability across imbalanced class distributions. However, the comparison omits recent federated conformal methods that explicitly handle label shift or leverage different aggregation strategies, leaving unclear whether the representation-aware approach dominates state-of-the-art alternatives. The paper also lacks ablations quantifying the individual contribution of the CortiNet architecture versus the conformal wrapper, and the theoretical comparison to existing federated conformal bounds (e.g., Lu et al., Plassier et al.) is absent.

Reproducibility

The study utilizes publicly available MedMNIST datasets and promises GitHub code release upon acceptance, which facilitates reproducibility. However, critical implementation details necessary for independent reproduction are missing from the main text, including the specific neighbor counts $\mathcal{N}_k$ used for each dataset, the dimensionality of the feature embeddings $\phi(x)$, communication rounds $R$, and local epochs $E$. The distance computation relies on Euclidean norm in representation space, but the exact layer from which embeddings are extracted is not specified, and the privacy analysis omits quantification of information leakage through embedding sharing.

“On acceptance, all the source codes to reproduce the results in this study will be made available to the public on GitHub by the corresponding author”
paper · Code availability
Abstract

Protecting patient privacy remains a fundamental barrier to scaling machine learning across healthcare institutions, where centralizing sensitive data is often infeasible due to ethical, legal, and regulatory constraints. Federated learning offers a promising alternative by enabling privacy-preserving, multi-institutional training without sharing raw patient data; however, real-world deployments face severe challenges from data heterogeneity, site-specific biases, and class imbalance, which degrade predictive reliability and render existing uncertainty quantification methods ineffective. Here, we present TrustFed, a federated uncertainty quantification framework that provides distribution-free, finite-sample coverage guarantees under heterogeneous and imbalanced healthcare data, without requiring centralized access. TrustFed introduces a representation-aware client assignment mechanism that leverages internal model representations to enable effective calibration across institutions, along with a soft-nearest threshold aggregation strategy that mitigates assignment uncertainty while producing compact and reliable prediction sets. Using over 430,000 medical images across six clinically distinct imaging modalities, we conduct one of the most comprehensive evaluations of uncertainty-aware federated learning in medical imaging, demonstrating robust coverage guarantees across datasets with diverse class cardinalities and imbalance regimes. By validating TrustFed at this scale and breadth, our study advances uncertainty-aware federated learning from proof-of-concept toward clinically meaningful, modality-agnostic deployment, positioning statistically guaranteed uncertainty as a core requirement for next-generation healthcare AI systems.

Challenge the Review

Pick a starting point or write your own. Challenges run in the background, so you can keep reading while the AI investigates.

No challenges yet. Disagree with the review? Ask the AI to revisit a specific claim.