Optimal Framewise Displacement Thresholds for fMRI Censoring: A Researcher's Guide to Balancing Data Quality and Bias

Charles Brooks Dec 02, 2025 50

This article provides a comprehensive guide for researchers and drug development professionals on selecting optimal framewise displacement (FD) thresholds for censoring motion-corrupted volumes in functional MRI.

Optimal Framewise Displacement Thresholds for fMRI Censoring: A Researcher's Guide to Balancing Data Quality and Bias

Abstract

This article provides a comprehensive guide for researchers and drug development professionals on selecting optimal framewise displacement (FD) thresholds for censoring motion-corrupted volumes in functional MRI. It explores the foundational principles of motion artifacts, evaluates methodological approaches for threshold application, addresses key challenges in data quality versus bias trade-offs, and compares validation strategies across diverse datasets and populations. Synthesizing recent evidence from large-scale studies like ABCD and HCP, this review offers practical, evidence-based recommendations to enhance the reliability and reproducibility of functional connectivity and brain-wide association studies.

Understanding Motion Artifacts and the Critical Role of Framewise Displacement

The Impact of Head Motion on fMRI Signal Quality and Functional Connectivity

Troubleshooting Guides

Why does my fMRI data show systematic changes in functional connectivity, and how can I fix it?

Problem: You have observed systematic changes in functional connectivity measures, such as decreased long-distance connectivity and increased short-range connectivity, particularly in networks like the default mode network.

Explanation: Head motion is a significant confounding factor in fcMRI studies. Even small, sub-millimeter movements introduce systematic bias that is not completely removed by standard denoising algorithms [1] [2]. This occurs because motion artifacts create synchronized signal changes across multiple brain regions, producing spurious correlations that can be mistaken for neuronal effects [1] [3].

Solutions:

Implement frame censoring: Identify and remove motion-contaminated volumes using framewise displacement (FD) or DVARS metrics. A threshold of FD < 0.2-0.3 mm is effective for reducing motion-related artifacts while retaining sufficient data [4] [2].
Apply ICA-based denoising: Use independent component analysis to automatically identify and remove motion-related artifacts from your data [4] [3].
Include motion parameters: Regress out the 6 or 24 canonical head motion parameters as nuisance variables in your general linear model [3].
Consider prospective motion correction: If available, use MR-compatible optical tracking systems to update scan parameters in real-time during acquisition [5].

How can I determine if my brain-behavior correlations are contaminated by motion artifacts?

Problem: You are studying traits potentially associated with motion (e.g., psychiatric disorders, age-related changes) and need to verify that your trait-FC relationships are not driven by residual motion artifacts.

Explanation: Certain participant populations (children, older adults, patients with neurological or psychiatric disorders) tend to move more during scanning. Since motion systematically affects FC measures, this can create spurious brain-behavior correlations that are often misinterpreted as neuronal effects [1] [2].

Solutions:

Use the SHAMAN method: Implement Split Half Analysis of Motion Associated Networks to compute a trait-specific motion impact score that distinguishes between motion causing overestimation or underestimation of trait-FC effects [2].
Compare motion levels between groups: Ensure groups with different trait values do not systematically differ in their head motion parameters [1].
Apply rigorous censoring: For motion-correlated traits, use stricter censoring thresholds (FD < 0.2 mm) to reduce false positives [2].
Validate with multiple methods: Compare results across different motion mitigation strategies to ensure consistency [3].

What should I do when participants move excessively during scanning?

Problem: You are working with a high-motion population (e.g., young children, clinical populations) and struggle to acquire sufficient usable data.

Explanation: High-motion participants present a particular challenge because excessive frame censoring can bias sample distributions by systematically excluding individuals who may exhibit important variance in the trait of interest [4] [2].

Solutions:

Use real-time monitoring: Implement software like Framewise Integrated Real-time MRI Monitoring (FIRMM) to track data quality during acquisition and continue scanning until sufficient low-motion data is collected [4].
Acquire multiple runs: Collect several shorter runs rather than one long run to increase the likelihood of obtaining adequate low-motion data across sessions [4].
Optimize preprocessing pipeline: Combine volume censoring with ICA-based denoising instead of traditional filtering and nuisance regression [4].
Employ prospective motion correction: If available, use PMC systems to improve data quality despite large head movements [5].

Frequently Asked Questions (FAQs)

What is the optimal framewise displacement threshold for censoring fMRI data?

The optimal FD threshold depends on your research goals and participant population. For most studies, thresholds between 0.2-0.3 mm provide a good balance between removing motion artifacts and retaining data [4] [2]. Stricter thresholds (FD < 0.2 mm) significantly reduce motion-related false positives but may increase data loss [2]. The table below summarizes recommendations based on recent studies:

Table: Framewise Displacement Threshold Recommendations

FD Threshold	Use Case	Benefits	Limitations
0.2 mm	Studies of motion-correlated traits	Reduces significant motion overestimation to ~2% of traits [2]	Higher data exclusion rates
0.3 mm	General research with typical populations	Retains ~83% of participants in high-motion cohorts [4]	Some residual motion effects may persist
0.4-0.5 mm	Exploratory studies or when data retention is priority	Maximizes data retention	Increased risk of motion-contaminated results

How does head motion specifically affect functional connectivity measures?

Head motion systematically alters functional connectivity in specific, reproducible patterns:

Decreased long-distance connectivity: Especially between distributed regions of association cortex like the default and frontoparietal control networks [1]
Increased local functional coupling: Particularly in sensory and motor regions [1]
Spurious correlation patterns: That could be mistaken for neuronal effects in other contexts [1]
Distance-dependent effects: Longer-distance connections show more susceptibility to motion artifacts [2]

Table: Effects of Head Motion on Specific Brain Networks

Brain Network	Effect of Motion	Potential Misinterpretation
Default Mode Network	Decreased functional coupling [1]	Apparent network disruption in clinical populations
Frontoparietal Control Network	Decreased functional coupling [1]	False group differences in cognitive studies
Motor Network	Increased coupling between left and right regions [1]	Overestimation of bilateral integration
Visual Network	Altered spatial definition [5]	Impaired network identification

What preprocessing strategies most effectively mitigate motion artifacts?

No single approach consistently outperforms others across all datasets and tasks [3]. The most effective strategy often involves combining multiple techniques:

Table: Motion Mitigation Techniques Comparison

Technique	Mechanism	Best For	Considerations
Frame Censoring	Removes high-motion volumes from analysis	Reducing spurious correlations [3]	Causes data loss; requires threshold selection
Motion Parameter Regression	Regresses out realignment parameters as nuisance variables	Standard task-based fMRI [3]	May not remove all motion-related variance
ICA-Based Denoising	Identifies and removes motion-related components	High-motion pediatric data [4]	Requires careful component classification
Prospective Motion Correction	Updates scan parameters in real-time during acquisition	Improving data quality during large movements [5]	Requires specialized equipment
Wavelet Despiking	Identifies artifacts across temporal scales	Complementary approach to other methods [3]	Less commonly used for task-based fMRI

How much does motion typically affect fMRI signal quality?

Motion substantially degrades fMRI signal quality. Studies show that:

Without comprehensive denoising, head motion can explain up to 73% of signal variance in fMRI data [2]
Even after denoising with advanced pipelines (e.g., ABCD-BIDS), motion still explains approximately 23% of signal variance [2]
Prospective motion correction can increase temporal signal-to-noise ratio (tSNR) by 20-45% in data affected by large head movements [5]

Experimental Protocols & Methodologies

Protocol: Frame Censoring for High-Motion Pediatric Data

This protocol from [4] successfully obtained usable resting-state data from first-grade children (age 6-8) despite extreme motion:

Real-time Monitoring: Use FIRMM software during scanning to monitor head motion and continue acquisition until at least 4 minutes of total data comprised of frames with <0.4 mm FD is obtained.
Volume Censoring: Identify and remove outlier volumes exceeding FD threshold of 0.3 mm.
Data Concatenation: Combine data from multiple runs after censoring.
ICA-Based Denoising: Apply independent component analysis with automatic component classification to remove residual motion artifacts.
Quality Assessment: Verify effectiveness by analyzing correlation between participant head motion and temporal signal-to-noise ratio/functional connectivity.

Protocol: Multi-Dataset Evaluation of Motion Correction Approaches

This comprehensive evaluation from [3] compared motion correction efficacy across 8 public datasets:

Data Selection: Include diverse tasks, acquisition parameters, and participant populations.
Pipeline Implementation: Apply identical preprocessing pipelines across all datasets.
Method Comparison: Evaluate 7 different motion correction approaches:
- 6 canonical motion regressors (RP6)
- 24-term motion regressors (RP24)
- Wavelet despiking (WDS)
- Robust weighted least squares (rWLS)
- Untrained ICA (uICA)
- Frame censoring based on FD
- Frame censoring based on DVARS
Performance Quantification: Assess using maximum group t-statistics, ROI-based activation estimates, and test-retest reliability.

Signaling Pathways & Experimental Workflows

fMRI Motion Correction Workflow: This diagram illustrates the decision process for addressing head motion in fMRI studies, incorporating both prospective (real-time) and retrospective (post-processing) correction methods with key threshold selection points.

The Scientist's Toolkit: Research Reagent Solutions

Table: Essential Resources for fMRI Motion Research

Resource	Type	Function	Example Implementation
Framewise Displacement (FD)	Metric	Quantifies volume-to-volume head movement [4] [3]	Threshold of 0.2-0.3 mm for censoring [2]
DVARS	Metric	Measures rate of change of BOLD signal across brain [3]	Complementary to FD for identifying corrupted volumes
FIRMM	Software	Real-time monitoring of head motion during scanning [4]	Acquire sufficient low-motion frames in pediatric cohorts
ICA Components	Algorithm	Separates data into independent spatial components [4]	Automatic identification of motion-related artifacts
Prospective Motion Correction	Hardware/Software	Updates scan parameters in response to head motion [5]	Improves tSNR and resting state network definition
SHAMAN	Analytical Method	Quantifies trait-specific motion impact [2]	Detects spurious brain-behavior relationships

What is Framewise Displacement (FD)?

Framewise Displacement (FD) is a quantitative metric that summarizes head motion between consecutive volumes in a functional magnetic resonance imaging (fMRI) time series. It provides a single, comprehensive value (in millimeters) for each time point, representing the instantaneous head movement from one brain volume to the next [6] [7].

FD is calculated from the six realignment parameters (RPs) generated during motion correction. These RPs consist of three translational (X, Y, Z) and three rotational (pitch, yaw, roll) parameters. The formula for calculating FD at timepoint i is [7]:

FDi = |Δxi| + |Δyi| + |Δzi| + |Δαi| + |Δβi| + |Δγ_i|

Where:

Δx_i = x_{i-1} - x_i (and similarly for y, z)
Δα_i = α_{i-1} - α_i (and similarly for β, γ)

Rotational displacements, originally in degrees or radians, are converted to millimeters by calculating the arc length on a sphere of a specified radius, typically 50 mm [8] [7]. This conversion allows all movement components to be expressed in a consistent spatial unit.

FD Calculation and Censoring Workflow

The following diagram illustrates the standard workflow for processing fMRI data, from calculating realignment parameters to applying FD-based censoring.

Common FD Censoring Thresholds in Research

The choice of FD threshold for censoring high-motion volumes involves a trade-off between data quality and data retention. There is no universal value, and the optimal threshold can depend on the study population (e.g., adults, children, clinical groups) [9].

The table below summarizes FD censoring thresholds commonly found in the literature.

FD Threshold (mm)	Typical Usage Context	Key Rationale or Consideration
0.1 - 0.15	Highly conservative studies [10]	Attempts to remove all motion effects; may lead to excessive data loss, potentially excluding over 50-70% of a sample [10].
0.2	Standard, moderately conservative threshold [8] [10]	A commonly used benchmark to balance rigor and data retention.
0.3 - 0.5	High-motion cohorts (e.g., pediatric, clinical) [9]	Retains more data and participants in challenging populations while still removing the most severe motion [9]. Significant changes in correlations begin to be detected at FD > 0.5 mm [8].

Essential Research Reagents & Tools

To implement FD-based censoring in your fMRI research, the following computational tools and resources are essential.

Tool or Resource	Function	Example Software / Package
fMRI Preprocessing Suite	Performs motion correction to generate the 6 realignment parameters.	FSL (MCFLIRT), AFNI, SPM [6]
FD Calculation Script	Computes the FD time series from the realignment parameters.	fMRIscrub (R) [7], BRAMILA (Matlab) [6], AFNI scripts
Nuisance Regression Tool	Regresses out motion parameters and other confounding signals from the BOLD data.	AFNI (3dDeconvolve), FSL (FEAT), SPM [11] [6]
Volume Censoring Utility	Identifies and removes volumes exceeding the specified FD threshold.	Integrated feature in pipelines like XCP-D [12] or custom scripts via AFNI/FSL.

Troubleshooting FAQ

Q1: After rigorous participant exclusion (e.g., mean FD > 0.2 mm), my behavioral variable is still correlated with mean FD. What should I do?

This is a common challenge. Possible solutions include:

Include mean FD as a nuisance covariate in your group-level statistical model (e.g., in your GLM). This statistically controls for the residual effects of motion [10].
Acknowledge the limitation in your study discussions, as no method can perfectly remove motion artifacts, and covarying may not eliminate all spurious effects [10].
Be cautious about using an overly strict exclusion threshold (e.g., > 0.1 mm) as it can discard a very large portion of your data (e.g., 70%), which is often not practical [10].

Q2: Why is motion still a problem after regressing out the realignment parameters?

Nuisance regression alone is often insufficient because head motion causes complex, dynamic changes in the BOLD signal that are not fully captured by simple linear models of the realignment parameters [13] [11] [8]. These motion-induced signal changes can be shared across the brain and persist for more than 10 seconds after the movement itself has ended, leading to spurious, distance-dependent changes in functional connectivity measures [8]. Volume censoring is designed to remove the data most severely affected by these artifacts.

Q3: I am getting an error in my processing pipeline when using censoring. What could be wrong?

A common technical error is a mismatch in data length between the original time series and the regressed/censored data when generating executive summary plots or during subsequent analysis steps [12]. This often occurs because the censoring step removes specific volumes, shortening the time series. Ensure that all downstream processing steps and visualization scripts are compatible with and correctly reference the censored dataset of reduced length.

How Motion Artifacts Create Spurious Brain-Behavior Relationships

Troubleshooting Guides

Guide 1: Diagnosing Spurious Brain-Behavior Relationships

Problem: A significant brain-behavior correlation is detected, but you suspect it might be driven by head motion rather than true neural signal. Background: Head motion is the largest source of artifact in fMRI signals and systematically biases functional connectivity (FC). Participants with higher motion tend to show decreased long-distance connectivity and increased short-range connectivity. This pattern can create false positive associations if motion is correlated with the behavioral trait of interest (e.g., psychiatric conditions, developmental disorders) [14].

Diagnostic Step	Procedure	Interpretation
Check Trait-Motion Correlation	Calculate correlation between framewise displacement (FD) and your behavioral trait across participants.	A significant correlation (p < 0.05) indicates high risk for motion-confounded results [14].
Apply SHAMAN Analysis	Perform Split Half Analysis of Motion Associated Networks: split each participant's data into high-motion and low-motion halves, then compare trait-FC correlations between halves [14].	A significant difference indicates motion is impacting your trait-FC relationship. Direction reveals overestimation or underestimation.
Test Censoring Sensitivity	Re-run your primary analysis using progressively stricter FD censoring thresholds (e.g., 0.3mm, 0.2mm).	If effect sizes or significance change substantially with stricter censoring, the original result is likely contaminated by motion [14] [9].

Guide 2: Implementing Effective Motion Censoring

Problem: You need to remove motion-corrupted volumes without discarding valuable data or introducing new biases. Background: Censoring (or "scrubbing") excludes individual fMRI volumes where head motion exceeds a specific FD threshold. There is a natural tension between removing contaminated data to reduce false positives and retaining enough data for reliable analysis, especially for participants who move more [14].

Censoring Workflow Decision Guide

Step	Key Considerations	Recommendations
Threshold Selection	Stricter thresholds remove more noise but may exclude participants with high motion.	Start with FD < 0.3mm for general use; tighten to 0.2mm for highly motion-correlated traits [14] [9].
Data Retention Check	Ensure sufficient data remains after censoring for reliable analysis.	Aim for >8-10 minutes of clean data post-censoring; consider concatenating multiple runs [15] [9].
Impact Assessment	Verify censoring reduced motion-artifact without introducing new bias.	Use SHAMAN or QC-FC plots to quantify residual motion effects post-censoring [14].

Frequently Asked Questions

Q1: Our study includes children with ADHD who move more than controls. How can we tell if our group differences in connectivity are real or motion artifacts?

Use the SHAMAN (Split Half Analysis of Motion Associated Networks) method to calculate a motion impact score for your specific trait-FC findings [14]. This method distinguishes between motion causing overestimation or underestimation of effects. For ADHD-control comparisons, you would:

Process your data through standard denoising (e.g., ABCD-BIDS pipeline)
Apply SHAMAN analysis to your key group difference findings
Interpret the motion impact score and p-value: a significant score indicates your results are likely contaminated by motion Supplement this with distance-dependent correlation analysis, as motion artifacts typically reduce long-distance connectivity [14].

Q2: We've collected 10-minute resting-state scans. Is this sufficient for reliable brain-behavior prediction, or will motion artifacts dominate our results?

Evidence suggests 10-minute scans are suboptimal and cost-inefficient. A 2025 Nature study found that prediction accuracy increases with total scan duration (sample size × scan time). For scans ≤20 minutes, accuracy increases linearly with the logarithm of total scan duration [15]. The same study recommends:

Minimum scan time: At least 20 minutes
Optimal scan time: Approximately 30 minutes is most cost-effective, yielding 22% savings over 10-minute scans For your existing 10-minute data, implement aggressive motion censoring (FD < 0.2mm) and consider that your effect sizes may be attenuated by both limited scan duration and residual motion artifacts [15].

Q3: What is the most effective preprocessing pipeline for minimizing motion artifacts in volumetric task-fMRI analysis?

Recent evidence favors one-step interpolation over multi-step preprocessing. A 2025 comparison of three pipelines found:

OGRE pipeline (one-step interpolation) produced significantly lower inter-subject variability than both FSL FEAT preprocessing (p = 7.3 × 10⁻⁹) and fMRIPrep (p = 0.036) [16]
OGRE also yielded the strongest detection of task-related activation in primary motor cortex [16] For resting-state data, fMRIPrep provides a robust, standardized option that adapts to different input datasets and requires minimal user input while providing comprehensive quality reports [17].

Q4: After standard denoising (e.g., with fMRIPrep), how much residual motion artifact should we expect, and what more can we do?

Even after comprehensive denoising, substantial motion artifacts often remain. In the ABCD study, after denoising with ABCD-BIDS (including global signal regression, respiratory filtering, and motion parameter regression), head motion still explained 23% of signal variance—a 69% reduction from minimal processing alone, but still substantial [14].

Mitigation Strategy	Effectiveness	Limitations
Volume Censoring (FD < 0.2mm)	Reduces significant motion overestimation from 42% to 2% of traits [14]	Does not address motion underestimation; may exclude important participants
Multi-Echo fMRI + RETROICOR	Effectively mitigates physiological artifacts; benefits most moderate accelerations [18]	Requires specialized acquisition sequences and physiological monitoring
ICA-Based Denoising	Removes much remaining motion artifact after censoring; effective in high-motion pediatric data [9]	Requires careful component classification; may remove neural signal if misapplied

The Scientist's Toolkit

Research Reagent Solutions

Tool / Resource	Function	Application Notes
SHAMAN	Quantifies trait-specific motion impact scores for brain-behavior relationships [14]	Critical for verifying motion isn't driving significant findings; provides p-values for motion impact
fMRIPrep	Standardized, robust fMRI preprocessing pipeline [17]	Provides comprehensive quality reports; implements one-step interpolation to reduce blurring
OGRE Pipeline	One-step interpolation pipeline optimized for FSL FEAT analysis [16]	Specifically reduces inter-subject variability in task-fMRI; compatible with FreeSurfer versions 7.2.0+
RETROICOR	Corrects physiological artifacts using cardiac/respiratory data [18]	Can be applied to individual echoes or composite data in multi-echo fMRI; most effective with moderate acceleration
Optimal Scan Time Calculator	Online tool for designing cost-effective BWAS studies [15]	Balances sample size and scan duration based on prediction accuracy targets; recommends ≥30 minute scans

Comprehensive Motion Mitigation Workflow

Frequently Asked Questions (FAQs)

Q1: What is distance-dependent bias in the context of brain connectivity? Distance-dependent bias refers to a systematic error where the probability and estimated strength of a connection between two brain regions are artificially influenced by the physical distance separating them. In structural connectivity, shorter connections are more likely to be detected and appear stronger than longer connections. This arises from a combination of biological cost-reduction principles (e.g., the brain favors energetically cheaper short-range connections) and methodological limitations in reconstruction techniques like tractography [19].

Q2: How does subject head motion create a distance-dependent bias in functional connectivity? Head motion during fMRI scans introduces artifacts that corrupt the Blood-Oxygen-Level-Dependent (BOLD) signal. Crucially, the impact of this motion is not uniform across the brain. It can cause spurious, distance-dependent changes in functional connectivity metrics, often inflating short-range correlations and reducing long-range correlations. This creates a systematic bias that can distort the true pattern of brain networks [3] [20] [11].

Q3: What is volume censoring (or "scrubbing") and how can it mitigate motion-related bias? Volume censoring is a motion correction technique that identifies and excludes individual fMRI volumes (time-points) contaminated by excessive head movement from subsequent analysis. This is typically done using metrics like Framewise Displacement (FD) or DVARS. By removing these corrupted data points, censoring reduces the motion-induced artifactual correlations in the BOLD signal, thereby mitigating the associated distance-dependent bias and providing a more accurate estimate of true functional connectivity [3] [4] [20].

Q4: How do I choose an optimal Framewise Displacement (FD) threshold for censoring? There is no universal FD threshold; the optimal value is dataset-specific and involves a trade-off between removing motion artifact and retaining sufficient data. Evidence from various studies can guide this decision, as summarized in the table below.

Table 1: Reported Framewise Displacement (FD) Censoring Thresholds in Different Populations

Population	Recommended FD Threshold	Key Rationale / Outcome
Adults (Task-fMRI)	0.5 mm	Reduced variance in parameter estimates and increased statistical power [20].
First-Grade Children (Resting-state)	0.3 mm	Effectively removed motion artifacts while retaining 83% of participants; minimized correlation between motion and connectivity [4] [9].
Fetuses (Resting-state)	1.5 mm	Improved prediction of neurobiological features (e.g., gestational age) compared to no censoring [11].
General Evaluation	Dataset-Specific	Optimal threshold should be determined prior to final analysis based on the specific data [21].

Q5: What is the difference between uniform and distance-dependent consensus thresholding for structural networks? This is a key methodological distinction in creating group-level structural connectomes from multiple subjects.

Uniform Thresholding: A single consistency threshold (e.g., an edge must be present in 50% of subjects) is applied to all connections, regardless of length. This systematically over-represents short-range and under-represents long-range connections because short connections are more consistently detected across subjects [19].
Distance-Dependent Thresholding: The consistency threshold varies based on connection length. It is designed to non-parametrically preserve the edge length distribution found in individual subjects. This method better recapitulates subject-level network properties and preserves the integrity of critical long-distance connections [19].

Troubleshooting Guides

Issue: High Motion Artifact in Pediatric or Clinical fMRI Data

Problem: You are working with a dataset from a high-motion population (e.g., young children, clinical cohorts) and observe strong correlations between head motion and functional connectivity measures, indicating persistent motion artifact.

Solution: Implement a combined preprocessing pipeline of real-time monitoring, volume censoring, and ICA denoising.

Table 2: Protocol for High-Motion Data Mitigation

Step	Protocol Details	Function
1. Real-Time Monitoring	Use software like Framewise Integrated Real-time MRI Monitoring (FIRMM) during scanning to track the amount of low-motion data acquired [4].	Maximizes the chance of collecting sufficient usable data by allowing for additional runs if needed.
2. Volume Censoring	Calculate FD and DVARS. Identify and censor volumes exceeding your chosen threshold (e.g., FD > 0.3 mm for children). Model these as nuisance regressors in the GLM [3] [4].	Directly removes the influence of high-motion volumes on the final connectivity estimates.
3. Concatenation	If multiple resting-state runs are available, concatenate the cleaned data from each run after censoring [4].	Creates a single, longer time series for more reliable connectivity analysis.
4. ICA-Based Denoising	Use automated classifiers (e.g., FIX) or trained classifiers to identify and remove noise-related components from the concatenated data [4].	Removes residual, non-motion-related noise and can make additional motion parameter regression redundant.

Visual Workflow:

Issue: Biased Group-Level Structural Connectomes

Problem: When creating a group-representative structural network, the resulting graph is dominated by short-range connections and fails to capture the long-range integrative architecture seen in individual subjects.

Solution: Replace uniform consensus thresholding with a distance-dependent consensus thresholding method.

Experimental Protocol (Distance-Dependent Thresholding):

Pool Subject Data: For each subject, calculate the length of every possible edge in the network.
Create Pooled Distribution: Combine the edge lengths from all subjects into a single, pooled distribution. This represents the "typical" subject's edge length profile.
Calculate Distance-Dependent Threshold: The goal is to find a threshold, for each distance bin, that results in a group-level network whose edge length distribution matches the pooled subject-level distribution.
Apply Non-parametric Threshold: Retain connections in the group-level network based on this variable threshold, which is more lenient for long-range connections than for short-range ones. This ensures that long-distance connections, which are less consistently observed but biologically critical, are not disproportionately excluded [19].

Visual Workflow:

The Scientist's Toolkit: Key Research Reagents & Solutions

Table 3: Essential Materials and Tools for Connectivity Analysis and Motion Mitigation

Item Name	Function / Description	Example Use Case
Framewise Displacement (FD)	A scalar metric quantifying the volume-to-volume change in head position. The primary measure for identifying motion-corrupted volumes to censor [3] [20].	Determining which fMRI volumes to censor during preprocessing (e.g., FD > 0.3 mm).
Realignment Parameters (RP)	The six parameters (translations X,Y,Z and rotations pitch, yaw, roll) estimating head position at each volume. Used as nuisance regressors in the GLM [3].	Inclusion as RP6 or expanded RP24 models in first-level models to account for motion-related variance.
Independent Component Analysis (ICA)	A data-driven technique that separates the fMRI signal into maximally independent spatial and temporal components. Noise components can be automatically or manually identified and removed [3] [4].	Denoising data after censoring, particularly effective for removing residual artifact without signal loss.
FIRMM Software	Framewise Integrated Real-time MRI Monitoring software. Provides real-time feedback on subject motion during the scan [4].	Used in pediatric studies to ensure a sufficient amount of low-motion data is acquired before ending the session.
Distance-Dependent Thresholding Algorithm	A non-parametric algorithm for generating group-level structural networks that preserves the connection-length distribution of individual subjects [19].	Creating a group-representative structural connectome that does not systematically underestimate long-range connections.

Residual Motion Artifacts Persist After Standard Denoising Methods

Troubleshooting Guides

Guide 1: Diagnosing and Addressing Persistent Motion Artifacts

Residual motion artifacts often remain in fMRI data even after applying standard denoising procedures like realignment and basic motion parameter regression [22] [23]. This guide will help you identify and correct for these persistent artifacts.

Q1: How can I confirm that residual motion artifacts are present in my data? Persistent motion artifacts often manifest as a distance-dependent relationship between motion and functional connectivity, where correlations between nearby brain regions remain artificially elevated [22]. To diagnose this:

Calculate the correlation between subject head motion (e.g., mean Framewise Displacement) and functional connectivity measures after denoising.
Plot this residual relationship against the physical distance between brain regions.
A significant positive relationship, especially for short-distance connections, indicates that motion artifacts persist [22] [24].

Q2: What are the most effective strategies to remove these residual artifacts? No single method eliminates all artifacts, but several advanced strategies show efficacy:

aCompCor (Component Based Noise Correction): This method extracts noise signals from white matter and cerebrospinal fluid (CSF) regions and regresses them from the data. An optimized aCompCor approach, designed to increase the noise prediction power of these signals, has been shown to be one of the best-performing strategies [22].
ICA-AROMA: This data-driven technique uses Independent Component Analysis to automatically identify and remove motion-related components from the data. It performs well across multiple benchmarks with a relatively low cost in terms of data loss [22] [24].
Volume Censoring (Scrubbing): This approach identifies and removes motion-contaminated volumes from the analysis. It is particularly effective at reducing distance-dependent artifacts [22] [20].

Guide 2: Implementing and Optimizing Volume Censoring

Volume censoring can be a powerful tool, but its implementation requires careful consideration of thresholds and an understanding of the trade-offs involved [3].

Q1: What is the optimal Framewise Displacement (FD) threshold for censoring? There is no universal threshold, and the optimal value can depend on your specific dataset and research question [3]. The table below summarizes thresholds and their outcomes from key studies:

Table 1: Framewise Displacement (FD) Thresholds in Practice

FD Threshold	Context / Population	Outcome / Rationale
0.9 mm [20]	Task-fMRI in children and adolescents	Reduced variance in parameter estimates and increased statistical power.
0.3 mm [9]	Resting-state in 6-8 year olds	Effective data quality while retaining 83% of participants.
0.2 - 0.5 mm [3]	Multi-dataset evaluation of task-fMRI	Modest censoring (1-2% data loss) showed consistent improvements.

Recommendation: Start with a threshold of 0.2-0.3 mm and perform sensitivity analyses to determine the impact on your data retention and outcome metrics [3] [9].

Q2: What are the key trade-offs of volume censoring? While effective, censoring has significant costs:

Data Loss: Removing volumes reduces the temporal degrees of freedom, which can lower the reliability of functional connectivity estimates [22].
Biased Exclusion: Stringent censoring can lead to the exclusion of a higher proportion of high-motion subjects, potentially biasing your final sample [24].
Discontinuities: Censoring creates gaps in the time series, which can complicate analysis [25].

Mitigation Strategy: To address data discontinuities, consider advanced methods like structured low-rank matrix completion, which can recover missing entries and restore continuous time series [25].

Frequently Asked Questions (FAQs)

Q1: Why do motion artifacts persist even after rigid-body realignment and motion parameter regression? Realignment corrects for misalignment between volumes but does not address several underlying sources of artifact, including:

Spin-history effects: Motion changes the excitation history of spins, causing signal changes that realignment cannot fix [26] [23].
Interpolation errors: The process of realigning volumes requires resampling, which can introduce its own artifacts [23].
Intra-volume motion: Head motion can occur during the acquisition of a single volume, which standard volume-based realignment does not model [26].

Q2: Is Global Signal Regression (GSR) recommended for removing motion artifacts? The use of GSR is a nuanced decision. On one hand, it can improve the removal of global motion artifacts and increase the sensitivity of pipelines to detect true effects [24]. On the other hand, it can exacerbate the distance-dependent relationship between motion and connectivity and remains controversial due to potential mathematical biases [22] [24]. Its application should be justified by the research question and interpreted with caution.

Q3: My study involves a task. How does task-based motion affect denoising? Task-based fMRI presents a specific challenge because head motion is often correlated with task performance (e.g., moving in response to a stimulus). This correlation makes it difficult to disentangle true neuronal activity from motion artifact [22] [23]. In such cases:

Conventional regression may remove genuine task-related activity along with the motion artifact [23].
ICA-based methods (like ICA-AROMA) can be advantageous because they separate signals based on spatial independence, potentially isolating motion-related components without removing task-related activation [23].

Q4: Are there any new or emerging techniques to handle residual motion? Yes, research is ongoing. Promising approaches include:

SLOMOCO (Slice-oriented MOtion COrrection): A method that performs motion correction at the slice level rather than the volume level, better addressing intra-volume motion [26].
Structured Low-Rank Matrix Completion: An advanced censoring recovery method that fills in censored data gaps by exploiting the inherent structure of the fMRI time series, mitigating the data loss problem [25].
Weighted Least Squares (WLS): A method that estimates the variance for each image in the time series and weights them during model estimation, thereby down-weighting the influence of high-motion volumes [27].

The Scientist's Toolkit

Table 2: Essential Denoising Methods and Their Functions

Method / Reagent	Primary Function	Key Considerations
aCompCor [22]	Derives noise regressors from noise ROIs (WM/CSF).	An optimized version is top-performing; may be less effective in high-motion data [22] [24].
ICA-AROMA [22] [24]	Identifies and removes motion components via spatial ICA.	Good balance of efficacy and data retention; can be automated.
Volume Censoring [22] [20]	Removes high-motion volumes from analysis.	Most effective for distance-dependent artifacts; causes data loss.
Global Signal Regression (GSR) [22] [24]	Regresses the global mean signal from the data.	Improves many benchmarks but exacerbates distance-dependence; use with caution.
SLOMOCO [26]	Corrects for motion at the slice level.	Addresses intra-volume motion, a source of residual artifact.

Workflow and Conceptual Diagrams

Motion Artifact Mitigation Decision Workflow

This chart outlines the decision-making process for selecting and applying advanced denoising methods after finding that residual motion artifacts persist following standard realignment.

Advanced Censoring and Recovery Process

This diagram illustrates the workflow for an advanced censoring approach that uses matrix completion to recover missing data, thereby mitigating the problem of data loss.

Implementing FD Thresholds: Practical Strategies and Evidence-Based Guidelines

Framewise displacement (FD) is a quantitative measure of head motion between consecutive volumes in a functional magnetic resonance imaging (fMRI) scan. Volume censoring (or "scrubbing") is a widely used technique to mitigate motion artifacts by identifying and removing individual volumes with FD values that exceed a specific threshold. This process is crucial because head motion can introduce systematic but spurious correlations in functional connectivity data, potentially compromising the validity of study results [13]. The choice of FD threshold represents a critical trade-off: overly conservative (lower) thresholds may remove excessive data, while overly liberal (higher) thresholds may retain motion-corrupted volumes.

The table below summarizes the common FD threshold ranges found in the literature, categorized from conservative to liberal approaches.

Table 1: Common FD Threshold Ranges and Their Applications

Approach Category	FD Threshold Range	Primary Applications and Rationale	Key Trade-offs
Conservative	0.1 - 0.2 mm	Used for rigorous motion control in high-quality datasets with low inherent motion [28]. Minimizes spurious motion-related correlations.	Highest data loss; may exclude many participants or volumes, reducing statistical power.
Moderate	0.2 - 0.3 mm	A balanced approach for standard analyses. Effective for pediatric and clinical populations with expected higher motion [4] [24].	Moderate data loss. Offers a good balance between data quality and retention.
Liberal	> 0.3 mm	Applied in studies of populations with extreme motion (e.g., fetuses, young children) where stricter thresholds would lead to excessive data loss [11].	Retains more motion-corrupted data; requires careful validation to ensure artifacts are removed.

Experimental Evidence and Protocol Details

Evidence Supporting a 0.2 mm Threshold

Multiple studies have identified 0.2 mm as a benchmark for "high motion." Power et al. demonstrated that FD values as small as 0.2 mm can introduce systematic artifacts, where motion decreases long-distance correlations and increases short-distance correlations [13]. Their protocol involved:

Data Acquisition: Resting-state fMRI data from multiple cohorts (children, adolescents, adults) on Siemens scanners.
Motion Calculation: FD was computed from the root mean square of the translational and rotational displacement parameters.
Analysis: Correlations between motion and functional connectivity were examined before and after applying various censoring thresholds.
Finding: Scrubbing volumes with FD > 0.2 mm was effective at reducing negative motion-BOLD relationships, though positive relationships in motor areas, potentially of neural origin, often remained [28].

Evidence Supporting a 0.3 mm Threshold in High-Motion Cohorts

A 2022 study specifically designed to handle high-motion pediatric data demonstrated the efficacy of a 0.3 mm threshold [4].

Participants: 108 first-grade children (age 6-8), a cohort with typically high motion.
Real-Time Monitoring: Head motion was monitored in real-time using Framewise Integrated Real-time MRI Monitoring (FIRMM) to ensure at least 4 minutes of low-motion data (FD < 0.4 mm) were acquired.
Preprocessing Pipeline: The protocol combined volume censoring with ICA-based denoising.
Result: With the censoring threshold set at FD > 0.3 mm, the preprocessed data met rigorous quality standards while retaining 83% of participants. This shows that a moderately liberal threshold can yield usable data even in challenging populations [4].

Systematic Evaluation of Multiple Thresholds

A comprehensive 2018 evaluation of 19 different denoising pipelines highlighted the general effectiveness of volume censoring [24].

Benchmarks: Pipelines were evaluated on their ability to break the relationship between motion and connectivity, minimize data loss, and ensure test-retest reliability.
Finding: Volume censoring consistently performed well across most benchmarks, particularly in minimizing motion-related artifacts. The study reinforced that censoring is one of the most robust strategies, though its success is also tied to the potential exclusion of high-motion individuals.

Troubleshooting Guide: FAQs on FD Threshold Implementation

Q1: How do I choose the right FD threshold for my study? The optimal threshold is not universal and depends on your specific research context. Consider your population's expected motion (e.g., adults vs. children), your scan duration, and your study's tolerance for data loss. A threshold of 0.2 mm is a good starting point for general adult populations seeking high data quality. For pediatric or clinical cohorts, a threshold of 0.3 mm may be more appropriate to maintain an adequate sample size. It is critical to perform sensitivity analyses by comparing results across different thresholds (e.g., 0.2 mm and 0.3 mm) to ensure your findings are not driven by this choice [4] [24].

Q2: What are the consequences of using proportional thresholding in functional connectome studies? Proportional thresholding, which keeps a fixed percentage of the strongest connections, is a common alternative to FD-based censoring for constructing functional networks. However, if your study groups (e.g., patients vs. controls) have systematic differences in overall functional connectivity strength, proportional thresholding can introduce bias. It may force the inclusion of more weak (and potentially spurious) connections in the group with lower overall connectivity, making their network appear more random and artificially inflating group differences in network organization [29]. It is recommended to test for and report differences in overall connectivity strength between groups.

Q3: Our dataset has high motion. What strategies can we combine with volume censoring? Volume censoring is most effective when used as part of a comprehensive denoising strategy [24]:

Combine with ICA Denoising: After censoring, use ICA-based approaches (e.g., ICA-AROMA) to automatically identify and remove motion-related components from the data. This can capture residual artifacts that censoring alone may miss [4].
Include Group-Level Covariates: Even after rigorous individual-level censoring, include the average FD per subject as a nuisance covariate in your group-level analyses to account for residual motion effects [28].
Lengthen Scan Time: For future study designs, consider longer scan times. A 2025 study found that scans of at least 20-30 minutes are more cost-effective for achieving high prediction accuracy, as they allow for the collection of more usable data even after censoring [15].

Q4: Is nuisance regression of motion parameters sufficient to correct for motion? No. While regressing out motion parameters is a common step, evidence shows it is not sufficient to fully remove motion artifacts. Associations between head motion and functional connectivity patterns often persist after nuisance regression alone. Volume censoring has been proven more effective at mitigating these lingering effects, a finding consistent across populations from adults to fetuses [24] [11].

Workflow Diagram: Implementing FD Threshold Censoring

The diagram below outlines the key steps and decision points in a standard volume censoring workflow for resting-state fMRI data.

Diagram 1: FD Censoring Workflow. This chart outlines the process for identifying and removing high-motion volumes from fMRI data, culminating in a decision point on whether sufficient data remains for analysis.

The Scientist's Toolkit: Essential Research Reagents & Materials

Table 2: Key Software and Methodological "Reagents" for Motion Correction

Tool / Method	Function / Purpose	Example Use Case / Note
Framewise Displacement (FD)	A scalar summary metric of frame-to-frame head motion. Calculated from translational and rotational realignment parameters.	The primary index for identifying motion-corrupted volumes for censoring [13].
Volume Censoring (Scrubbing)	The process of removing individual fMRI volumes where FD exceeds a defined threshold.	Can be implemented as "scrubbing" (direct removal) or "spike regression" (modeling outliers). Effective but causes data loss [24].
ICA-AROMA	A data-driven algorithm (ICA-based Automatic Removal Of Motion Artifacts) that identifies and removes motion-related components from fMRI data.	Often used in conjunction with or as an alternative to censoring. Provides good motion control with less data loss than aggressive censoring [24].
FIRMM	Framewise Integrated Real-time MRI Monitoring software. Provides real-time feedback on head motion during the scan.	Allows researchers to acquire a pre-defined amount of low-motion data, which is especially useful for high-motion populations [4].
Global Signal Regression (GSR)	A controversial but effective denoising step that regresses out the global mean signal of the brain.	Can reduce motion-related artifacts and improve the specificity of functional connectivity measures, but may also remove neural signal [24].
aCompCor	A noise reduction method that extracts principal components from noise regions of interest (e.g., white matter, CSF) and regresses them out.	May be most viable in low-motion datasets; its performance can be compromised in high-motion scenarios [24].

Framewise Displacement Threshold FAQ

What is framewise displacement (FD) and why is censoring necessary?

Framewise displacement (FD) is a scalar quantity that summarizes head movement from one brain volume to the next, calculated from the six rigid-body realignment parameters (translations and rotations) [13]. Censoring, or "scrubbing," is the process of identifying and removing individual volumes (time points) with FD values above a chosen threshold [4]. This is necessary because even small, sub-millimeter head movements can introduce systematic artifacts into functional connectivity MRI (fcMRI) data. These artifacts manifest as spurious correlations, typically decreasing long-distance connections and increasing short-distance connections, which can lead to false positive or false negative findings in brain-behavior association studies [28] [14] [13].

How does the research population influence the choice of an FD threshold?

The characteristics of the study population are a primary consideration because head motion varies systematically across different groups.

Children and Clinical Populations: Studies involving young children or individuals with neurological or psychiatric conditions (e.g., ADHD, autism) often encounter higher rates of in-scanner motion [4] [28]. For these high-motion groups, a more lenient threshold might be necessary to avoid excluding a large number of participants or an excessive amount of data. For example, one study on first-grade children successfully used a threshold of FD < 0.3 mm, which retained 83% of participants while still meeting rigorous data quality standards [4].
Adults and Low-Motion Populations: In studies of compliant, healthy adults, a more stringent threshold (e.g., FD < 0.2 mm) is often feasible and recommended to minimize residual motion artifact, as participants generally move less [14].

Table 1: Example FD Thresholds by Population

Population	Example FD Threshold	Rationale	Source Context
Young Children (Age 6-8)	FD < 0.3 mm	Balances data quality with participant retention in a high-motion cohort. [4]	Study of 108 first-grade children. [4]
General / Adult	FD < 0.2 mm	Commonly used stringent threshold to mitigate motion artifacts. [14]	Analysis of the ABCD study dataset. [14]
Fetal fMRI	Varies widely (FD > 0.5mm to > 3.8mm)	Extreme motion environment; threshold choice directly trades off with massive data loss (40-95%). [30]	Study of 535 fetal fMRI sessions. [30]

How does scan duration interact with the choice of FD threshold?

Scan duration is a critical factor because it determines the total amount of data available. The relationship is a direct trade-off: a more stringent (lower) FD threshold will remove more volumes, effectively shortening the usable scan length.

Achieving Sufficient Data: Longer initial scan durations provide a buffer, allowing researchers to use a stringent threshold and still retain enough data for a reliable analysis. Recent research on brain-wide association studies (BWAS) suggests that longer scans (at least 20-30 minutes) are more cost-effective for achieving high prediction accuracy [15]. If a stringent threshold is applied to a short scan, the resulting data may be insufficient.
Data Quality vs. Quantity: The goal is to censor motion-corrupted volumes to improve data quality, but not so many that the remaining data is too short for stable functional connectivity estimates. One study on dynamic causal modeling found that scanning durations over 10.8 minutes could yield good reliability [31]. If a planned analysis requires a certain minimum number of volumes, the FD threshold may need to be adjusted to meet that requirement, prioritizing less stringent thresholds for shorter scan protocols.

Table 2: Impact of Scan Duration and Sample Size on Data Quality

Factor	Effect on Data Quality & Reliability	Research Finding
Longer Scan Duration	Increases prediction accuracy and reliability of functional connectivity metrics. [15] [31]	For phenotypic prediction, 30-minute scans were found to be the most cost-effective. For effective connectivity, scans over 10.8 minutes yielded good reliability. [15] [31]
Larger Sample Size	Improves prediction accuracy and boosts the statistical power of brain-behavior associations. [15]	Sample size is ultimately more important than scan duration for prediction power, though both are beneficial. [15]
Total Scan Duration	Prediction accuracy increases with the total scan duration (sample size × scan time per participant), with diminishing returns. [15]	A logarithmic model showed sample size and scan time are initially interchangeable for boosting prediction accuracy. [15]

How does my research question affect the threshold selection?

The specific goals of your study can influence how conservative you need to be with motion censoring.

Studies of Motion-Correlated Traits: Extra caution is required when studying behavioral or clinical traits that are themselves correlated with head motion (e.g., inattention, impulsivity). In these cases, residual motion artifact can create spurious brain-behavior relationships, making it appear that a trait is associated with a certain connectivity pattern when the association is actually driven by motion [14]. For such research questions, using a stringent threshold and employing methods to quantify trait-specific motion impact (like SHAMAN) is highly recommended [14].
Preserving Natural State Variance: For some studies, overly aggressive censoring might systematically exclude participants with higher motion, who may also represent an important part of the population variance on the trait of interest (e.g., excluding all children with low attention scores) [14]. In such scenarios, a slightly more lenient threshold might be justified to preserve the representativeness of the sample, provided that motion is rigorously accounted for in the statistical models.

Diagram 1: FD Threshold Selection Workflow

Experimental Protocols for Motion Censoring

Protocol 1: Volume Censoring with ICA Denoising for High-Motion Pediatric Data

This protocol is adapted from a study that successfully obtained usable resting-state data from first-grade children despite extreme motion [4].

Real-time Monitoring: Use real-time motion monitoring software (e.g., Framewise Integrated Real-time MRI Monitoring - FIRMM) during data acquisition to ensure a target amount of low-motion data is collected over multiple runs.
Preprocessing: Perform standard volume realignment to generate rigid-body head motion parameters.
Calculate Framewise Displacement (FD): Compute FD for every volume in the time series [4].
Censoring: Identify and remove all volumes where the FD exceeds a predetermined threshold (e.g., 0.3 mm). Data from multiple runs are concatenated after censoring.
Denoising: Use Independent Component Analysis (ICA) based denoising (e.g., with a trained classifier like FIX or ICA-AROMA) to remove motion-related and other artifact components from the concatenated, censored data. This step replaces the need for temporal filtering and nuisance regression, which can be invalidated by censoring.
Quality Control: Assess the preprocessed data by ensuring there are minimal correlations between participant head motion and key quality metrics like temporal signal-to-noise ratio (tSNR) and functional connectivity.

Protocol 2: Combined Scrubbing and Group-Level Covariation for Case-Control Studies

This protocol is recommended for studies comparing groups that may differ in their motion characteristics, such as a clinical group versus healthy controls [28].

Preprocessing & Realignment: Complete standard realignment of functional images.
High-Quality Volume Identification: Flag volumes for censoring using a stringent threshold (e.g., FD > 0.2 mm). Some protocols also flag the volume before and after a high-motion volume.
Integrated Regression Model: In a single general linear model (GLM), regress out both the motion parameters (e.g., 24- or 36-parameter model) and "spike regressors" that represent each censored time point. This combines modeling and scrubbing approaches.
Derive Functional Connectivity Metrics: Calculate your primary resting-state fMRI metrics (e.g., correlation matrices) from the cleaned time series.
Group-Level Analysis: In the group-level statistical model, include the mean FD (or another summary measure of motion) for each participant as a nuisance covariate. This step helps remove residual variance associated with motion that was not fully eliminated at the individual subject level [28].

The Scientist's Toolkit: Key Research Reagents & Materials

Table 3: Essential Tools for fMRI Motion Correction and Quality Control

Tool / Material	Function / Purpose	Brief Explanation
Framewise Displacement (FD)	Quantifies head motion between consecutive brain volumes.	A single summary statistic derived from the 6 realignment parameters, used to identify motion-corrupted time points for censoring [4] [13].
Real-time Motion Monitoring (e.g., FIRMM)	Provides instant feedback on head motion during scan acquisition.	Allows researchers to collect additional data during the session if a participant moves too much, ensuring sufficient low-motion data is acquired [4].
Independent Component Analysis (ICA)	Automatically separates fMRI data into spatial and temporal components.	Used to automatically or manually identify and remove noise-related components from the data, often as a final denoising step after censoring [4].
Motion Censoring (Scrubbing)	Removes or regresses out individual high-motion volumes.	The core technique for eliminating the largest motion artifacts by excluding specific time points from analysis [4] [28].
Group-Level Motion Covariates	Accounts for residual motion effects in final statistical analyses.	Including a summary of each participant's motion (e.g., mean FD) as a covariate in group models controls for motion-related differences between subjects or groups [28].

Is a 0.2mm Framewise Displacement (FD) threshold the optimal benchmark for volume censoring?

No, large-scale studies and multi-dataset evaluations do not support a single, universal FD threshold of 0.2mm as optimal for volume censoring. The appropriate threshold is dependent on the specific study population, data acquisition parameters, and the trade-off between data quality and data retention [3] [9].

Research on high-motion pediatric cohorts, for instance, has successfully used a threshold of 0.3 mm, which allowed for the retention of 83% of participants while still meeting rigorous data quality standards [9]. Conversely, a multi-dataset evaluation in task-based fMRI found that even modest amounts of frame censoring (e.g., 1–2% data loss) could yield improvements, but no single approach, including a fixed FD threshold, consistently outperformed all others across different datasets and tasks [3].

The following table summarizes FD thresholds and their contexts as identified in the literature:

FD Threshold	Study Context	Key Findings / Rationale
0.3 mm	Resting-state fMRI in first-grade children (high-motion cohort) [9]	Effective at removing motion-corrupted volumes; 83% of participants retained after rigorous preprocessing.
Modest data loss (e.g., 1-2%)	Multi-dataset evaluation of task-based fMRI [3]	Improvements were seen with modest censoring, but the optimal threshold was not fixed and varied across studies.
1.5 mm	Fetal resting-state fMRI [11]	Used alongside motion regression; improved prediction of neurobiological features like gestational age.

A Researcher's Guide to Implementing Volume Censoring

1. Preprocessing and Motion Parameter Calculation Before censoring, functional MRI data must undergo standard preprocessing, which typically includes motion correction via rigid-body realignment [32]. This process generates six motion parameters (three translations and three rotations). The rotational parameters must be converted to millimeters by calculating the displacement on the surface of a sphere of a specified radius (e.g., the estimated radius of the brain) [11]. Framewise Displacement (FD) is then computed as the sum of the absolute values of the derivatives of these six motion parameters [3].

2. Independent Component Analysis (ICA) for Denoising Volume censoring is often combined with ICA-based denoising for a more robust cleanup [9] [33]. This data-driven method separates the BOLD signal into independent components, each with a spatial map and time course. Noise components (e.g., from head motion or physiological artifacts) can be automatically identified using a classifier like FSL's FIX (FMRIB's ICA-based Xnoiseifier) and then regressed out of the data [33].

3. The Censoring Decision Workflow The decision on which volumes to censor involves evaluating the computed FD against a chosen threshold. The diagram below outlines the logical workflow for this process.

Tool / Resource	Function / Purpose
FSL (FMRIB Software Library)	A comprehensive library of MRI analysis tools; used for motion correction, ICA, and FIX-based denoising [33].
AFNI (Analysis of Functional NeuroImages)	A suite of programs for analyzing and displaying functional MRI data; includes tools for despiking and volume censoring [11].
Gordon Neuroanatomical Atlas	A brain parcellation scheme used to define regions of interest (ROIs) for analyzing functional connectivity, as employed in the ABCD Study [34].
Child Behavior Checklist (CBCL)	A parent-reported measure used to assess internalizing behaviors (e.g., withdrawn/depressed) in children, allowing for correlation with neuroimaging data [34].
Polygenic Risk Score for Depression (PRS-D)	A numerical score summarizing an individual's genetic liability for depression, used to investigate gene-brain-behavior relationships [34].

Troubleshooting Common Problems

Problem: After implementing a 0.2mm FD censoring threshold, I have lost too much data from my high-motion pediatric cohort. What should I do?

Solution: A strict 0.2mm threshold may be overly conservative for high-motion populations. The literature suggests:

Use a less strict threshold: Empirical work with first-grade children showed that a 0.3 mm FD threshold effectively controlled for motion artifacts while retaining the majority of subjects (83%) [9].
Combine methods: Relying on censoring alone is often insufficient. Implement a pipeline that combines volume censoring with ICA-based denoising (e.g., using FSL's FIX) to remove residual motion artifacts that censoring might miss [9] [33].

Problem: My analysis shows a lingering association between head motion and functional connectivity even after nuisance regression and volume censoring. How can I further mitigate this?

Solution: Persistent motion effects are a known challenge.

Validate with motion prediction: One method to test for lingering effects is to see if a subject's functional connectivity profile can predict their average FD. If a significant relationship remains, consider:
- Adjusting the censoring threshold: Systematically evaluate the impact of different FD thresholds (e.g., 0.2mm, 0.3mm, 0.5mm) on the motion-connectivity relationship [11].
- Incorporate more nuisance regressors: Beyond the 6 basic motion parameters, using 24 or even 36 expanded motion regressors (including derivatives and squared terms) can account for more complex motion-related noise [11].

Problem: I am analyzing task-based fMRI data. Should I use the same volume censoring approach as for resting-state data?

Solution: The principles are similar, but the implementation requires special care due to the experimental design.

Model censoring in the GLM: In task-based fMRI, censored volumes are typically handled by including "scan-nulling" nuisance regressors in the general linear model (GLM) for each censored time point [3].
Be mindful of data loss: Censoring in task-based designs can lead to a loss of statistical power, especially if critical task events occur during censored volumes. The trade-offs between motion mitigation and signal detection must be carefully evaluated for your specific task paradigm [3].

Integrating Censoring with Complementary Motion Correction Techniques

Troubleshooting Guides

Guide 1: Resolving Poor Functional Connectivity After Censoring

Problem: After applying frame censoring, functional connectivity matrices show high error, or seed-based correlation maps are poorly delineated.

Explanation: Censoring creates discontinuities in the fMRI time series. Simple removal of high-motion volumes can disrupt the temporal structure of the BOLD signal, leading to unreliable correlation estimates, especially if a significant portion of data is removed [25].

Solution: Implement a structured low-rank matrix completion method to recover the missing entries instead of leaving gaps.

Procedure:
- Identify and censor volumes with a Framewise Displacement (FD) threshold, typically >0.2-0.3 mm [3] [9].
- Apply a matrix completion algorithm that enforces a low-rank prior on a structured Hankel matrix formed from the time series.
- This method simultaneously recovers the missing data and performs slice-time correction at a finer temporal resolution.
- Proceed with standard functional connectivity analysis on the completed time series.
Verification: After applying this method, check for improved error metrics in pair-wise correlation matrices and better delineation of known networks, like the default mode network, in seed-based analyses [25].

Guide 2: Choosing Between Censoring and Motion Regressors

Problem: Uncertainty about whether to use frame censoring, motion parameter regressors (RP6/RP24), or both in a task-based fMRI GLM, particularly when motion is correlated with the task in a block design.

Explanation: Frame censoring (scrubbing) is particularly effective at mitigating motion artifacts when high-motion events are temporally linked to the task condition, as it removes the influence of the most corrupted data points from the analysis [35]. Relying solely on regressors may not fully remove these intense, localized artifacts.

Solution: For task-based fMRI, especially block designs, frame censoring is generally recommended over RP6 alone.

Procedure:
- Calculate Framewise Displacement (FD) for each volume.
- Identify and censor volumes exceeding a chosen FD threshold (e.g., 0.3-0.5 mm) by including "scan-nulling" regressors in your first-level General Linear Model (GLM) [3] [35].
- You can omit the standard 6 realignment parameters (RP6) as nuisance regressors when using frame censoring, as their primary variance is often removed by the censoring process.
- For participants with no FD outliers, standard realignment is sufficient, and motion regression may be unnecessary if the design matrix already accounts for the censored volumes [35].
Verification: Compare the model's residuals and the resulting t-statistics from a model with censoring to one using only RP6. Improved statistical power and reduced motion-related artifacts in the output maps indicate successful application [3].

Frequently Asked Questions (FAQs)

FAQ 1: What is the optimal Framewise Displacement (FD) threshold for censoring in a high-motion pediatric cohort?

For high-motion pediatric cohorts (e.g., children aged 6-8), a rigorous censoring threshold of FD > 0.3 mm has been demonstrated to be effective. This threshold, when combined with ICA-based denoising in a preprocessing pipeline, successfully removes motion-corrupted volumes while allowing for the retention of a high percentage of participants (83%). This approach minimizes the correlation between head motion and data quality metrics like temporal signal-to-noise ratio and functional connectivity [9].

FAQ 2: Can real-time motion feedback reduce head motion during a task-based fMRI scan?

Yes. Providing participants with real-time visual feedback on their head motion via software like FIRMM can lead to a statistically significant reduction in head motion during task-based fMRI. One study demonstrated a reduction in average Framewise Displacement (FD) from 0.347 mm to 0.282 mm. The feedback is typically provided using a color-coded cross-hair that changes from white to yellow to red based on preset FD thresholds (e.g., <0.2 mm, 0.2-0.3 mm, ≥0.3 mm). This is complemented by between-run feedback showing a motion performance score [36].

FAQ 3: How do I handle the data loss associated with aggressive frame censoring?

Aggressive censoring leads to data discontinuities. A advanced solution is to use a motion-compensated recovery method based on structured low-rank matrix completion. This technique does not just leave gaps; it formulates the artifact-reduction problem as the recovery of a large matrix with missing entries. By exploiting the inherent low-rank structure of the fMRI time series data, the algorithm can accurately interpolate and recover the censored volumes, resulting in a continuous time series that improves subsequent functional connectivity analysis [25].

FAQ 4: When using aCompCor for noise reduction, is scan scrubbing still necessary?

Research suggests that if the aCompCor method is used to estimate nuisance signals from white matter and cerebrospinal fluid, the added benefit of scan scrubbing is minimal. aCompCor, which uses principal components analysis, effectively attenuates motion artifacts on its own. Studies have shown that scrubbing does not provide significant additional reduction in motion artifacts or improvement in connectivity specificity once aCompCor has been applied [37].

Table 1: Comparison of Motion Correction Technique Efficacy Across Multiple Datasets [3]

Technique	Acronym	Key Finding	Relative Performance
Frame Censoring (FD/DVARS)	Censoring	Consistent improvements with modest data loss (1-2%). Often outperforms RP6.	High
24 Motion Regressors	RP24	Expansion of realignment parameters to account for non-linear effects.	Variable
Robust Weighted Least Squares	rWLS	Down-weights high-variance frames in a two-pass modeling procedure.	Variable
Wavelet Despiking	WDS	Identifies non-stationary artifacts across temporal scales.	Variable
Untrained ICA	uICA	Identifies artifact components based on spatial characteristics.	Variable
6 Motion Regressors	RP6	The standard six realignment parameters. Commonly used as a baseline.	Lower

Table 2: Impact of Real-Time Feedback on Motion in Task-fMRI [36]

Group	Average Framewise Displacement (FD)	Key Experimental Condition
No Feedback (Control)	0.347 mm	Standard instructions to hold still.
Real-Time Feedback (Intervention)	0.282 mm	Visual feedback (white/yellow/red cross) based on FD thresholds.

Experimental Protocols

Protocol: Multi-Dataset Evaluation of Frame Censoring and Other Techniques

Objective: To systematically compare the efficacy of frame censoring and other common motion correction strategies across diverse task-based fMRI datasets [3].

Datasets:

Source: Eight publicly available studies from OpenNeuro.
Content: 11 distinct tasks across child, adolescent, and adult participants.
Metrics: Analysis performed on 15 datasets, evaluating techniques using maximum t-values, ROI activation, and split-half reliability.

Motion Correction Workflow:

Data Preprocessing: Standard preprocessing including rigid-body realignment.
Motion Estimation: Calculate Framewise Displacement (FD) and DVARS for each volume.
Technique Application: Apply seven different motion correction methods to the same data:
- RP6 (6 realignment parameters)
- RP24 (24-term expansion of motion parameters)
- Frame Censoring based on FD
- Frame Censoring based on DVARS
- Wavelet Despiking (WDS)
- Robust Weighted Least Squares (rWLS)
- Untrained ICA (uICA)
Performance Quantification: For each method, compute outcome metrics on the processed data:
- Maximum group t-statistic (whole-brain and ROI).
- Mean parameter estimates within an ROI.
- Test-retest reliability of subject-level maps.

Key Conclusion: No single motion correction approach consistently outperformed all others across all datasets and tasks. The optimal choice depends on the specific dataset and the outcome metric of interest. However, frame censoring with modest data loss frequently performed well [3].

Protocol: Real-Time Feedback for Motion Reduction During Task

Objective: To assess whether real-time and between-run motion feedback can reduce head motion during an auditory word repetition task [36].

Participants: 78 adults (19-81 years), pseudorandomly assigned to feedback or no-feedback groups.

Procedure:

Task: Participants heard and repeated spoken words in noise during fMRI scanning.
Feedback Group: Viewed a crosshair that changed color based on real-time FD calculated by FIRMM software:
- White: FD < 0.2 mm
- Yellow: 0.2 mm ≤ FD < 0.3 mm
- Red: FD ≥ 0.3 mm
- Between runs, participants were shown a "Head Motion Report" with a performance score and motion graph and encouraged to improve.
Control Group: Given standard instructions to hold still but no real-time feedback.
Analysis: Head motion was compared between groups using average Framewise Displacement (FD).

Key Finding: The feedback group showed a statistically significant reduction in head motion compared to the control group [36].

Workflow and Signaling Diagrams

Motion Compensated Recovery Workflow

Motion Correction Technique Selection

Research Reagent Solutions

Table 3: Essential Software and Analytical Tools for Motion Correction Research

Tool Name	Type	Primary Function in Research	Key Use-Case
FIRMM	Software	Provides real-time visual feedback on participant head motion during scanning.	Reducing motion at the source during both resting-state and task-based fMRI [36].
aCompCor	Algorithm	Noise reduction via PCA on noise ROIs (WM/CSF) instead of mean signal regression.	Effectively mitigating motion artifacts in resting-state data without the need for scrubbing [37].
OpenNeuro	Data Repository	A public platform for sharing neuroimaging datasets.	Accessing diverse, publicly available fMRI datasets for method evaluation and replication [3].
Automatic Analysis	Pipeline Software	Automated and reproducible neuroimaging analysis pipelines.	Implementing and comparing multiple motion correction techniques consistently across datasets [3].

Framewise displacement (FD) calculation and volume censoring represent critical preprocessing steps in functional magnetic resonance imaging (fMRI) studies, particularly those involving high-motion populations such as children, elderly individuals, or clinical populations. This technical guide outlines comprehensive methodologies for implementing FD-based volume censoring workflows, framed within the broader context of determining optimal censoring thresholds for research. The protocols detailed below are essential for mitigating motion-related artifacts that can otherwise compromise data quality and introduce spurious correlations in functional connectivity analyses [9].

Frequently Asked Questions (FAQs)

Q1: What is framewise displacement (FD) and why is it important in fMRI preprocessing?

Framewise displacement quantifies the volume-to-volume head motion by calculating the derivative of the rigid body transformation parameters. It is computed from the six head motion parameters (three translations and three rotations) obtained during realignment. FD provides a scalar value for each time point that represents the extent of head movement, making it an essential metric for identifying motion-corrupted volumes that should be censored from analysis [38].

Q2: How is framewise displacement calculated mathematically?

The formula for FD at time t is calculated as follows [38]:

These translation and rotation components are combined to yield the total FD value for each volume.

Q3: What is the recommended FD threshold for volume censoring in resting-state fMRI?

Research indicates that a censoring threshold of FD > 0.3 mm is effective for maintaining data quality while retaining a sufficient number of participants. One study of first-grade children demonstrated that with this threshold, 83% of participants retained usable resting-state data despite extreme motion, meeting rigorous quality standards after preprocessing [9].

Q4: How does volume censoring integrate with other denoising techniques?

Volume censoring is typically implemented alongside complementary denoising methods. Studies have shown that combining volume censoring with independent component analysis (ICA)-based denoising effectively removes motion artifacts. Volume censoring targets severely motion-corrupted volumes, while ICA denoising addresses residual motion artifacts in retained volumes [9].

Q5: What are the consequences of insufficient motion correction in fMRI studies?

Inadequate motion correction can introduce spurious correlations in functional connectivity analyses, particularly problematic in resting-state fMRI. These artifacts are especially concerning when studying disorders or populations prone to movement, as even small movements can significantly impact results. Proper FD calculation and volume censoring are essential for valid statistical inference [39].

Experimental Protocols and Methodologies

Protocol 1: Comprehensive FD Calculation and Censoring Workflow

Table 1: Step-by-Step FD Calculation and Censoring Protocol

Step	Procedure	Software Implementation	Quality Control
1. Realignment	Align all functional images to the first image	SPM Realign: Estimate & Reslice [38]	Generate mean functional image and motion parameters
2. FD Calculation	Compute framewise displacement from motion parameters	Custom MATLAB scripts using rigid body transformation derivatives [38]	Plot FD across time for each participant
3. Threshold Determination	Apply FD > 0.3 mm threshold for volume censoring [9]	MATLAB, Python, or AFNI	Calculate percentage of volumes censored per participant
4. Volume Censoring	Remove or flag volumes exceeding threshold	AFNI's 3dToutcount or SPM-based custom scripts	Verify censoring using temporal signal-to-noise ratio (tSNR)
5. Denoising	Apply ICA-based denoising to retained volumes [9]	FSL MELODIC or CONN toolbox	Check correlation between motion and functional connectivity

Protocol 2: Quality Control Assessment for Censored Data

Table 2: Quality Control Metrics for Censoring Workflows

Quality Metric	Calculation Method	Acceptance Criteria	Purpose
Temporal Signal-to-Noise Ratio (tSNR)	Mean signal divided by standard deviation across time [9]	Minimal correlation with head motion	Assess temporal stability of signal
Participant Retention Rate	Percentage of participants retaining sufficient data after censoring [9]	>80% participants with adequate volumes	Evaluate feasibility of group analysis
Motion-Functional Connectivity Correlation	Correlation between FD and connectivity measures [9]	Minimal or non-significant correlations	Verify motion artifact removal
Framewise Displacement Distribution	Mean, max, and range of FD values	Study-specific based on population	Characterize motion patterns in sample

Workflow Visualization

Figure 1: FD Calculation to Volume Exclusion Workflow. This diagram illustrates the comprehensive pipeline from initial data processing through quality verification, highlighting the iterative nature of quality control.

Research Reagent Solutions

Table 3: Essential Tools for FD Calculation and Volume Censoring

Tool/Software	Primary Function	Implementation in Workflow
SPM (Statistical Parametric Mapping)	Image realignment and motion parameter estimation [38]	Coregistration of functional volumes and generation of motion parameters
MATLAB	Custom script implementation for FD calculation [38]	Calculation of framewise displacement from motion parameters
AFNI (Analysis of Functional NeuroImages)	Volume censoring and outlier detection [9]	Implementation of censoring thresholds and removal of motion-corrupted volumes
FSL MELODIC	Independent component analysis for denoising [9]	Removal of residual motion artifacts after volume censoring
CONN Toolbox	Functional connectivity analysis and motion correction [39]	Integration of censoring denoising in connectivity pipelines
Custom QC Scripts	Quality assessment and visualization [38]	Generation of motion-FD plots and censoring effectiveness metrics

Advanced Methodological Considerations

Threshold Optimization Framework

Selecting an appropriate FD threshold requires balancing data quality with participant retention. The optimal threshold of 0.3 mm emerged from systematic evaluation showing minimal correlation between head motion and functional connectivity metrics while retaining 83% of participants in a pediatric sample [9]. Researchers should validate this threshold for their specific populations, as motion characteristics vary across demographic and clinical groups.

Integration with Comprehensive Preprocessing

Volume censoring functions most effectively when integrated with a complete preprocessing pipeline including slice timing correction, spatial normalization, and smoothing. Studies recommend combining volume censoring with anatomical component-based noise correction (CompCor) and ICA-based denoising for optimal artifact removal [39]. This multi-pronged approach addresses both severe motion artifacts (through censoring) and more subtle noise sources (through additional denoising techniques).

Impact on Statistical Power

Implementation of volume censoring necessitates consideration of its impact on statistical power. Researchers should establish minimum data retention criteria (e.g., at least 5 minutes of clean data for resting-state analyses) to ensure sufficient statistical power for subsequent analyses. The trade-off between data quality and quantity should be carefully evaluated based on specific research questions and analytical approaches [9].

Navigating Trade-offs: Data Quality, Statistical Power, and Sample Bias

Troubleshooting Guides & FAQs

Frequently Asked Questions

Q1: What is the fundamental trade-off in framewise displacement censoring? The fundamental trade-off lies between removing motion-contaminated data to reduce noise and retaining enough data volumes to ensure reliable analysis and statistical power. Overly aggressive censoring (using very low FD thresholds) removes more noise but can lead to excessive data loss, potentially excluding entire subjects from analysis. Overly lenient censoring retains more data but may leave motion artifacts that inflate functional connectivity estimates and introduce bias [3] [9] [11].

Q2: How do I choose an initial FD threshold for my specific population? Start with these evidence-based thresholds, then validate for your data:

Population	Suggested Initial FD Threshold	Key Considerations
Adults [3]	0.2 - 0.3 mm	Balanced approach for typical motion levels.
Children (Age 6-8) [9]	0.3 mm	Effective for retaining subjects (83%) in high-motion cohorts.
Fetal Populations [11]	1.5 mm	Higher threshold adapted for extreme, unpredictable motion.

Q3: My data has very high motion. What strategies can I use besides adjusting the FD threshold? For high-motion data, consider these strategies:

Combine Techniques: Use volume censoring alongside other denoising methods like ICA-AROMA or FIX for synergistic effects [40] [9].
Data-Driven Scrubbing: Methods like "projection scrubbing" flag only volumes displaying abnormal statistical patterns, often preserving more data than strict motion-based scrubbing [41].
Concatenation: For resting-state data, if censoring creates multiple clean segments, concatenate them before functional connectivity analysis [9].

Q4: Does scan duration affect how I should approach censoring? Yes, longer scans provide more data, making them more resilient to data loss from censoring. A recent large-scale analysis found that 30-minute scans are, on average, the most cost-effective for achieving high prediction performance in brain-wide association studies. With a longer acquisition, you can afford more stringent censoring while retaining sufficient data for a reliable analysis [15].

Troubleshooting Common Problems

Problem: High subject exclusion rate after censoring.

Potential Cause: The FD threshold is too strict (too low) for your population's motion characteristics.
Solutions:
- Re-evaluate Threshold: Slightly increase the FD threshold. Even a small change (e.g., from 0.2mm to 0.3mm) can significantly increase subject retention while still removing the most severely corrupted volumes [9].
- Use Data-Driven Methods: Switch to a data-driven scrubbing method like projection scrubbing, which dramatically increases sample size by avoiding high rates of subject exclusion compared to motion scrubbing [41].
- Lengthen Scan Time: For future studies, plan for longer scan times (e.g., 30 minutes) to create a larger buffer of data per subject [15].

Problem: Motion-related artifacts persist in functional connectivity after censoring.

Potential Cause: Censoring alone may not remove all motion-related signal components, especially those with a gradual, low-frequency nature.
Solutions:
- Layer Denoising Techniques: Apply additional noise-reduction techniques after censoring. For task-based fMRI, FIX has been shown to optimally balance noise removal and signal conservation. For resting-state, ICA-based denoising (e.g., ICA-AROMA) is highly effective [40] [9].
- Expand Nuisance Regressors: Instead of just 6 motion parameters (RP6), use an expanded set (e.g., RP24) that includes derivatives and squared terms to better model the complex effects of motion [3] [11].

Problem: Uncertainty about whether my chosen censoring strategy is effective.

Potential Cause: Lack of quantitative validation for the chosen pipeline.
Solutions:
- Check Motion-FC Correlation: Calculate the correlation between participants' mean FD and their functional connectivity matrices. An effective pipeline should minimize this correlation [9] [11].
- Benchmark Predictive Power: Test if your processed data can predict a neurobiological feature (e.g., age or sex). Improved prediction accuracy after censoring indicates better signal-to-noise ratio [11].
- Assess Reliability: Use metrics like split-half reliability to see if your results are consistent across different parts of the data [3].

Quantitative Data for Study Design

The table below summarizes key findings from recent studies to inform your censoring protocol and study design.

Study	Population / Data Type	Key Finding / Recommended Practice	Performance Outcome
Marek et al. (2025) [15]	Large-scale BWAS (Resting & Task fMRI)	Optimal scan time: ~30 minutes is most cost-effective for prediction accuracy.	22% cost savings over 10-min scans; prediction accuracy increases with total scan duration (Sample Size × Scan Time).
Rogers et al. (2022) [3]	Multi-dataset Task fMRI	Modest censoring (1-2% data loss) with FD often improves results, but no single technique consistently outperforms all others.	Performance gains were task- and dataset-dependent.
Hoeppli et al. (2023) [40]	Task fMRI (Pain, Auditory)	FIX denoising optimally balanced noise reduction and signal conservation.	FIX conserved more signal than CompCor and ICA-AROMA for tasks with physiological changes.
Otero et al. (2023) [41]	Resting-state fMRI (HCP)	Data-driven "projection scrubbing" outperformed motion scrubbing, improving identifiability while censoring far fewer volumes.	Increased data retention without negatively impacting validity, reliability, and identifiability of functional connectivity.
Lyden et al. (2025) [11]	Fetal rs-fMRI	Censoring (1.5 mm FD) alongside regression is critical.	Improved prediction of gestational age/sex (55.2% accuracy with censoring vs. 44.6% without).

The Scientist's Toolkit: Essential Research Reagents & Materials

The table below lists key computational "reagents" and tools for implementing an effective censoring pipeline.

Item / Technique	Function / Purpose	Key Considerations
Framewise Displacement (FD) [3]	Quantifies head motion between consecutive volumes. Primary metric for defining censoring thresholds.	The most common metric for motion-based scrubbing. Threshold choice is population-dependent.
Data-Driven Scrubbing (e.g., Projection Scrubbing, DVARS) [41]	Identifies outlier volumes based on statistical patterns in the BOLD signal itself, not just head motion.	Can be more efficient than FD, preserving data and subjects while improving functional connectivity identifiability.
ICA-Based Denoising (e.g., ICA-AROMA, FIX) [40]	Automatically identifies and removes motion-related and other artifact components from the data.	Particularly effective when combined with censoring. FIX may be optimal for task-based fMRI.
Nuisance Regression (RP6, RP12, RP24) [3] [11]	Models out motion effects by including motion parameters and their expansions as regressors in the general linear model.	Often used before censoring. RP24 (Volterra expansion) is more comprehensive than RP6.
High-Performance Computing Cluster	Enables computationally intensive processing like ICA and large-scale machine learning prediction.	Essential for processing large datasets (N > 1000) and running advanced denoising algorithms in a reasonable time.

Experimental Protocols & Workflows

Detailed Protocol: Evaluating Censoring Thresholds

This protocol allows you to empirically determine the optimal FD threshold for your specific dataset.

Data Preprocessing: Begin with standard steps: slice-timing correction, realignment, and spatial normalization. Do not apply aggressive denoising or censoring at this stage.
Calculate Motion Metrics: Compute Framewise Displacement (FD) and DVARS for every volume of every subject.
Define Threshold Range: Create a set of FD thresholds for evaluation (e.g., 0.1, 0.2, 0.3, 0.4, 0.5 mm).
Censor Data: For each threshold, create a censored dataset where volumes exceeding the FD threshold are removed from subsequent analysis.
Run Analysis & Extract Metrics: For each censored dataset, perform your target analysis (e.g., functional connectivity or task activation) and calculate the following quality metrics:
- Data Retention: The percentage of subjects and volumes retained.
- Motion-FC Correlation: The correlation between mean FD and whole-brain functional connectivity strength [11].
- Prediction Accuracy: If possible, the accuracy of predicting a phenotypic or neurobiological variable (e.g., age) from the brain data [11].
- Reliability: Split-half reliability or fingerprinting identifiability of functional connectivity matrices [3] [41].
Select Optimal Threshold: Plot the quality metrics against the FD thresholds. The optimal threshold balances high data retention, low motion-FC correlation, and high predictive/reliability metrics.

Workflow Diagram: Censoring Threshold Evaluation

Detailed Protocol: Integrated Censoring & Denoising Pipeline

This protocol outlines a comprehensive approach for mitigating motion artifacts, combining multiple techniques supported by the literature.

Initial Processing: Perform realignment and slice-time correction.
Nuisance Regression: Regress out 24 motion parameters (RP24) generated during realignment to model linear and non-linear motion effects [11].
Temporal Filtering: Apply high-pass filtering (e.g., >0.008 Hz) to remove slow-frequency drifts.
Volume Censoring: Identify and censor volumes with FD exceeding your chosen threshold (e.g., 0.3mm). Use a "spike-free" FD calculation if possible.
ICA-Based Denoising: Apply ICA-AROMA (for resting-state) or FIX (particularly for task-fMRI) to automatically remove motion-related components not fully captured by previous steps [40].
Spatial Smoothing: Apply a moderate Gaussian kernel (e.g., FWHM = 4.5mm) to improve the signal-to-noise ratio [11].
Final Analysis: Proceed with functional connectivity or task-based GLM analysis on the fully preprocessed data.

Workflow Diagram: Integrated Censoring & Denoising

Mitigating Exclusion Bias in Motion-Correlated Populations and Traits

Frequently Asked Questions

What is exclusion bias in neuroimaging studies? Exclusion bias occurs when researchers remove participants with excessive in-scanner head motion from their analysis. This data is often "Missing Not at Random" (MNAR), because the probability of being excluded is directly related to key study variables, such as clinical symptom severity. This systematically biases the sample, limits the generalizability of findings, and can lead to underestimation of true neurobiological effects [42] [43] [44].
My data has high motion. Should I use a stricter or more lenient Framewise Displacement (FD) threshold? This is a core trade-off. Stricter thresholds (e.g., FD < 0.2 mm) better control for motion artifacts and reduce false positive results, but they exclude more data and can increase sample bias. Lenient thresholds retain more participants and improve generalizability but risk motion contamination inflating or distorting brain-behavior correlations [45] [2]. The optimal choice may depend on your specific research question and population.
Are certain participant traits correlated with in-scanner motion? Yes, numerous studies have shown that in-scanner head motion is correlated with a wide spectrum of participant characteristics. In the large ABCD study, factors such as demographic variables (e.g., socioeconomic status), behavioral measures (e.g., cognitive performance, impulsivity), and clinical symptoms were all related to the likelihood of a participant's data being excluded due to motion [43]. In studies of autism, children with more severe symptoms are excluded at higher rates [44].
What can I do beyond censoring to mitigate motion artifacts? A multi-pronged approach is recommended:
- Acquisition: Use real-time motion monitoring (e.g., FIRMM) to acquire sufficient low-motion data [4].
- Processing: Apply robust denoising techniques, such as ICA-based algorithms (ICA-AROMA or FIX), to remove motion-related noise without discarding entire volumes [42] [4].
- Analysis: Include motion metrics (e.g., mean FD) as covariates in group-level models and consider statistical methods like multiple imputation or targeted minimum loss-based estimation (TMLE) to account for missing data [42] [43] [44].
How effective is volume censoring? Volume censoring (or "scrubbing") is highly effective at removing motion-corrupted volumes. One study on pediatric data showed that combining censoring with ICA-based denoising produced data that met rigorous quality standards while retaining 83% of participants, even in a high-motion cohort [4].

Troubleshooting Guides

Problem: My Group Differences Are Driven by Motion

Description: You are concerned that significant findings between your clinical group (e.g., patients with psychosis or autism) and healthy controls are not due to true neural differences, but to unequal head motion between the groups.

Solution:

Quantify the Confound: Test for group differences in mean framewise displacement (FD). If the clinical group has significantly higher motion, your results are vulnerable to motion confounding [42] [44].
Apply Strict Censoring: Re-run your analysis using a strict censoring threshold (e.g., FD < 0.2 mm). This reduces motion-related false positives [45] [2].
Use Motion Impact Scores: For a trait-specific check, employ methods like SHAMAN (Split Half Analysis of Motion Associated Networks). This quantifies whether motion is causing over- or under-estimation of your specific trait-FC (functional connectivity) relationships, providing a p-value for the motion impact [2].
Report Comprehensively: Always report motion levels and exclusion rates for all groups. If results disappear after strict motion correction, they were likely artifactual [42] [43].

Problem: High Exclusion Rates Are Making My Sample Unrepresentative

Description: After applying standard quality control (QC), you find that a large portion of your clinical sample, particularly those with more severe symptoms, has been excluded. The remaining sample is no longer representative of the population you intend to study.

Solution:

Optimize Acquisition: Implement real-time motion monitoring (FIRMM) to collect more usable data during the scan session, reducing the need for post-hoc exclusion [4].
Prioritize Denoising over Censoring: Use aggressive denoising pipelines (e.g., ICA-AROMA) to salvage data from higher-motion subjects rather than excluding them entirely [42] [4].
Apply Advanced Statistical Methods: Treat excluded subjects as a missing data problem.
- Method: Use doubly robust targeted minimum loss-based estimation (TMLE) with an ensemble of machine learning algorithms.
- Protocol: This method models both the probability of being excluded (the missingness mechanism) and the outcome (brain connectivity), providing unbiased estimates even when data is not missing at random. This has been shown to recover more extensive and likely more accurate group differences in autism [44].
Be Transparent: Clearly document how many participants were excluded and for what reasons. Discuss the potential impact of this exclusion on the interpretability and generalizability of your findings [42] [43].

Problem: Choosing the Right Framewise Displacement Threshold

Description: You are unsure whether to use a lenient or strict FD threshold for censoring and need to understand the empirical trade-offs.

Solution: The table below summarizes the effects of different FD thresholds based on large-scale studies:

Framewise Displacement (FD) Threshold	Impact on Data Quality	Impact on Sample Bias & Effect Estimation	Recommended Use Case
Lenient (e.g., FD < 0.5 mm)	Retains more data volumes. Higher risk of residual motion artifacts inflating correlations [45].	Retains more participants, potentially improving sample representativeness. Overestimates trait-FC effects for motion-correlated traits [2].	Preliminary analyses or when population representativeness is the primary concern and motion is minimal.
Moderate (e.g., FD < 0.3 mm)	Good balance; effectively removes a large portion of motion artifacts when combined with denoising [4].	A practical compromise, though some bias may persist.	General use when aiming for a balance between data quality and sample size.
Strict (e.g., FD < 0.2 mm)	Most effective at removing motion artifacts. Dramatically reduces false positives and spurious correlations [45] [2].	Greatly increases exclusion rates, potentially biasing the sample towards less severe cases. Can lead to underestimation of true effects [42] [44] [2].	Essential for final analyses when controlling for motion artifacts is the highest priority.

Decision Workflow: The following diagram outlines the logical process for selecting and implementing a censoring strategy, highlighting the key trade-offs at each stage.

The Scientist's Toolkit

This table details key resources and methodologies for mitigating exclusion bias.

Tool / Method	Type	Primary Function & Relevance
Framewise Displacement (FD)	Metric	Quantifies volume-to-volume head movement. The primary measure for defining censoring thresholds [42] [45].
ICA-AROMA / FIX	Algorithm	ICA-based denoising tools that automatically identify and remove motion-related artifact components from fMRI data, reducing the need for volume censoring [42] [4].
FIRMM	Software	Framewise Integrated Real-time MRI Monitoring. Tracks head motion during scanning, allowing researchers to collect a target amount of low-motion data before ending the session [4].
TMLE	Statistical Method	Doubly robust Targeted Minimum Loss-Based Estimation. Corrects for bias introduced by non-random missing data (e.g., excluded high-motion subjects) [44].
SHAMAN	Analytical Method	Split Half Analysis of Motion Associated Networks. Assigns a trait-specific "motion impact score" to detect if brain-behavior associations are spuriously influenced by motion [2].
fMRIPrep	Software	A robust, standardized preprocessing pipeline for fMRI data that ensures consistent initial data handling, a foundation for all subsequent QC [17].

Experimental Protocol: Evaluating Trait-Specific Motion Impact with SHAMAN

Objective: To determine whether a specific trait-of-interest (e.g., cognitive score or clinical symptom) has a functional connectivity (FC) signature that is significantly impacted by residual head motion, after standard denoising.

Background: The SHAMAN method capitalizes on the fact that traits are stable over time, while motion is a varying state. It tests if the correlation between a trait and FC is different in high-motion versus low-motion segments of the same dataset [2].

Materials:

Preprocessed resting-state fMRI data and a corresponding trait measure for all subjects.
Framewise Displacement (FD) timeseries for each subject.
A parcellation atlas to define regions of interest (ROIs).

Procedure:

Data Preparation: For each subject, calculate the mean FD across their entire scan. Split the fMRI timeseries into two halves: a "high-motion" half (volumes with FD above the subject's mean) and a "low-motion" half (volumes with FD below the mean) [2].
Connectivity Calculation: For each subject and for each half of the data, compute a functional connectivity matrix (e.g., a correlation matrix between all ROI timeseries).
Edge-wise Effect Size Calculation:
- For each half of the data, perform a group-level regression (edge-wise) between the trait and each FC connection, covarying for necessary factors (e.g., age, sex). This produces a beta-coefficient for the trait-FC effect for every connection in both the high-motion and low-motion halves.
- Calculate the difference in these beta-coefficients for each connection: Δβ = β_high_motion - β_low_motion [2].
Motion Impact Score:
- The motion impact score for a given trait is the aggregate (e.g., sum) of all Δβ values across the brain.
- A positive score aligned with the overall trait-FC effect suggests motion causes overestimation.
- A negative score opposite the overall effect suggests motion causes underestimation [2].
Statistical Inference:
- Use permutation testing (e.g., randomly shuffling the high/low motion labels within subjects) to create a null distribution of motion impact scores.
- Compare the true score to this null distribution to obtain a p-value, indicating whether the motion impact is statistically significant [2].

Visualization: The workflow for the SHAMAN protocol is illustrated below.

Frequently Asked Questions (FAQs) on Motion Mitigation

Q1: What are the most effective strategies to reduce head motion during fMRI scanning in children?

A1: Effective strategies are multi-faceted. Distributing fMRI data acquisition across multiple, shorter same-day sessions has been shown to significantly reduce head motion in children [46]. Furthermore, using engaging, visually-based tasks (like Word-Picture Matching) instead of auditory-only tasks can lead to lower motion, as higher task engagement is associated with reduced movement [47] [48]. Proper preparation, including mock scanner training and creating a child-friendly environment, is also essential for success [47] [49].

Q2: How does motion differ between children and adults, and what are the implications for study design?

A2: Children consistently exhibit more head motion than adults, and motion inversely correlates with age [47] [48]. The pattern of motion can also differ; pediatric motion is often dominated by a "nodding" movement (pitch rotation and z-/y-translation) [48]. For study design, this means that protocols effective for adults (like inside-scanner breaks) may need adjustment for children, for whom splitting the scan into multiple sessions is more beneficial [46]. Studies comparing groups with different inherent motion (e.g., children vs. adults, clinical vs. healthy controls) must implement rigorous motion correction to prevent confounded results.

Q3: Beyond standard realignment, what are proven methods for correcting motion artifacts in task-based fMRI?

A3: Several advanced methods are available, and the optimal choice can depend on your dataset and outcome metrics [3].

Frame Censoring (SCRUBBING): Statistically excluding motion-contaminated volumes from analysis. Performance depends on the threshold chosen (see Table 2) [3].
Motion Regressors: Including the 6 realignment parameters (RP6) or a 24-term expansion (RP24) that includes derivatives and squared terms, as nuisance regressors in the general linear model [3].
Data-Driven Approaches: Techniques like Wavelet Despiking (WDS) and Robust Weighted Least Squares (rWLS) can simultaneously reduce multiple types of noise and motion artifacts [3].
Integrated Correction in Pipelines: Using robust preprocessing pipelines like fMRIPrep, which performs motion correction, field unwarping, and normalization in an integrated, standardized manner [17].

Troubleshooting Guides

Guide 1: Diagnosing and Addressing Excessive Motion in Pediatric Scans

Symptom	Possible Cause	Solution
Systematic motion spikes during a specific task	Task-induced movement (e.g., button presses, overt speech) [47]	- Switch to covert responses where possible.- Optimize task timing to be event-related rather than blocked design [50].- Use visual engagement tasks to promote stillness [47].
Progressive increase in motion throughout a scan run	Fatigue, loss of attention, or discomfort [46]	- Implement shorter scan runs.- Introduce brief, structured inside-scanner breaks to allow for micro-movements [46].
Consistently high motion across all subjects in a group	Inadequate participant preparation or poor study design for the population [46] [49]	- Enhance pre-scan training using a mock scanner [47] [46].- Use child-friendly furnishings and allow a parent to be present [49].- Consider distributing scanning over multiple sessions [46].

Guide 2: Selecting a Frame Censoring Threshold for Your Data

This guide helps you choose a threshold for Framewise Displacement (FD), a common metric for identifying motion-corrupted volumes.

FD Threshold	Use Case & Rationale	Potential Drawback
FD > 0.2 mm	A liberal threshold; useful for preserving data in studies with low-motion populations or when the number of timepoints is critical.	Retains volumes with smaller, yet potentially impactful, motion artifacts, risking residual contamination.
FD > 0.3 mm	A moderate and commonly used threshold [48]. A multi-dataset evaluation found that censoring 1-2% of data (often corresponding to this range) can improve results [3].	Represents a balance between data quality and data retention.
FD > 0.5 mm	A conservative threshold; appropriate for high-motion populations (e.g., young children) or when the highest data fidelity is required for analysis.	Can lead to significant data loss, which may compromise statistical power and model estimation if a large portion of the scan is censored [3].

Recommendation: There is no single optimal threshold for all studies. The threshold should be chosen based on the motion characteristics of your specific population and the requirements of your analysis. It is highly recommended to perform sensitivity analyses to ensure your findings are robust across different threshold choices [3].

Quantitative Data on Motion Characteristics

Table 1: Motion Variation by Age, Sex, and Task

The following table summarizes key findings from a large-scale study (N=323) investigating head motion in children during fMRI language tasks [47].

Factor	Impact on Head Motion	Key Statistical Finding
Task Type	Tasks involving visual engagement (e.g., Word-Picture Matching) suffered significantly less motion than auditory-only tasks (e.g., Story Processing, Verb Generation) [47].	Significant main effect of language task on motion (p < 0.05).
Age	Older children exhibited less head motion than younger children [47].	Significant main effect of age on motion (p < 0.05).
Sex	The effect of task on motion was influenced by the subject's sex, indicating interactive effects [47].	The main effect of task was significantly affected by sex and its interaction with age (p < 0.05).

Table 2: Performance Comparison of Motion Correction Methods

This table synthesizes results from a multi-dataset evaluation (8 public datasets, 11 tasks) comparing various motion correction approaches for task-based fMRI [3]. Performance was measured via metrics like maximum group t-statistic and split-half reliability.

Correction Method	Brief Description	Performance Summary
RP6 / RP24	Nuisance regression using 6 or 24 (expanded) motion parameters.	The standard baseline approach. Other methods often provide comparable or modest improvements over it [3].
Frame Censoring	Removing high-motion volumes based on FD or DVARS.	Modest amounts of censoring (1-2% data loss) showed consistent improvements. No single FD/DVARS threshold was optimal across all datasets [3].
Wavelet Despiking (WDS)	A data-driven method that identifies artifacts based on non-stationarity in wavelet space.	Performance was frequently comparable to what could be achieved with frame censoring [3].
Robust WLS (rWLS)	A two-pass modeling procedure that down-weights high-variance (high-motion) frames.	Showed comparable performance to other advanced techniques, with no single method consistently outperforming all others [3].

Experimental Protocols

Protocol 1: Child-Friendly fMRI Scanning Session

Objective: To acquire high-quality fMRI data in pediatric populations by minimizing head motion through behavioral and design strategies.

Materials: Mock scanner, child-friendly furnishings, audiovisual presentation system, response device, padding and straps for immobilization.

Procedure:

Pre-Scan Preparation:
- Conduct a mock scanner training session on the same day as the actual scan to familiarize the child with the environment and sounds [46].
- Provide a child-friendly orientation, which may include a video introduction and specific training on the tasks [47].
In-Scanner Procedures:
- Use comfortable, customized head molds or padding to immobilize the head [48].
- Structure the acquisition into multiple, shorter sessions distributed across the same day, rather than one long session [46].
- For each functional run, incorporate brief breaks to counteract temporal drift in motion [46].
- Utilize engaging, visually-based tasks to maintain attention and reduce movement [47] [48].
Real-Time Monitoring:
- Monitor the subject via closed-circuit TV during scanning [47].
- If motion is detected, provide immediate, calm reminders via the intercom about the importance of staying still [47].

Protocol 2: Evaluating Frame Censoring Thresholds

Objective: To determine the impact of different framewise displacement (FD) thresholds on task-based fMRI results as part of a sensitivity analysis.

Materials: Preprocessed fMRI data, calculated FD timeseries for each subject, fMRI analysis software (e.g., SPM, FSL, AFNI).

Procedure:

Data Preparation: Preprocess your fMRI data using a standardized pipeline (e.g., fMRIPrep [17]) that includes motion realignment.
Calculate Motion Metrics: Compute Framewise Displacement (FD) for each subject.
Define Thresholds: Select a range of FD thresholds for evaluation (e.g., 0.2mm, 0.3mm, 0.4mm, 0.5mm) [3] [48].
Run Parallel Analyses: For each threshold, conduct a first-level analysis for all subjects using a generalized linear model (GLM) that includes:
- Your task regressors.
- Nuisance regressors for motion parameters (e.g., RP6 or RP24).
- A set of "scan-nulling" nuisance regressors for every volume where FD exceeds the current threshold [3].
Compare Outcomes: At the group level, compare the resulting statistical maps for key metrics:
- Maximum t-statistic in a predefined ROI or across the whole brain.
- Mean parameter estimates (activation) within an ROI.
- Split-half reliability of subject-level maps [3].
Decision: Choose a threshold that offers a good balance between data quality (improved t-statistics/reliability) and data retention (acceptable levels of censoring). Document this process and the final chosen threshold.

Motion Mitigation Strategy Diagram

Motion Mitigation Workflow

The Scientist's Toolkit: Research Reagent Solutions

Item	Function in Motion Mitigation
Mock Scanner	A replica MRI setup used to acclimate participants (especially children) to the scanning environment, sounds, and procedures, thereby reducing anxiety and motion [47] [46].
Visual Stimulation System	Presents engaging, visual tasks or movie clips to maintain participant attention and stillness, which has been shown to reduce mean motion and temporal drift [47] [48].
fMRIPrep Software	A robust, open-source preprocessing pipeline that standardizes and performs key steps like motion correction, field unwarping, and normalization, ensuring reproducible data preparation [17].
Framewise Displacement (FD)	A scalar metric quantifying volume-to-volume head movement. It is the primary measure for identifying and censoring motion-corrupted volumes in the data [3] [48].
Motion Parameter Regressors	The time-series of the 6 rigid-body realignment parameters (3 translations, 3 rotations) and their expansions, used as nuisance regressors in the GLM to statistically account for motion-related variance [3].

Optimizing Scan Duration to Compensate for Censoring Data Loss

Why is volume censoring necessary in resting-state fMRI research?

Volume censoring (or "scrubbing") is a critical step to mitigate the confounding effects of head motion on functional connectivity measures. Even after standard denoising procedures and nuisance regression, systematic biases can persist in the data.

Lingering Motion Effects: Research shows that after standard denoising with methods like ABCD-BIDS, 23% of the fMRI signal variance can still be explained by head motion, and a strong, negative correlation between motion and functional connectivity persists [14]. Another study on fetal rs-fMRI found that while nuisance regression reduced the association between motion and the BOLD time series, it was not fully effective in reducing the impact of motion on whole-brain functional connectivity profiles [11].
Impact on Trait Correlations: In large-scale studies, residual motion can significantly overestimate or underestimate the relationship between functional connectivity and behavioral or clinical traits. In one analysis, 42% of traits showed significant motion overestimation scores, and 38% showed significant underestimation scores after standard denoising [14].

How does censoring introduce bias, and why is scan duration key?

While censoring is effective, it is not a perfect solution. Removing high-motion volumes can lead to the exclusion of entire participants if the remaining data is insufficient, which systematically biases the sample.

Non-Random Data Loss: Participant characteristics such as demographic factors, executive functioning, and certain clinical conditions are potent predictors of in-scanner motion. Consequently, listwise deletion of participants for having too much censored data results in a sample that is no longer representative of the broader population [43].
The Fundamental Trade-off: The core challenge is a "natural tension" between the need to remove motion-contaminated data to reduce spurious findings and the need to retain enough data to avoid biasing the sample and to reliably estimate functional connectivity [14]. The solution to this trade-off lies in acquiring more data per participant to ensure that even after censoring, a sufficient amount of high-quality data remains for analysis.

What is the quantitative relationship between censoring thresholds and data loss?

The choice of a Framewise Displacement (FD) threshold directly determines the amount of data available for analysis. The table below summarizes key data loss percentages from recent studies.

Table 1: Data Retention and Censoring Thresholds in Recent Studies

FD Censoring Threshold	Reported Data Loss / Retention	Context and Findings
< 0.2 mm	Reduced traits with significant motion overestimation from 42% to 2% [14]	Effective for mitigating motion overestimation, though did not reduce motion underestimation effects.
< 0.2 mm	Not explicitly quantified, but a common benchmark in the field.	Used in systematic pipeline evaluations as a standard stringent threshold [51].
< 0.3 mm	Data retained for analysis was 55.2% of the original sample [43].	In the ABCD study, this threshold led to the exclusion of nearly half of the participants, highlighting severe selection bias.
< 0.5 mm	Data retained for analysis was 85.9% of the original sample [43].	A more liberal threshold that retained most of the sample, though with a higher risk of residual motion artifact.
< 0.9 mm	Data retained for analysis was 95.1% of the original sample [43].	A very liberal threshold that minimizes participant exclusion but may not adequately control for motion effects.

What is the recommended protocol for optimizing scan duration?

To ensure sufficient high-quality data remains after censoring, follow this experimental planning workflow:

Step-by-Step Methodology:

Define FD Threshold: Based on your population and research question, select an appropriate FD threshold. For clinical populations known to move more, a less stringent threshold (e.g., 0.3-0.5 mm) may be necessary to avoid excessive exclusion, but this must be balanced with the need to control for motion [43].
Estimate Anticipated Data Loss: Use pilot data or published benchmarks (like those in Table 1) to estimate the percentage of volumes you expect to censor. For a stringent threshold of 0.2 mm, data loss can be substantial.
Determine Minimum Usable Data: Establish the minimum amount of clean data required for reliable connectivity estimation. While there is no universal standard, many studies aim for at least 10 minutes (600 seconds) of clean data after censoring. This provides a robust basis for calculating correlation matrices [14] [51].
Calculate Required Scan Duration: Use the formula to compute the total scan duration needed. For example, if you need 10 minutes of clean data and estimate 40% data loss, you should plan for a 16.7-minute initial scan.
Implement and Validate: After data acquisition, apply your censoring protocol. If a participant has less than the minimum required clean data, they must be excluded, underscoring the importance of accurate initial planning [43].

The Scientist's Toolkit: Essential Research Reagents & Solutions

Table 2: Key Tools for Censoring and Motion Mitigation Research

Tool / Solution	Function	Example Use Case / Note
Framewise Displacement (FD)	Quantifies head motion from volume-to-volume. The primary metric for identifying "bad" frames to censor [38].	Calculated from rigid body realignment parameters. A threshold is required to define "high motion" [3].
Volume Censoring (Scrubbing)	A statistical technique that excludes high-motion volumes from analysis, typically by adding nuisance regressors to the GLM [3].	Effectively reduces spurious motion-FC relationships but causes data loss [14] [11].
Nuisance Regression	Removes variance associated with motion parameters and other non-neural signals (e.g., CSF, WM) from the BOLD data.	Often used before censoring. Alone, it is insufficient to remove all motion artifacts [11].
SHAMAN Analysis	A novel method to assign a "motion impact score" to specific trait-FC relationships, distinguishing over- and underestimation [14].	Helps diagnose whether residual motion is biasing your specific findings after denoising and censoring.
Portrait Divergence (PDiv)	An information-theoretic measure of dissimilarity between whole-network topologies [51].	Useful for evaluating test-retest reliability of functional connectomes after different processing pipelines, including censoring.

Frequently Asked Questions

Is there a single best FD threshold for all studies?

No. The optimal threshold is a balance between data quality and data retention, which depends on your specific study population and goals. Stringent thresholds (e.g., FD < 0.2 mm) best control for motion but cause more data loss and potential bias. More liberal thresholds (e.g., FD < 0.5 mm) retain more participants but risk residual motion effects. Pilot studies on your specific population are recommended to find the best balance [43] [3].

Can I use other techniques instead of censoring to avoid data loss?

While other techniques like ICA-based denoising exist, censoring is one of the most effective methods for mitigating motion artifacts. Studies have shown that without censoring high-motion participants, other methods often fail to fully control for motion-related noise. Therefore, a combination of nuisance regression followed by volume censoring is widely recommended [11] [43].

How do I handle the inevitable bias from excluding high-motion participants?

Since exclusion is often non-random, it is crucial to account for it statistically. Rather than simply using listwise deletion, consider using multiple imputation or other missing data handling strategies to correct for biases introduced by non-random exclusions in functional connectivity analyses [43]. Always report your quality control procedures in detail to enhance the reproducibility of your research.

Alternative Strategies When High Censoring Rates are Unavoidable

FAQs on High Censoring Rates

What defines a "high censoring rate," and when should I become concerned? In survival analysis, censoring occurs when the time-to-event information is incomplete for some study participants [52]. There is no universal percentage that defines "high," but rates approaching or exceeding 50% can be a concern, particularly if the censoring is "informative" [53] [54]. In such cases, standard estimators like the Kaplan-Meier can become unreliable, and the median survival time may be difficult to estimate precisely [53]. Concerns should be highest when the reason for censoring (e.g., a patient leaving a study due to side effects) is likely related to the outcome under investigation.

Is the Kaplan-Meier estimator biased with high censoring rates? The Kaplan-Meier estimator itself is not inherently biased by a high proportion of random censoring [53]. One of its key advantages is the ability to use information from censored individuals up until the point they are censored [52]. However, its reliability can decrease, leading to large jumps or wide confidence intervals in the survival curve, especially at later time points where fewer individuals remain at risk [52] [53]. The primary issue arises with informative censoring, where the probability of being censored is related to an individual's unobserved risk of the event, which can bias the results [53].

What are the primary analytical alternatives to handle high censoring? When faced with high censoring, researchers can consider several analytical strategies, each with its own strengths and assumptions. The table below summarizes the core approaches.

Table: Analytical Methods for Highly Censored Data

Method	Core Principle	Key Assumption	Suitability for High Censoring
Cox Proportional Hazards Model [52] [53]	Semi-parametric model that relates covariates to the hazard rate.	Censoring is non-informative; hazard ratios are constant over time.	Excellent; the hazard ratio remains an unbiased effect measure even with high censoring [53].
Parametric Survival Models (e.g., Weibull, Exponential) [52]	Assumes a specific statistical distribution for the survival times.	The chosen distribution (e.g., Weibull) correctly models the underlying hazard.	Good; uses a likelihood-based approach that incorporates censored observations.
Data Augmentation (PSDATA/nPSDATA) [54]	Uses parametric or non-parametric algorithms to artificially expand a small, highly-censored dataset.	The augmentation process accurately represents the true data-generating mechanism.	Specifically designed to address the "small sample size and highly censored" problem.

In fMRI research, how does framewise displacement (FD) thresholding relate to censoring? In resting-state fMRI, "censoring" (or "scrubbing") refers to removing individual brain volumes corrupted by excessive head motion [4]. The Framewise Displacement (FD) metric quantifies volume-to-volume head movement. Setting an FD threshold (e.g., 0.2 mm or 0.3 mm) is a common strategy to exclude these bad volumes [4] [14]. However, this creates a trade-off: stricter thresholds (e.g., FD < 0.2 mm) reduce motion artifacts but increase the rate of data censoring, potentially biasing sample representation if high-motion subjects are systematically different [14]. The optimal threshold balances artifact removal against data retention.

Table: Impact of Different FD Censoring Thresholds

FD Threshold	Impact on Motion Artifact	Impact on Data Retention	Key Finding
< 0.2 mm	Effectively removes spurious motion-trait relationships.	High data loss; may exclude many participants.	Reduced traits with significant motion overestimation from 42% to 2% [14].
< 0.3 mm	Robustly removes motion-corrupted volumes.	Better data retention; more inclusive.	Allows 83% of participants to be retained while meeting rigorous quality standards [4].

Troubleshooting Guides

Problem: High censoring leads to unreliable model predictions and poor generalizability. Solution: Implement survival data augmentation techniques.

Assess Your Data: Confirm your dataset has a small sample size and a high rate of (non-informative) censoring [54].
Choose an Algorithm:
- PSDATA (Parametric): If you are willing to assume an underlying parametric distribution (e.g., Weibull) for the survival times. This is a model-based approach [54].
- nPSDATA (non-Parametric): If you prefer not to assume a specific survival distribution. This is a more flexible, data-driven approach [54].
Generate Synthetic Data: Apply the chosen algorithm to create an augmented dataset that is larger and contains more complete (uncensored) case information.
Validate and Refit: Build your survival model (e.g., Random Survival Forest or Cox model) on the augmented dataset. Use cross-validation to ensure the augmented data leads to improved and realistic predictive performance [54].

Problem: After stringent motion censoring in fMRI, my sample size is too small, and I risk excluding a critical sub-population. Solution: Adopt a multi-faceted acquisition and processing pipeline designed for high-motion cohorts.

Real-Time Monitoring: Use real-time motion monitoring software during scanning to ensure you acquire sufficient low-motion data across multiple runs [4].
Aggressive Censoring: Apply volume censoring post-hoc using a justified FD threshold (e.g., 0.3 mm) to remove the most severely corrupted volumes [4].
Advanced Denoising: Instead of traditional temporal filtering and nuisance regression, use an Independent Component Analysis (ICA)-based denoising approach. This can effectively remove much of the residual motion artifact that remains after censoring [4].
Concatenate Data: Combine the cleaned data from multiple runs for each subject into a single dataset for final analysis [4].

The following diagram illustrates this multi-step troubleshooting workflow for fMRI data.

The Scientist's Toolkit

Table: Essential Research Reagents and Tools

Item	Function in Context
Framewise Integrated Real-time MRI Monitoring (FIRMM)	Software that provides real-time feedback on participant head motion during an fMRI scan, allowing researchers to collect sufficient low-motion data [4].
Independent Component Analysis (ICA)	A computational method used to separate a multivariate signal into additive, statistically independent components. It is highly effective for identifying and removing motion-related artifacts from fMRI data without relying solely on censoring [4].
Cox Proportional Hazards Model	A semi-parametric regression model that is robust for analyzing survival data with high rates of non-informative censoring, providing unbiased estimates of hazard ratios [52] [53].
Data Augmentation Algorithms (PSDATA/nPSDATA)	Computational strategies designed to artificially increase the size and information content of a small, highly-censored survival dataset, thereby improving the predictive power of subsequent models [54].
Parametric Survival Distributions (Weibull/Exponential)	Assume a specific shape for the hazard function over time. These likelihood-based models can provide efficient estimates when their distributional assumptions are met, even with censoring [52].

Evaluating Threshold Effectiveness Across Datasets and Methodologies

Troubleshooting Guides

Guide 1: Addressing High Motion in Pediatric and Challenging Populations

Problem: Researchers working with high-motion populations, such as young children, find that standard framewise displacement (FD) thresholds lead to excessive data loss, compromising study power.

Solution: Implement a combined approach of real-time monitoring and a preprocessing pipeline robust to high motion.

Real-Time Motion Feedback: Use software like FIRMM (Real-time fMRI Motion Monitoring) during scanning sessions. Provide visual feedback to participants, displaying a white cross when motion is low (FD < 0.2 mm), yellow for moderate motion (FD 0.2-0.3 mm), and red for high motion (FD ≥ 0.3 mm). Providing a head motion report between runs further encourages participants to remain still [36].
Robust Preprocessing Pipeline: Adopt a pipeline that combines volume censoring (also known as "scrubbing") and independent component analysis (ICA)-based denoising.
- Volume Censoring: Identify and remove motion-corrupted volumes. A threshold of FD > 0.3 mm has been shown to be effective, even in cohorts with extreme motion [9].
- ICA Denoising: Use ICA to automatically identify and remove motion-related artifacts from the retained data [9].
Expected Outcome: This methodology has been proven to yield usable resting-state data while retaining a high percentage of participants (e.g., 83% in a first-grade cohort) even when motion is extreme [9].

Guide 2: Mitigating Bias from Non-Random Censoring

Problem: Systematically excluding high-motion participants or timepoints can introduce bias into the dataset, as motion is often correlated with specific phenotypic or clinical traits.

Solution: Prioritize motion reduction strategies and consider analytical corrections over aggressive censoring.

Prevention Over Exclusion: Invest in participant training and comfort to minimize motion at the source. This includes using a mock scanner, proper head stabilization, and engaging stimuli (e.g., movies during structural scans) [55].
Analytical Techniques: If significant motion-related bias is suspected, consider advanced statistical methods. One approach involves using Empirical Bayes Estimates (EBEs) for imputation. The process involves:
- Fitting the model to the censored dataset.
- Using the resulting EBEs to generate individual predictions for the censored (high-motion) timepoints.
- Creating an imputed dataset that combines the original observed data with the model-predicted values for censored periods.
- Refitting the model to the imputed dataset to obtain less biased parameter estimates [56].
Expected Outcome: This imputation method has been shown in simulation studies to significantly reduce bias in parameter estimates (e.g., reducing bias in a tumor growth rate estimate from 28% to 6%) [56].

Guide 3: Optimizing the Trade-off Between Scan Time and Sample Size

Problem: With limited resources, researchers must decide whether to collect more data per participant (longer scan time) or from more participants (larger sample size).

Solution: Design studies based on the "total scan duration" (sample size × scan time per participant) to maximize phenotypic prediction accuracy.

The Interchangeability Principle: For scan times up to 20 minutes, sample size and scan time are largely interchangeable. A study with 100 participants scanned for 15 minutes can achieve similar prediction accuracy as a study with 150 participants scanned for 10 minutes, as both represent a total scan duration of 1500 minutes [57].
Diminishing Returns: Beyond 20-30 minutes of scan time, the benefits of longer scans diminish relative to increasing the sample size. However, when factoring in the overhead cost of recruiting each new participant (e.g., recruitment, screening), longer scan times can be more cost-effective [57].
Recommendation: For most brain-wide association studies (BWAS), the most cost-effective scan time is at least 20 minutes, with 30 minutes being optimal on average. This can yield cost savings of around 22% compared to using 10-minute scans [57].

Frequently Asked Questions (FAQs)

Q1: What is the most recommended framewise displacement (FD) threshold for censoring volumes in resting-state fMRI? A: A threshold of FD > 0.3 mm is a widely used and rigorously validated starting point. Research on high-motion pediatric cohorts has shown that this threshold, when combined with ICA denoising, effectively removes motion artifacts while retaining a majority of subjects and data [9]. This threshold is also embedded in processing pipelines like the ABCD-HCP pipeline [58].

Q2: How does the choice of FD threshold impact my final analysis and results? A: The FD threshold directly balances data quality and quantity. An overly strict threshold (e.g., FD > 0.2 mm) can lead to the loss of too many volumes or entire subjects, reducing statistical power and potentially introducing bias if motion is related to your study conditions. A too-lenient threshold (e.g., FD > 0.5 mm) retains more data but risks including motion-corrupted volumes, which can create spurious functional connectivity patterns and inflate false positive rates [9] [59].

Q3: Are there dataset-specific considerations for motion censoring in ABCD, HCP, and TCP? A: Yes, key differences are summarized in the table below.

Dataset	Primary Population	Key Censoring Considerations	Recommended Resources
ABCD [60] [58]	Children (9-10 years at baseline)	Uses the ABCD-HCP pipeline which includes a respiratory motion filter to improve FD estimates. Implements censoring at FD > 0.3 mm.	ABCD-HCP BIDS fMRI Pipeline [58]
HCP [59] [55]	Healthy Young Adults	Known for high-quality data. Quality control databases are available to select subjects/scans with specific motion characteristics for benchmarking.	HCP Resting-State QC Database [59]
TCP [57]	Transdiagnostic (Various disorders)	While specific censoring thresholds may vary by site, the principles of using ~0.3 mm FD and optimizing for longer scan times (≥20 min) apply.	Study design tools from Yeo lab [57]

Q4: What is the single most effective step I can take to improve data quality concerning motion? A: The most effective step is to prevent excessive motion during the scan. Techniques such as thorough participant training, using a mock scanner, ensuring comfortable head stabilization, and providing real-time feedback on head motion have been proven to reduce framewise displacement significantly, preserving data integrity from the outset [55] [36].

Experimental Protocols & Workflows

Protocol: The ABCD-HCP fMRI Preprocessing and Censoring Pipeline

This protocol details the methodology used for processing large-scale datasets like the Adolescent Brain Cognitive Development (ABCD) Study, incorporating motion censoring as a core step [58].

1. Preprocessing Stages (Stages 1-5): The pipeline begins with standard HCP-style minimal preprocessing, which includes:

PreFreeSurfer/FreeSurfer/PostFreeSurfer: Anatomical data processing, brain extraction, tissue segmentation, and surface reconstruction [58].
FMRIVolume: Functional data processing, including distortion correction, motion realignment (resulting in 6 motion parameters), and registration to anatomical and standard (MNI) spaces [58].

2. DCAN BOLD Processing (DBP) - Stage 6: This is the critical stage for nuisance regression and motion censoring.

Standard Pre-processing: The data are de-meaned and de-trended. A general linear model (GLM) is used for denoising, including regressors for:
- Signal from white matter and CSF.
- Global signal regression (GSR).
- The 6 motion parameters and their derivatives (Volterra expansion) [58].
Respiratory Motion Filter (Optional): An optional filter can be applied to the motion parameters to remove artifactual motion caused by respiration, leading to more accurate FD estimates [58].
Motion Censoring: Data are labeled as "bad" frames if they exceed the FD threshold of 0.3 mm. These frames are removed when calculating the denoising GLM. The pipeline outputs temporal masks for a range of FD thresholds (from 0 to 0.5 mm in 0.01 mm steps) for further flexibility [58].
Band-Pass Filtering: The data are band-pass filtered (0.008 - 0.09 Hz) to focus on the frequencies of interest for resting-state connectivity [58].

The workflow for this protocol is standardized and can be visualized as follows:

The Scientist's Toolkit: Research Reagent Solutions

The following tools and resources are essential for implementing effective motion censoring and analysis across major neuroimaging datasets.

Tool / Resource	Function	Application Context
FIRMM Software [36]	Provides real-time feedback on head motion during scanning, allowing participants to adjust and reduce motion.	Data Acquisition, Motion Prevention
ABCD-HCP BIDS fMRI Pipeline [58]	A standardized processing pipeline that incorporates motion censoring (FD > 0.3mm), respiratory filtering, and global signal regression.	Data Preprocessing, Motion Censoring
HCP Resting-State QC Database [59]	A public database of quality control metrics for the Human Connectome Project, enabling researchers to select subjects with specific motion profiles for benchmarking.	Data Quality Control, Benchmarking
FSL's `fsl_motion_outliers`	A widely used tool for calculating Framewise Displacement (FD) and identifying outlier volumes based on motion.	Motion Quantification, Censoring
ICA-based Denoising (e.g., FSL FIX, AROMA)	Automated methods to identify and remove motion-related artifacts from fMRI data using Independent Component Analysis.	Data Denoising, Artifact Removal
Optimal Scan Time Calculator [57]	An online tool to help design studies by optimizing the trade-off between sample size and scan time per participant for prediction accuracy.	Experimental Design, Power Analysis

FAQs: Understanding Motion Impact Scores

What is a Motion Impact Score, and why is it necessary? A Motion Impact Score is a trait-specific metric that quantifies how much head motion artifacts are likely to bias the observed relationship between a phenotypic trait and functional connectivity (FC). It is necessary because standard denoising methods, while reducing overall noise, do not completely remove motion artifact. This residual artifact can systematically skew brain-behavior relationships, especially for traits inherently correlated with movement (e.g., symptoms of ADHD). The score helps researchers distinguish genuine neurobiological findings from motion-induced false positives or false negatives [14].

How does the SHAMAN method calculate a Motion Impact Score? The Split Half Analysis of Motion Associated Networks (SHAMAN) method calculates the score by capitalizing on the stability of traits over time. For each participant, the resting-state fMRI timeseries is split into high-motion and low-motion halves. It then tests whether the correlation structure between the trait and FC is significantly different between these two halves. A significant difference indicates that motion impacts the trait-FC relationship. The direction of this effect determines if motion causes overestimation (score aligned with the trait-FC effect) or underestimation (score opposite the trait-FC effect) [14].

My data has already been denoised with a standard pipeline. Do I still need to check for trait-specific motion impacts? Yes. Research on the ABCD dataset showed that even after comprehensive denoising (including global signal regression and motion parameter regression), 42% of the tested traits still showed significant motion overestimation scores. This demonstrates that standard denoising is necessary but not sufficient to eliminate motion-related bias for all traits. Checking for trait-specific motion impacts is a crucial additional step for validating your findings [14].

What is the trade-off between strict motion censoring and sample bias? Strict motion censoring (e.g., using a low framewise displacement threshold) effectively removes contaminated data but can systematically exclude participants with higher motion. Since motion often correlates with demographic, clinical, and behavioral traits (e.g., age, BMI, psychiatric symptoms), this can bias your sample. The consequence is that list-wise deletion of participants makes the final sample non-representative and can lead to biased statistical estimates of brain-behavior relationships [43].

Troubleshooting Guides

Problem: A Trait Shows a Significant Motion Overestimation Score

Description: After running the SHAMAN analysis, you find that your trait of interest has a significant positive motion impact score, suggesting your observed trait-FC relationship may be falsely inflated by motion.

Solution Steps:

Re-run analysis with stricter censoring: Re-process your data applying volume censoring at a more stringent framewise displacement (FD) threshold, such as FD < 0.2 mm [14].
Re-calculate the trait-FC effect: Using the censored data, re-calculate the primary trait-FC effect you are investigating.
Compare effect sizes: Determine if the effect size of the trait-FC relationship is reduced after stricter censoring. A substantial reduction supports the inference that the original finding was likely overestimated by motion.
Report transparently: In your manuscript, report both the original and motion-corrected results, along with the Motion Impact Score, to provide a complete picture of the influence of motion on your findings [14].

Problem: High Participant Exclusion Rate Due to Censoring

Description: Applying volume censoring to meet data quality standards results in the exclusion of a large portion of your sample, raising concerns about statistical power and sample representativeness.

Solution Steps:

Quantify the bias: Characterize the participants excluded due to high motion. Compare them to the included participants on key demographic (e.g., socioeconomic status, race/ethnicity) and clinical variables. This will clarify how your final sample may be biased [43].
Consider multiple imputation: To address bias from non-random missing data, use statistical techniques like multiple imputation. These methods can model the missing data, allowing for less biased and more generalizable estimates [43].
Optimize acquisition for longer scans: For future studies, consider that longer scan times per participant (e.g., 30 minutes) can be more cost-effective than simply recruiting more participants for shorter scans. Longer scans provide more low-motion frames per subject, making each participant's data more robust and less likely to be entirely excluded [15].

Problem: Persistent Motion-FC Correlation After Denoising

Description: Even after implementing a standard denoising pipeline, you observe a strong spatial correlation between the group-level motion-FC effect and the average FC matrix.

Solution Steps:

Verify with quantitative metrics: This is a known issue. Confirm by calculating the spatial correlation (e.g., Spearman's ρ) between the motion-FC effect matrix and the average FC matrix. A strong negative correlation is typically observed [14].
Implement a multi-faceted denoising strategy: Combine several approaches for a more robust solution. In addition to motion parameter regression, consider:
- Volume Censoring: Identify and remove high-motion volumes [4] [61].
- ICA-based Denoising: Use independent component analysis to identify and remove motion-related components from the data [4].
- Include Physiological Regressors: Expand nuisance regressors to include signals from white matter and CSF (aCompCor) [61].
Evaluate improvement: Re-calculate the motion-FC correlation after applying the enhanced pipeline to ensure the artifact has been sufficiently reduced.

Experimental Protocols & Data

Protocol: Implementing the SHAMAN Method

This protocol outlines the steps to compute a Motion Impact Score using the SHAMAN framework [14].

Primary Materials:

Preprocessed resting-state fMRI data for all participants.
Corresponding phenotypic trait data for all participants.
Framewise displacement (FD) time series for all participants.

Procedure:

Data Preparation: For each participant, ensure the preprocessed fMRI data, trait data, and FD time series are loaded.
Timeseries Splitting: For each participant, split the entire resting-state fMRI timeseries into two halves: a "high-motion" half (containing the volumes with the highest FD values) and a "low-motion" half (containing the volumes with the lowest FD values).
Trait-FC Correlation Calculation: a. For each half (high- and low-motion) and for each participant, calculate a whole-brain FC matrix (e.g., using Pearson correlation between region time series). b. For each half independently, compute a group-level trait-FC effect. This is typically a matrix where each connection (edge) represents the correlation coefficient (across participants) between the trait and the FC strength for that connection.
Motion Impact Calculation: a. For each functional connection, calculate the difference in the trait-FC effect between the high-motion and low-motion halves (Effect_high - Effect_low). This is the raw motion impact for that connection. b. To derive a single Motion Impact Score for the trait, the individual connection-level differences are combined using non-parametric combining across all connections in the network [14].
Significance Testing: Determine the statistical significance of the global Motion Impact Score using permutation testing (e.g., by randomly shuffling the high/low motion labels many times and re-calculating the score to build a null distribution).
Directional Interpretation:
- Overestimation Score: A significant positive score, where the motion impact is in the same direction as the overall trait-FC effect.
- Underestimation Score: A significant negative score, where the motion impact is in the opposite direction of the overall trait-FC effect.

Quantitative Data on Motion and Censoring

Table 1: Impact of Censoring on Motion Overestimation in the ABCD Study (n=45 traits) [14]

Censoring Threshold (Framewise Displacement)	Percentage of Traits with Significant Motion Overestimation
No Censoring	42% (19/45)
FD < 0.2 mm	2% (1/45)

Table 2: Effectiveness of Denoising in Explaining Signal Variance [14]

Processing Stage	Percentage of Signal Variance Explained by Head Motion
Minimal Processing (Motion Correction Only)	73%
After ABCD-BIDS Denoising Pipeline	23%

Workflow & Conceptual Diagrams

Diagram 1: SHAMAN Method Workflow

Diagram 2: Censoring Threshold Trade-offs

The Scientist's Toolkit: Research Reagent Solutions

Table 3: Essential Materials and Tools for Motion Impact Analysis

Item Name & Function	Brief Description	Example Use in Context
Framewise Displacement (FD)Function: Quantifies volume-to-volume head motion.	A scalar measure of instantaneous head motion, computed from the derivatives of the six rigid-body realignment parameters (three translations and three rotations).	Used to identify and censor (remove) individual fMRI volumes that are contaminated by excessive motion. Also used to split data into high- and low-motion halves in SHAMAN [14].
SHAMAN Scripts / PipelineFunction: Computes trait-specific Motion Impact Scores.	A computational workflow, often implemented in R, Python, or MATLAB, that automates the steps of the Split Half Analysis of Motion Associated Networks.	The core tool for diagnosing whether a specific trait-FC finding is vulnerable to being a false positive or false negative due to head motion [14].
Real-time Motion Monitoring (e.g., FIRMM)Function: Provides immediate feedback on in-scanner head motion.	Software that displays head motion in real-time during the fMRI scan, allowing researchers to collect additional data if a participant has not yet reached a target amount of low-motion data.	Crucial for prospective data quality control, especially with high-motion populations (e.g., children), to maximize the chances of acquiring usable data [4].
Multiple Imputation ToolsFunction: Corrects for bias introduced by missing data.	A statistical technique that handles missing data by creating several complete datasets with imputed values, analyzing them separately, and combining the results.	Used to address the bias that occurs when participants with higher motion (and correlated traits) are excluded from analysis, helping to restore sample representativeness [43].
ICA Denoising Software (e.g., FSL's FIX)Function: Identifies and removes non-neural noise components.	Software that uses Independent Component Analysis to separate the fMRI signal into components, which can then be automatically or manually classified as signal or noise (e.g., motion, physiology).	Applied as part of an enhanced preprocessing pipeline to remove motion artifacts that persist after standard regression and censoring [4].

In brain-wide association studies (BWAS), functional connectivity (FC) refers to the statistical associations between time series of different brain regions, revealing intrinsic brain networks. A major confound in FC analysis is in-scanner head motion, which introduces systematic bias not completely removed by standard denoising algorithms [14]. Motion artifact typically decreases long-distance connectivity and increases short-range connectivity [14].

Framewise displacement (FD) is a common metric for quantifying head motion between consecutive volumes. Motion censoring (or "scrubbing") is a post-processing technique that removes high-motion volumes exceeding a specific FD threshold from analysis to reduce spurious findings [14]. Selecting an appropriate FD threshold is critical, as it balances the need to remove motion-contaminated data against preserving enough data for a reliable analysis [9].

Quantitative Comparison: Task vs. Resting-State fMRI

The table below summarizes key characteristics influencing motion censoring thresholds in task-based versus resting-state fMRI.

Characteristic	Task-Based fMRI	Resting-State fMRI
Primary Purpose	Identify brain regions involved in specific task performance [62]	Explore intrinsic functional segregation/specialization [62]
Typical Use in Prediction	Individual-level prediction of cognitive/behavioral traits [63] [64]	Individual-level prediction of traits (often outperformed by task-based FC) [63] [64]
Network Configuration	More efficient global information transmission; lower modularity [65]	Higher modularity; energy-saving mode [65]
Data Reliability	Improved reliability with longer scan times (>20 min optimal) [15]	Improved reliability with longer scan times (>20 min optimal) [15]
Major Challenge	Task-specific head motion patterns	High vulnerability to motion artifact due to unknown neural timing [14]

Recommended Experimental Protocols

Protocol 1: Establishing a Motion Censoring Pipeline

This generalized workflow is applicable to both task and resting-state data.

Protocol 2: Assessing Trait-Specific Motion Impact with SHAMAN

The Split Half Analysis of Motion Associated Networks (SHAMAN) method quantifies whether motion artifact is causing overestimation or underestimation of a specific trait-FC relationship [14].

Split Data: For each participant, split the preprocessed fMRI timeseries into high-motion and low-motion halves [14].
Calculate FC: Compute separate functional connectivity matrices for each half [14].
Trait-FC Effect: Measure the correlation between the trait and each FC edge in both halves [14].
Motion Impact Score:
- A significant difference in trait-FC effects between halves indicates motion impact.
- Overestimation Score: Motion impact direction aligns with the trait-FC effect.
- Underestimation Score: Motion impact direction is opposite the trait-FC effect [14].
Permutation Testing: Use non-parametric combining across connections to obtain a significant p-value for the motion impact score [14].

Frequently Asked Questions (FAQs)

Q1: What is a reasonable starting FD threshold for censoring a general adult population? For analyses where minimizing false positives is critical, a threshold of FD < 0.2 mm is recommended. Studies show this threshold reduces the percentage of traits with significant motion overestimation from 42% to just 2% [14]. For pediatric or high-motion cohorts, a slightly more liberal threshold (e.g., FD < 0.3 mm) may be necessary to retain a sufficient number of participants and volumes, while still effectively removing major artifacts [9].

Q2: Why might task-based and resting-state fMRI require different thresholding considerations? While the same FD threshold (e.g., 0.2 mm) is often effective for both [14], the underlying rationale differs. Resting-state fMRI is exceptionally vulnerable to motion artifacts because the timing of underlying neural processes is unknown, making it harder to distinguish from noise [14]. In task-fMRI, the predicted response to the task design can sometimes help isolate behaviorally relevant signals from noise [64]. Furthermore, task designs actively perturb brain systems, leading to network configurations with higher global efficiency, which may be differentially affected by motion compared to the more modular resting-state networks [65].

Q3: How does scan duration interact with motion censoring decisions? Longer scan times (e.g., at least 20-30 minutes) significantly improve data quality and phenotypic prediction accuracy for both resting and task fMRI [15]. With a longer acquisition, censoring high-motion volumes has a less detrimental impact on temporal degrees of freedom and functional connectivity reliability. For short scans (<10 minutes), censoring can consume a large portion of your data, making a longer initial scan duration a powerful strategy to mitigate motion-related problems.

Q4: After censoring, how can I verify that my trait of interest is not confounded by motion? Relying on global mean FD alone is insufficient. It is strongly recommended to use a method like SHAMAN to calculate a trait-specific motion impact score [14]. This analysis will tell you if the relationship between your specific trait and brain connectivity is likely being over- or under-estimated due to residual motion artifact, even after censoring and denoising.

The Scientist's Toolkit

Research Reagent / Tool	Function	Example Use / Note
Framewise Displacement (FD)	Quantifies head motion between consecutive brain volumes.	Primary metric for determining volume censoring [14].
Volume Censoring (Scrubbing)	Removes motion-corrupted volumes exceeding an FD threshold.	Critical for reducing spurious findings; threshold choice is key [14] [9].
SHAMAN	Assigns a trait-specific motion impact score to FC results.	Determines if motion causes over/underestimation of a specific brain-behavior link [14].
ICA-Based Denoising	Identifies and removes noise components via machine learning.	Often used in conjunction with censoring (e.g., in ABCD-BIDS pipeline) [14] [9].
Two-Stage Sparse Representation	A big-data analytics strategy for characterizing fMRI signals.	Can effectively handle noise and variability across subjects [62].
Transparent Thresholding	Visualization method showing both supra- and sub-threshold results.	Improves interpretation and reproducibility by providing full context [66].

Comparative Analysis of Censoring Versus Alternative Motion Correction Methods

Head motion is the largest source of artifact in functional magnetic resonance imaging (fMRI) signals, significantly impacting data quality and the validity of scientific conclusions [14]. In resting-state functional connectivity (FC) studies, motion can introduce systematic biases, spuriously decreasing long-distance connectivity while increasing short-range connectivity [14]. These effects are particularly problematic when studying populations that tend to move more, such as children, older adults, or individuals with certain neurological or psychiatric conditions, potentially leading to false positive results [14] [24].

To mitigate these artifacts, numerous motion correction strategies have been developed. This technical guide provides a comparative analysis of these methods, with a specific focus on the role of frame censoring (also known as "scrubbing") and its optimization within the preprocessing pipeline. The content is framed to support research determining the optimal framewise displacement (FD) threshold for censoring, a critical parameter for balancing artifact removal and data retention.

Motion Correction Methods at a Glance

Multiple motion correction approaches are available, each with distinct mechanisms and applications. The following table summarizes the most common techniques evaluated in contemporary studies.

Table 1: Common Motion Correction Methods in fMRI

Method	Description	Key Mechanism
Frame Censoring (SCR)	Exclusion of high-motion volumes from analysis [3] [11].	Identifies "bad" scans based on metrics like Framewise Displacement (FD) or DVARS and removes their influence via nuisance regressors or direct exclusion [3].
Realignment Parameters (RP6/RP24)	Nuisance regression of head motion estimates [3] [24].	Regresses out the 6 rigid-body head motion parameters (translation, rotation) and their expansions (e.g., derivatives, squared terms) [3].
Wavelet Despiking (WDS)	A data-driven approach to identify and remove artifacts [3].	Uses wavelet decomposition to exploit the non-stationarity of motion artifacts across temporal scales [3].
Robust Weighted Least Squares (rWLS)	An iterative modeling procedure to downweight high-motion frames [3].	Uses a two-pass modeling procedure to produce nuisance regressors that downweight frames with high variance [3].
ICA-AROMA	A data-driven approach to identify and remove motion-related components [24].	Uses Independent Component Analysis (ICA) to automatically identify and remove motion-related components from the data [24].
aCompCor	A noise reduction strategy based on physiological signals [24].	Derives noise regressors from the principal components of the signal in white matter and cerebrospinal fluid (CSF) regions [24].
Global Signal Regression (GSR)	Regression of the global mean signal from the data [24].	Removes the average signal across the entire brain, which can contain substantial motion-related variance [24].

Quantitative Efficacy of Different Pipelines

Selecting a motion correction strategy involves trade-offs between efficacy, data retention, and reliability. The table below synthesizes findings from large-scale evaluations comparing different preprocessing pipelines.

Table 2: Comparative Performance of Motion Correction Pipelines

Pipeline	Residual Motion Artifact	Data Loss / DOF Loss	Test-Retest Reliability	Sensitivity to Group Differences
RP6/RP24 alone	Not sufficient for complete motion removal [24].	Low data loss [24].	--	Can inflate case-control differences (e.g., in schizophrenia) [24].
Volume Censoring (SCR)	Consistently ranks among the best at minimizing motion artifacts [24].	High data loss, especially in high-motion subjects; reduces degrees of freedom (DOF) [24].	Good, but benefits partly from excluding high-motion individuals [24].	Reduces spurious group differences linked to motion [24].
ICA-AROMA	Very effective, though slightly less than censoring [24].	Lower data loss compared to censoring [24].	Performs well at a relatively low cost in terms of data loss [24].	--
aCompCor	May only be viable in low-motion data [24].	--	--	--
Pipelines with GSR	Improved performance on most benchmarks, but can exacerbate distance-dependence of motion-FC correlations [24].	--	--	--

The Special Case of Frame Censoring: Threshold Selection and Impact

Frame censoring is a powerful but complex tool. Its effectiveness and impact on statistical power are highly dependent on the chosen FD threshold.

Table 3: Impact of Censoring Thresholds on Trait-FC Relationships

Censoring Threshold (FD)	Overestimation of Trait-FC Effects	Underestimation of Trait-FC Effects	Note
No Censoring	42% (19/45) of traits showed significant overestimation [14].	38% (17/45) of traits showed significant underestimation [14].	After standard denoising (ABCD-BIDS) without censoring [14].
< 0.2 mm	Reduced to 2% (1/45) of traits [14].	Did not decrease the number of traits with significant underestimation [14].	Very stringent threshold; effective against overestimation but not underestimation [14].

Experimental Protocols for Method Evaluation

For researchers seeking to validate or compare these methods, here are detailed protocols based on published studies.

Protocol: Multi-Dataset Evaluation of Frame Censoring

This protocol is adapted from a study that compared frame censoring to other common techniques across eight public datasets [3].

Datasets: Analyze multiple publicly available datasets (e.g., from OpenNeuro) representing different tasks, age groups, and acquisition parameters.
Motion Correction Approaches: Test a minimum of 7 approaches: RP6, RP24, Wavelet Despiking (WDS), Robust Weighted Least Squares (rWLS), untrained ICA (uICA), and frame censoring based on FD and DVARS.
Performance Metrics: Quantify performance using:
- Maximum group t-statistic (whole-brain and within a task-relevant ROI).
- Mean parameter estimates within the ROI.
- Split-half reliability of subject-level parametric maps.
- Spatial overlap (Dice similarity) of thresholded group-level statistical maps.
Key Finding: No single approach consistently outperformed all others across all datasets and tasks. The optimal choice depends on both the dataset and the outcome metric of interest [3].

Protocol: Evaluating Efficacy with the SHAMAN Method

The Split Half Analysis of Motion Associated Networks (SHAMAN) is a novel method to calculate a trait-specific motion impact score [14].

Principle: Capitalizes on the fact that traits (e.g., intelligence) are stable during a scan, while motion is a time-varying state. It measures differences in the correlation structure between high-motion and low-motion halves of a participant's timeseries.
Procedure:
- Split each participant's denoised (but not censored) fMRI timeseries into high-motion and low-motion halves based on FD.
- Calculate the trait-FC effect separately for each half.
- Compute a motion impact score based on the difference between the two halves.
- A score aligned with the trait-FC effect indicates overestimation; a score in the opposite direction indicates underestimation.
- Use permutation testing to determine significance.
Application: This method can be used to determine how a specific trait-FC finding is impacted by motion and to evaluate the effectiveness of different censoring thresholds at mitigating this impact.

Decision Workflow for Motion Correction Strategy

The following diagram outlines a logical workflow for selecting and evaluating a motion correction strategy, incorporating the choice of a censoring threshold.

Table 4: Key Software and Metric Tools for Motion Correction Research

Tool / Resource	Type	Function in Motion Correction Research
fMRIPrep [67] [68]	Software Pipeline	A robust, standardized tool for preprocessing anatomical and functional MRI data, which includes estimation of motion parameters. Provides a solid foundation for subsequent motion correction.
AFNI [69] [11]	Software Suite	Provides a wide array of commands for fMRI analysis, including 3dDespike, nuisance regression, and volume censoring functionality.
Framewise Displacement (FD) [3] [11] [14]	Metric	A scalar quantity that summarizes head movement between consecutive volumes. It is the primary metric used to determine thresholds for frame censoring.
DVARS [3]	Metric	Measures the rate of change of the BOLD signal across the entire brain at each frame. Used alongside FD to identify motion-contaminated volumes.
ICA-AROMA [24]	Software Tool/Algorithm	A data-driven method for automatically identifying and removing motion-related artifacts from fMRI data using Independent Component Analysis.
SHAMAN [14]	Method/Algorithm	A novel method for computing a trait-specific motion impact score, helping researchers determine if their findings are biased by motion overestimation or underestimation.

Frequently Asked Questions (FAQs) & Troubleshooting

Q1: I've performed nuisance regression (RP12/RP24). Is that sufficient to remove motion artifacts? A: No. Multiple studies have conclusively shown that simple linear regression of motion parameters is not sufficient to fully remove head motion artefacts [24]. Associations between head motion and functional connectivity often persist after this step, necessitating additional strategies like frame censoring or ICA-AROMA [11] [24].

Q2: How do I choose the optimal framewise displacement (FD) threshold for censoring? A: There is no universal "best" threshold, as it involves a trade-off. A more stringent threshold (e.g., FD < 0.2 mm) is highly effective at eliminating motion-induced overestimation of trait-FC effects but may not address underestimation and can lead to significant data loss [14]. A systematic approach is recommended:

Test multiple thresholds (e.g., 0.1, 0.2, 0.3, 0.5 mm).
Use metrics like the SHAMAN impact score to evaluate how each threshold affects your specific trait-of-interest [14].
Report the threshold used and the amount of data censored per subject.

Q3: Does frame censoring introduce bias by excluding certain participants? A: Yes, this is a critical consideration. Volume censoring performs well in part because it can lead to the exclusion of high-motion individuals from analysis [24]. If the trait you are studying is correlated with motion (e.g., attention-related disorders), this can systematically bias your sample. It is crucial to report motion differences between groups and to consider supplementing censoring with other methods that are less prone to this exclusion bias.

Q4: For my large-scale dataset (e.g., UK Biobank, ABCD), should I re-run preprocessing with a different motion correction strategy? A: This can be computationally prohibitive. A practical alternative is to use the provided minimally preprocessed data and apply a post-hoc method like frame censoring, which has been shown to effectively reduce residual motion artifact even in data that has undergone prior denoising [14]. The key is to apply censoring consistently across your sample and account for the resulting variation in data quality in your models.

Q5: Is global signal regression (GSR) recommended for motion correction? A: GSR is a double-edged sword. Evidence shows it can improve the performance of many pipelines on motion correction benchmarks [24]. However, a major drawback is that it can exacerbate the distance-dependence of correlations between motion and functional connectivity, potentially introducing its own biases [24]. Its use should be justified based on the research question and the specific pipeline.

Functional connectivity (FC) analysis examines the temporal similarity between blood oxygenation level dependent (BOLD) signals in different brain regions to infer synchronous neural activity. Unlike structural connectivity, which represents anatomical connections, FC is a statistical construct with no straightforward "ground truth," making validation metrics essential for ensuring methodological rigor [70] [71]. This technical support guide addresses the critical challenge of validating specificity (the ability to correctly identify true negative connections) and sensitivity (the ability to detect true positive connections) in FC analyses, with particular emphasis on optimizing framewise displacement thresholds for volume censoring in research applications.

The validation process is complicated by multiple factors: FC measurements are contaminated by various noise sources including head motion, cardiac cycles, and respiratory variations [71] [72]. These confounds can introduce spurious correlations or mask true relationships, potentially leading to invalid conclusions about brain organization and function. Furthermore, the choice of pairwise interaction statistics significantly impacts resulting FC network topology, with different metrics displaying varying capacities to differentiate individuals and predict behavioral measures [70].

Key Performance Metrics for FC Validation

Core Validation Metrics Table

Metric Category	Specific Metrics	Interpretation	Optimal Values
Motion Mitigation	Frame-wise displacement (FD) correlation with FC	Lower values indicate better motion correction	r < 0.1 after correction [11]
	Quality control metrics (e.g., DVARS)	Quantifies signal changes related to motion	Pipeline-dependent [72]
Structure-Function Coupling	Correlation with diffusion MRI structural connectivity	Higher values indicate better biological plausibility	R² up to 0.25, varies by FC method [70]
Distance Dependence	Correlation between physical distance and FC strength	Moderate inverse relationship expected	\|r\| ~0.2-0.3 for many metrics [70]
Biological Alignment	Correspondence with neurotransmitter receptor similarity	Higher values indicate better neurobiological validity	Varies by FC method; precision statistics often superior [70]
	Alignment with electrophysiological connectivity (MEG)	Links BOLD to direct neural activity measures	Precision methods show strong correspondence [70]
Individual Differences	Fingerprinting accuracy	Capacity to identify individuals from FC patterns	Varies substantially across FC methods [70]
	Brain-behavior prediction	Cross-validated R² for behavioral prediction	Often modest (r² < 0.1); enhanced with optimal pipelines [73] [72]

Advanced Performance Metrics

Metric Category	Specific Metrics	Application Context
Network Topology	Hub identification consistency	Evaluates whether FC methods identify biologically plausible hubs [70]
	Weight-distance tradeoffs	Tests fundamental brain organization principles [70]
Clinical Sensitivity	Effect size for group discrimination	For case-control studies (e.g., schizophrenia) [74]
	Classification performance (AUC)	Diagnostic accuracy for clinical applications [74]
Pipeline Reliability	Between-sample reproducibility	Consistency across different datasets [72]
	Denoising efficacy vs. signal retention	Tradeoff between noise removal and biological signal preservation [72]

Experimental Protocols for Validation

Protocol 1: Motion Correction Efficacy Assessment

Purpose: To evaluate the effectiveness of framewise displacement thresholds in mitigating motion-related artifacts while preserving neural signals.

Materials:

Resting-state fMRI dataset with known motion characteristics
Parallel physiological recordings (cardiac, respiratory) if available
Multiple denoising pipelines for comparison

Procedure:

Calculate framewise displacement (FD) for all volumes
Apply varying FD thresholds (e.g., 0.1mm, 0.2mm, 0.3mm, 0.5mm) for volume censoring
For each threshold, compute FD-FC correlation (association between head motion and functional connectivity)
Calculate frame retention rates for each threshold
Assess preservation of neurobiological signals via gestational age prediction or other biologically-anchored measures [11]
Compare pipelines using multiple quality metrics simultaneously [72]

Validation: Optimal thresholds demonstrate reduced FD-FC correlation (r < 0.1) while maintaining predictive power for neurobiological variables. In fetal imaging, censoring at 1.5mm improved gestational age prediction accuracy from 44.6% to 55.2% compared to no censoring [11].

Protocol 2: Pairwise Statistic Benchmarking

Purpose: To evaluate how different FC estimation methods affect network properties and sensitivity to biological signals.

Materials:

Resting-state fMRI time series from a well-characterized sample (e.g., HCP data)
Implementation of multiple pairwise statistics (covariance, precision, distance correlation, etc.)
External validation data (structural connectivity, gene expression, behavioral measures)

Procedure:

Generate 239 FC matrices using different pairwise statistics from the pyspi package [70]
For each matrix, compute:
- Edge weight distributions
- Weighted degree distributions (hub identification)
- Structure-function coupling (R² with diffusion MRI)
- Distance dependence (correlation with physical distance)
Evaluate individual fingerprinting capacity via identification accuracy
Assess brain-behavior prediction using cross-validated models
Compare alignment with multimodal neurophysiological networks

Validation: Superior methods demonstrate strong structure-function coupling, plausible hub identification, and enhanced capacity for individual differentiation and behavior prediction [70].

Troubleshooting Guides

FAQ: Addressing Common Validation Challenges

Q: Why do my motion-corrected FC matrices still show associations with head motion?

A: Persistent motion-FC correlations after nuisance regression indicate incomplete motion mitigation. Solution: Implement volume censoring (removing high-motion frames) in addition to regression. Systematically evaluate FD thresholds (0.2-0.5mm typically) to balance motion removal with data retention. In fetal imaging, combining regression with censoring significantly reduced lingering motion effects [11].

Q: How can I determine whether low brain-behavior correlations reflect weak biological relationships or methodological limitations?

A: This is a fundamental challenge in FC research. Conduct comprehensive pipeline validation:

Test multiple denoising approaches (ICA-FIX, GSR, DiCER, etc.)
Evaluate reliability of both FC and behavioral measures
Assess motion mitigation efficacy using quality metrics
Compare with structure-function coupling as a biological plausibility check Even with optimal pipelines, most brain-behavior correlations are modest, requiring large samples for detection [73] [72].

Q: Which pairwise correlation metric should I use for FC analysis?

A: The optimal choice depends on your research question and desired network properties:

Covariance-based methods: Good default choice with established properties
Precision-based methods: Superior for structure-function coupling and biological alignment
Distance correlation: Captures nonlinear relationships
Spectral measures: Moderate correlation with most other measures Benchmarking studies recommend tailoring the statistic to specific neurophysiological mechanisms rather than using Pearson correlation by default [70].

Q: How can I enhance the sensitivity of FC measures to clinical group differences?

A: Implement data-driven subnetwork identification:

Use ICA to identify connectivity components that maximally differentiate groups
Focus on subnetworks with biological plausibility (e.g., visual, attention, somatomotor networks in schizophrenia)
Calculate graph theory metrics specifically on these sensitive subnetworks rather than whole-brain networks This approach identified schizophrenia-related alterations with higher sensitivity than whole-brain analysis [74].

Q: What is the evidence for using task-based versus resting-state FC for behavior prediction?

A: Current evidence is mixed:

Some studies report advantages for task-based FC, particularly for cognitive domains
Working memory performance prediction improved with task-based versus resting-state FC
For social and emotion domains, task advantages are less clear
Whole-brain FC generally outperforms network-specific FC for behavior prediction Consider acquiring task-based data if predicting cognitive performance, but note that effects are often modest regardless of approach [73].

Signaling Pathways and Workflows

Functional Connectivity Validation Framework

Denoising Pipeline Comparison Workflow

Research Reagent Solutions

Resource Category	Specific Tools/Datasets	Purpose	Key Features
Analysis Packages	pyspi (Python)	Pairwise statistic calculation	Implements 239 interaction statistics for comprehensive benchmarking [70]
	FSL ICA-FIX	Artifact removal	Identifies and removes motion-related components via machine learning [72]
	fMRIprep	Automated preprocessing	Standardized pipeline for consistent data preparation [72]
Reference Datasets	Human Connectome Project (HCP)	Method benchmarking	High-quality multimodal data for validation [70] [72]
	Consortium for Neuropsychiatric Phenomics (CNP)	Pipeline evaluation	Includes behavioral measures for validation [72]
	Genomics Superstruct Project (GSP)	Large-sample validation	Enables assessment of reliability and generalizability [72]
Validation Metrics	Frame-wise displacement (FD)	Motion quantification	Standardized measure for censoring decisions [11]
	Structure-function coupling	Biological plausibility	Compares FC with diffusion MRI tractography [70]
	Multimodal alignment	Neurobiological validity	Assesses correspondence with gene expression, receptor distribution [70]
Quality Assessment	DVARS	Signal change quantification	Measures rate of change of BOLD signal across frames [72]
	Fingerprinting accuracy	Individual differentiation	Tests ability to identify individuals from FC patterns [70]
	Cross-validated prediction	Behavior correlation assessment	Estimates generalizable brain-behavior relationships [73]

Conclusion

Selecting an optimal FD threshold for censoring requires careful consideration of the specific research context, with 0.2mm emerging as an effective benchmark that significantly reduces motion overestimation artifacts while maintaining data integrity. However, this threshold must be balanced against the risk of introducing bias when studying motion-correlated traits and populations. Future directions should focus on developing adaptive thresholding methods that account for individual motion patterns, integrating real-time motion mitigation strategies to reduce censoring needs, and establishing standardized reporting practices for censoring decisions in publications. For drug development and clinical research, these optimizations are crucial for ensuring that functional connectivity biomarkers are reliable and reproducible across diverse populations.