Introduction

When axons become damaged, cytoskeletal proteins known as neurofilaments are released into the extracellular space, followed by the cerebrospinal fluid (CSF), with marked transmigration into the blood at a lower concentration [1]. Notably, among biomarkers for neurodegenerative disease, there is a need for minimally invasive, readily available, cost-effective biomarkers as current methods rely on measures derived from CSF and neuroimaging. Recently, sensitive methods were developed to measure blood-levels of neurofilament light (NfL) [2]. This methodological development for assaying plasma NfL has stimulated potential opportunities for large-scale applications in clinical practice and in randomized clinical trials as a method for identifying patients at risk for dementias, including Alzheimerā€™s disease (AD) [3]. Thus far, NfL reflects sub-cortical large-caliber axonal degeneration [4, 5]. Plasma NfL levels correlate strongly with CSF NfL levels [3, 6], adding to its clinical utility in differential diagnoses for dementias. While most studies have focused on plasma NfLā€™s positive association with AD, including at earlier stages [7,8,9,10], as well as other neurodegenerative diseases [11,12,13,14]. Thus, plasma NfL is a marker of non-specific neurodegeneration.

To date, only few studies have been conducted thus far reporting its predictive value for future cognitive decline and brain aging [15,16,17,18,19,20,21,22], and none have tested associations differentially across racial groups. Furthermore, few studies have examined how longitudinal changes in plasma NfL are related to change in cognition over time (e.g., [21]). Thus, our study (i) examined baseline NfL in relation to baseline and change in cognitive performance over time; (ii) examined change in NfL in relation to cognitive performance over time; (iii) examined baseline and change in NfL in relation to follow-up cognitive performance; and, (iv) tested racial differentials in those main associations; as well as exploring those associations across sex, age group, and poverty status.

Materials and methods

Database

We selected a sample from the Healthy Aging in Neighborhoods of Diversity across the Life Span (HANDLS) study. Since 2004, HANDLS is an on-going prospective cohort study of socioeconomically diverse White and African American adult women and men residing in Baltimore, MD. Initial data (visit 1) were collected between 2004 and 2009, in two phases. Phase I consisted of a home visit, with information collected for screening, recruitment, and a household in-person interview that included the first 24ā€‰h dietary recall of that visit. Phase II (v1) was performed as an in-person complete physical health examination including a cognitive test battery inside Medical Research Vehicles (MRV) and included a second 24ā€‰h dietary recall. Participants were invited for follow-up in-person visits (v2) between 2009 and 2013, which applied a similar protocol as v1 (phase II). Fasting blood samples were obtained from consenting participants in both in-person examinations. All participants provided written informed consent. The Institutional Review Board of the National Institutes of Health, National Institute of Environmental Health Sciences approved the HANDLS study protocol.

Study sample

In our present study, up to two repeats on cognitive tests were available from v1 or v2. Exposure data on plasma NfL concentrations were available at both visits for a sub-sample of Whites and African Americans after excluding participants who did not survive within a year of follow-up or who did not have NfL data at v2. As shown in the study design flowchart (Fig. 1), among 3,720 initially recruited HANDLS participants, Nā€‰=ā€‰674 had complete v1 and v2 data on plasma NfL. Of those participants, Nā€‰=ā€‰625 had data on v1 or v2 for all 11 cognitive test scores, with an average number of observations/participant kā€‰=ā€‰1.9āˆ’2.0, indicating 0ā€“5% missingness on cognitive test performance outcomes. A sub-set of those participants had complete and credible v2 cognitive performance data, with somewhat variable sample sizes. This sub-set was also analyzed, thus excluding those with unavailable or non-credible v2 cognitive performance on each test. Meanā€‰Ā±ā€‰SD follow-up time for the final analytic sample (nā€‰=ā€‰625 participants) was 4.30ā€‰Ā±ā€‰0.95ā€‰y. Method S1 shows a detailed description for sample selection with respect to the NfL exposure. Compared to the initial sample with incomplete data for our analysis, the final sample had a lower proportion of individuals living below poverty (27.8% vs. 43.9%, pā€‰<ā€‰0.001, Ļ‡2 test), and a reduced proportion of men (41.6% vs. 45.9%, pā€‰=ā€‰0.048, Ļ‡2 test). A similar pattern was observed when the sample with v1 NfL (Nā€‰=ā€‰674) was compared with the sample without this data, notwithstanding other exclusions.

Fig. 1: Participant flowchart.
figure 1

Abbreviations: HANDLS Healthy Aging in Neighborhoods of Diversity across the Life Span; k = # of observations/participant; NfL neurofilament light; v1 visit 1; v2 visit 2.

Cognitive assessment

HANDLS clinical staff examined cognitive performance with a battery of tests which included the Mini-Mental State Examination (MMSE), the California Verbal Learning Test (CVLT) immediate (List A) and Delayed Free Recall (DFR), the Benton Visual Retention Test (BVRT, # of errors), Brief Test of Attention (BTA), Animal Fluency test (AF), the Digit Span Forward and Backwards tests (DS-F and DS-B), the Clock Drawing Test (CDT), Trailmaking test parts A and B (TRAILS A and B, in seconds), (described in detail in Method S2). Cognitive domains spanned global mental status, verbal memory, verbal fluency, attention, visual memory, visuo-spatial abilities, and executive function, which includes working memory. A total of 11 cognitive test scores were computed from these tests. Total MMSE was normalized using previously described methods [23]; while Trails A and B scores (in seconds) were Loge transformed to achieve pseudo-normality. With the exception of BVRT, Trails A and B, all test scores were in the direction of higher values reflecting better performance at v1 or over time.

Plasma neurofilament light (NfL)

Fasting, morning plasma samples were collected into EDTA blood collection tubes. Tubes were centrifuged at 600Ɨg for 15ā€‰min and the buffy coat was removed. These steps were repeated two times and the samples were visually examined for hemolysis. Plasma was aliquoted and stored at āˆ’80ā€‰Ā°C until use. Plasma NfL levels were measured by Quanterix (Billerica, MA, USA) using the SimoaĀ® NF-light Advantage Kit following the kit instruction. Longitudinal samples for each person were run on the same plate and the proportion of people in each demographic group (race/sex/poverty) were balanced across all plates. Plasma samples were diluted 1:4 and concentrations reflect the dilution correction. Pooled plasma samples from two individuals were run in duplicate on all plates. These duplicate pooled plasma samples were used to calculate both the within plate (intra-assay) and between plates (inter-assay) coefficient of variation (CV). The average intra-assay CV was 4.5% and the average inter-assay CV was 7%. The analytical limit of detection (LOD) was calculated as 2.5 standard deviations above the background (mean of calibrator blanks). For the analytical lower limit of quantification (LLOQ), triplicate measurements of serially diluted calibrator were run as unknowns and read on the standard calibration curve. The LLOQ was determined as the lowest dilution with a pooled CVā€‰ā‰¤ā€‰20% and a sample read back recovery between 80 and 120% of the expected concentration. The analytical upper limit of detection (ULOQ) was the highest concentration of the calibrator curve. Analytical LOD, LLOQ, and ULOD values were converted to functional values by multiplying by the dilution factor (4Ɨ) to enable direct comparison to the sample results. The functional LOD and the functional LLOQ were 0.152 and 0.696ā€‰pg/ml, respectively. The functional ULOD was 1872 pg/ml.

Covariates

Several covariates were considered in this study as potential confounders, given their previously shown association with cognitive performance or decline, which may also be associated with NfL exposures. These included v1 age (continuous, years), sex (male, female), race (White, African American), poverty status (below vs. above 125% the federal poverty line), educational attainment (less than high school, high school, more than high school), and literacy (Wide Range Achievement Test, third edition [WRAT-3]). Age at v2 was also used to compute time between v1 and v2, a measure relevant to our main models. Poverty status was operationalized using the 2004 US Census Bureau poverty thresholds [24] based on household income and total family size (including children <18 years). Furthermore, lifestyle and health-related factors were among those considered as potential confounders, given their potential impact on both exposures and outcomes. Those factors included current smoking status (0ā€‰=ā€‰no vs. 1ā€‰=ā€‰yes), illicit drug use (0ā€‰=ā€‰no vs. 1ā€‰=ā€‰yes, using any of marijuana, opiates, and cocaine), body mass index (BMI, weight/height2, kgā€‰māˆ’2, continuous), self-rated health status categorized as 0=poor/average (referent), 1ā€‰=ā€‰good and 2ā€‰=ā€‰very good/excellent, the Healthy Eating Index 2010 (HEI-2010) [25], measuring overall diet quality based on food and macronutrient-related guidelines for Americans, total energy intake (kcal/d), and the 20-item CES-D total score for depressive symptoms. Moreover, an unweighted co-morbidity index was also accounted for. This index was composed of hypertension (0ā€‰=ā€‰no, 1ā€‰=ā€‰yes), diabetes (0ā€‰=ā€‰diabetic, 1ā€‰=ā€‰pre-diabetic, 2ā€‰=ā€‰diabetic) and dyslipidemia (or statin use) (0ā€‰=ā€‰no, 1ā€‰=ā€‰yes), and self-reported history of any of several cardiovascular disease conditions (0ā€‰=ā€‰no, 1ā€‰=ā€‰yes). The latter component screened for the occurrence of several conditions, namely atrial fibrillation, angina, coronary artery disease, congestive heart failure, and myocardial infarction. Consequently, the co-morbidity index could potentially range between 0 and 5.

Statistical methods

Stata release 16 [26] was used to conduct all analyses. We first described the analytic sampleā€™s characteristics at baseline using means and proportions with bivariate linear, logistic, and multinomial logit models to examine racial differences in continuous, binary, and categorical multi-level covariates, respectively. We then adjusted those models for age, sex, and poverty status to determine whether racial differences remained statistically significant. Second, for testing our main hypotheses, a series of linear models were conducted (mixed-effects and ordinary least-square, OLS) (Method S3 for mixed-effects models). Separate analyses for 11 cognitive test scores were conducted, adjusting for two sets of covariates: Model 1: only socio-demographic variables: age at v1, sex, race, and poverty status; Model 2: socio-demographicsā€‰+ā€‰all other lifestyle and health-related covariates. To reduce missing data due to the addition of covariates into different models, given that each covariate had, individually <5% missing on average, we ensured sample sizes were constant between reduced and fully adjusted models by conducting multiple imputations (five imputations, ten iterations), using the chained equations methodology. All covariates were used simultaneously during this estimation process, similar to previous studies [27, 28] and continuous covariates were centered around their means. Thus, for mixed-effects linear regression models, we applied Models 1 and 2 to two exposures (NfL and Ī“NfL), 11 cognitive test scores with up to two repeats (effect of exposures on v1 cognitive performance (CPv1) and cognitive performance change over time (Ī“CP)), one main stratifying variable (race), and several exploratory stratifying variables (sex, age group, and poverty status). NfL was Loge transformed in all these analyses, and the annualized changes in the Loge transformed NfL between v1 and v2 were used to operationalize Ī“NfL [i.e., Ī“NfLā€‰=ā€‰(Loge(NfLv2)ā€‰āˆ’ā€‰Loge(NfLv1)/(Agev2ā€‰āˆ’ā€‰Agev1)], using complete case analysis. Z-scoring for exposures was done using the final eligible sample (Nā€‰=ā€‰625). These two exposures were constructed in a similar way in other studies (e.g., [21]). Racial differences in the association between NfL exposures and cognitive performance at v1 was tested using NfLā€‰Ć—ā€‰Race and Ī“NfLā€‰Ć—ā€‰Race interaction terms in separate models, while that of the association between NfL exposures and cognitive change was carried out by testing the NfL/Ī“NfLā€‰Ć—ā€‰TIMEā€‰Ć—ā€‰Race term in the same model. Following a similar approach but with a set of OLS linear regression models, race-specific associations of v1 NfL and Ī“NfL with v2 cognitive performance (CPv2) as an outcome of interest, were examined, while additionally adjusting models with the time of follow-up (years) between v1 and v2. Racial differences were also tested using two-way interaction terms (NfLā€‰Ć—ā€‰Race) in unstratified models, as were differences by age group, sex, and poverty status.

In all models, sample selectivity due to missing exposure and outcome data, relative to the initially recruited sample, was adjusted for using a two-stage Heckman selection strategy. Thus, we first predicted an indicator of selection with socio-demographic factors, namely, v1 age, race, sex, and poverty status using a probit regression model, which yielded an inverse mills ratio (IMR), a function of the probability of being selected given those socio-demographic factors. At a second stage, we estimated our multiple mixed-effects and OLS linear regression models adjusted for the IMR in addition to the aforementioned covariates [29].

This study set the Type I error rate a priori for main and interactive effects before correction for multiple testing to 0.05 and 0.10, respectively [30]. We accounted for outcome multiplicity (i.e., 11 cognitive test scores) using the approach of familywise Bonferroni correction [31], specifically for Model 1. Subsequently, the full model (Model 2) was considered a sensitivity model in which potentially confounding and/or mediating factors were included. In addition, a reduced version of Model 2 (Model 3) was tested, whereby only covariates, aside from those included in Model 1, shown to be associated with each of the two exposures were included. This model was only conducted as a sensitivity analysis. Therefore, we adjusted significance levels for main effects to pā€‰<ā€‰0.00455 (0.05/11), and for two-way interaction terms to 0.10/11ā€‰=ā€‰0.00910, similar to previous work [32]. Moreover, q-values (false discovery rates) were also computed as an alternative means to correct for multiple testing in Model 1, accounting for multiplicity in cognitive tests only [33, 34]. Q-valuesā€‰<ā€‰0.05 were used for statistical significance for main effects (e.g., effect of NfLv1), while 0.05ā€‰ā‰¤ā€‰q-valuesā€‰<ā€‰0.10 were considered as significant for two-way interaction terms (e.g., effect of NfLv1ā€‰Ć—ā€‰TIME). In our exploratory stratified analysis, all main hypotheses were tested across sex, age group (ā‰¤50ā€‰y, >50ā€‰y, as 50ā€‰y was the approximate median age) and poverty status (above vs. below poverty), separately, using the same modeling approach; and only familywise Bonferroni correction was applied to this part of the analysis (Model 1). Main findings were illustrated using predictive margins (with estimated 95% CI) of outcomes across time, and by exposure, overall or stratified by race and/or the other socio-demographic factors, using a specific mixed-effects or OLS linear regression model. Data analysis code in parts or in full can be made available upon request to the corresponding author.

Results

Overall, and based on Table 1, participants were ~48 years old at initial testing; African Americans were significantly older than Whites (48.7 vs. 47.3, pā€‰<ā€‰0.05). A significantly higher proportion of Whites than African Americans had <HS education (7.9% vs. 3.7%). Although there were no race differences in poverty status, mean literacy was significantly higher among Whites. Loge transformed NfLv1 plasma concentration was significantly higher among Whites compared with African Americans. However, there were no significant differences between races in the annualized rate of change values of NfL (delta NfL; Ī“NfL). Current drug use was higher among African Americans; CES-D total score was higher among Whites. Although the co-morbidity index did not differ by race, dyslipidemia was more prevalent among Whites and hypertension was more prevalent among African Americans. In this select sample, Whites performed better than African Americans on most cognitive tests at v1. Whites had a greater rate of decline on CVLT-List A and a smaller rate of decline on the BVRT than African Americans.

Table 1 Study sample characteristics, overall and by race in the final analytic sample with imputed covariates (Nā€‰=ā€‰625), HANDLS 2004ā€“2013a.

Our main hypotheses of associations between plasma NfL exposures and time-dependent cognitive outcomes were examined by mixed-effects and OLS regression models (Tables 2, 3) and are summarized in Fig. S1. Our exploratory analyses by age group, sex, and poverty status are presented in Tables S1āˆ’S3. Over a mean follow-up of 4.3 years, no association retained statistical significance upon correction for multiple testing in the total sample. However, we found initial NfL (i.e., NfLv1) was associated with faster decline on normalized mental status scores in Whites only (Ī“MMSEnorm:: Ī³11ā€‰=ā€‰āˆ’0.661ā€‰Ā±ā€‰0.252, Pā€‰=ā€‰0.0085, qā€‰=ā€‰0.094, reduced model), an association that retained significance in the fully adjusted model 2. This association (NfLv1 vs. decline in performance) was also found in those >50 years of age (Ī“MMSEnorm: Ī³11ā€‰=ā€‰āˆ’0.705ā€‰Ā±ā€‰0.242, Pā€‰=ā€‰0.004, reduced model); (Tables 2 and S2). Annualized increase in NfL was associated with greater decline in verbal fluency in men (Ī“AF: Ī³11ā€‰=ā€‰āˆ’0.181ā€‰Ā±ā€‰0.058, Pā€‰=ā€‰0.002, full model); (Table S1). In other exploratory analyses (Tables S1āˆ’S3), annualized increase in NfL was associated with slower decline in verbal memory among individuals living above poverty (Ī“CVLT-DFR:ā€‰+0.104ā€‰Ā±ā€‰0.036, Pā€‰=ā€‰0.004, reduced model), while, in the older group (>50 years), first-visit NfL was linked with better performance at baseline in global mental status and verbal memory (Pā€‰<ā€‰0.004). Finally, and upon correction for multiple testing, no stratum-specific associations were found between NfLv1 (or Ī“NfL) and follow-up cognitive performance. Reduction of Model 2 to Model 3, leaving in only additional covariates (in addition to socio-demographics) that were associated with NfL exposures, did not alter our main findings.

Table 2 Baseline and annual rates of change in plasma neurofilament light (v1 NfL, and Ī“NfL) and their association with cognitive performance at v1 and change over time: overall and race-specific mixed-effects linear regression models: HANDLS 2004ā€“2013a.
Table 3 Baseline plasma neurofilament light (v1 NfL and Ī“NfL) and their association with cognitive performance at v2: overall and race-specific multiple ordinary least square linear regression models: HANDLS 2004āˆ’2013a.

The main finding among Whites, for NfLv1 vs. normalized MMSE scores across time is presented in terms of predictive margins of outcome per SD of exposure in Fig. 2A. The Figure indicates that among those with higher NfLv1 (i.e., v1 Loge transformed plasma NfL, z-scored: meanā€‰+ā€‰1ā€‰SD), normalized MMSE score was on a decline over a period of 5 years as opposed to participants with NfLv1 at the mean or at meanā€‰āˆ’ā€‰1ā€‰SD, whose performance was improving over time, from an initial low level. This was not the case among African Americans. Figure S1 summarizes findings from Model 1, across race, for all regression analyses with 11 cognitive test scores, three types of outcomes, and two exposures. Figure 2Bāˆ’E shows predictive margins of cognitive performance tests across exposure levels (NfLv1 and Ī“NfL: z-score for annualized change in Loge transformed plasma NfL between v1 and v2) and by sex, age group, and poverty status, highlighting the key exploratory findings.

Fig. 2: Summary of key findings by race, sex, age group, and poverty status across NfL exposuresa,b.
figure 2

aNfLv1 values are Loge transformed and z-scored. Levels of exposure are āˆ’1: meanā€‰āˆ’ā€‰1ā€‰SD; 0: at mean; +1: meanā€‰+ā€‰1ā€‰SD. 1ā€‰SD of baseline Loge(NfL) is estimated at 0.51; meanā€‰=ā€‰1.98. dNfL values are annualized changes in Loge transformed NfL between v1 and v2, z-scored. 1ā€‰SD of annualized change in Loge(NfL) is estimated at 0.101; meanā€‰=ā€‰0.044. All test scores presented in these figures are coded in the direction of higher score ā†’ better performance. bA Predicted margins for normalized MMSE total score across NfLv1 are based on Model 1 among Whites and African Americans in Table 2; B predicted margins for animal fluency scores across dNfL are based on Model 2 among women and men in Table S1; C predicted margins for normalized MMSE total score across NfLv1 are based on Model 1 among ā‰¤50ā€‰y vs. >50ā€‰y age groups in Table S2; D predicted margins for CVLT-List A across NfLv1 are based on Model 1 among ā‰¤50ā€‰y vs. >50ā€‰y age groups in Table S2; E predicted margins for CVLT-DFR across dNfL are based on Model 2 among ā€œabove povertyā€ vs. ā€œbelow povertyā€ groups in Table S3. Abbreviations: AF Animal Fluency; BC baseline cognitive performance; CVLT-DFR California Verbal Learning Test-Delayed Free Recall; CVLT-List A California Verbal Learning Test-List A; dNfL z-scores of annualized rates of change NfL, Loge transformed; NfLv1 plasma NfL levels, Loge transformed, z-scored at v1.

Discussion

Main findings

This study is one of the few to examine plasma NfL baseline level (NfLv1) and its annualized rate of change over a 5ā€‰y follow-up (Ī“NfL) and the longitudinal associations with cognitive performance in middle-aged adults over the same period of time. The study was specifically conducted among a bi-racial urban cohort of middle-aged men and women who were free from dementia at baseline. The sampling strategy allowed us to examine key tested associations across racial groups, and secondarily across sex, age, and poverty status groups. Cognitive performance was measured twice for most selected participants, reflecting global mental status and domains of verbal memory and fluency, visual memory and visuo-spatial abilities, attention, and executive functions. Over a mean follow-up of 4.3 years, we found initial NfL was associated with a faster decline on normalized mental status scores in Whites only and in those >50 years old. Annualized increase in NfL was associated with a greater decline in verbal fluency in men. In other exploratory analyses, annualized increase in NfL was associated with slower decline in verbal memory among individuals living above poverty, while, in the older group (>50 years), first-visit NfL was linked with better performance at baseline in global mental status and verbal memory.

Previous studies and biological mechanisms

Currently, methods to diagnose and monitor neuropathology are based on various imaging modalities, which are expensive with limited availability. CSF biomarkers, including NfL, have also been utilized, but require invasive procedures. Therefore, non-invasive biomarkers of neurocognitive decline are needed to identify those individuals at risk for AD and other neurodegenerative diseases. Plasma NfL may be one such non-invasive biomarker. Recent technological advances indicate that NfL levels measured in the blood, i.e., plasma NfL, are associated with AD diagnosis and with various cognitive, imaging, and biochemical disease measures [1, 15, 35]. CSF NfL also was inversely associated with the clinical dementia rating scale, the Recognition Memory Test [9], and the cognitive sub-scale of an AD assessment battery [10]. Several studies have indicated that CSF NfL is elevated in the early stages of dementia and is a strong predictor for cognitive decline in AĪ² positive individuals [36, 37], and in the general non-demented older adult population [22]. Given that AĪ² positivity alone was not sufficient to predict symptoms of cognitive decline in AD, identifying additional markers of neurodegeneration that are downstream from AĪ² accumulation has high utility for screening individuals in pre-symptomatic trials [9].

Given the high correlation between plasma and CSF NfL levels, and the invasiveness of acquiring CSF, plasma NfL may have greater overall utility as a screening tool. Several recent studies have shown that plasma NfL may accurately predict the estimated year of onset for dementia [38, 39]. In fact, several recent studies have shown that serum or plasma NfL are direct indicators of axonal degeneration based on neuroimaging markers, including gray and white matter pathology [21, 40, 41], and can act as a proxy for hypometabolism in AD-vulnerable brain regions, particularly in AĪ²-positive individuals [42]. Generally, the demyelination of axons triggers inefficiency in energy utilization, dysfunction of the mitochondria, and oxidative stress accumulation, alterations that increase axonal fragmentation and result in neurodegeneration [43]. The spread of such pathology can occur at independent tract locations and their associated gray matter structures [44]. Since such axonal retraction does not often occur simultaneously, it is more likely that baseline plasma NfL rather than follow-up or change in NfL, is associated with change or follow-up outcome of neurodegeneration, as well as adverse cognitive performance outcomes [40, 45]. This is in line with our main findings.

Among older adults, several studies have indicated that plasma NfL is a good predictor for cognitive decline or impairment, independently of neuroimaging markers. One recent study found that individuals with AD or fronto-temporal dementia cases had higher plasma NfL compared to cognitively normal controls, with no differences detected for other neuropsychiatric disorders [46]. Upon adjustment for baseline hippocampal atrophy and memory scores, plasma NfL predicted greater cognitive decline among the cognitively impaired [46]. Another study among older adults suggested that a combination of markers (low plasma AĪ²42/AĪ²40 ratio and high plasma NfL level) was associated with a greater decline in cognitive performance over time [20]. These findings were recently corroborated by Mielke and colleagues who examined both plasma and CSF NfL in relation to cognitive and neuroimaging outcomes in a small sample of older adults (Nā€‰=ā€‰79, median age: 76ā€‰y) participating in the Alzheimerā€™s Disease Neuro-imaging (ADNI) study. Their findings indicate that elevated baseline plasma NfL may adequately predict cognitive decline and brain imaging neurodegenerative measures, with comparable effect sizes to baseline CSF NfL [21]. Furthermore, Rajan and colleagues found that 1,327 older participants, plasma NfL > 25.5ā€‰pg/ml (determined 4ā€“8ā€‰y prior to AD onset) was associated with 110% faster cognitive decline over 16ā€‰y of follow-up, as well as a faster decline in cortical thickness [18]. Similarly, He and colleagues found that among 452 older adults, a combination of elevated AĪ² and plasma NfL was associated with faster decline on the MMSE compared with lower levels, even upon adjustment for APOE4 status [20]. Moreover, Nyberg and colleagues found that plasma NfL, while reflecting white matter alteration, may not be a good predictor for cognitive impairment or impending AD [19]. Most recently, RĆ¼bsamen et. al. (2021) evaluated associations between NfL and tau serum levels, neuropsychological functioning, and brain structure among a sample of 385 adults aged 65+ years enrolled in the Memory and Morbidity in Augsburg Elderly study [16]. The authors used linear regression models adjusted for age, sex, years of education, and comorbidities and reported a cross-sectional association between NfL serum levels and neuropsychological functioning which included standardized cognitive tests spanning the domains of short-term memory, cognitive speed, attention, and motor speed [16]. Furthermore, in a study by Khalil and colleagues (2020), the authors examined age-related changes in NfL serum levels and their associations with brain structure and functioning [17]. In a sample of 335 men and women drawn from the prospective and ongoing Austrian Stroke Prevention Family Study, the authors used backwards stepwise regression while considering comorbidities and observed that individuals with elevated and more variable NfL serum levels tended to show accelerated rates of neuronal injury which may be attributed to subclinical comorbid pathologies [17]. Moreover, the authors reported that baseline NfL serum levels were negatively associated with annualized changes in scores obtained from the Mini-Mental State Examination [17]. Taken together, these studies may suggest associations between NfL levels and changes in brain volume which may, in turn, influence neuropsychological functioning.

Our data in middle-aged adults is in agreement with other studies among older adults, indicating the utility of blood-based NfL as a non-invasive biomarker of cognitive decline, which may allow for disease monitoring. Few studies have examined longitudinal change in blood levels of NfL. In one study of AD, longitudinal plasma NfL levels increased in individuals with several baseline AD-disease measures [10]. Here, we examined longitudinal changes in plasma NfL in non-demented middle-aged adults. Therefore, we were able to assess baseline and rates of change of NfL in relation to longitudinal cognitive test performance across race and other socio-demographic variables (sex, age group, and poverty status). This is important given the limited information about the longitudinal changes in plasma NfL, especially in non-diseased cohorts. These associations we found, highlight the underlying neurodegeneration that occurs over time and suggests that baseline plasma NfL levels in Whites and in individuals >50ā€‰y may be valuable to predict those individuals who will cognitively decline faster than others. The lack of association between NfL and cognitive decline among African Americans may be due to less variability in NfL and limited change in cognitive performance over time within this racial group, especially among middle-aged adults, as compared with Whites and therefore a reduced statistical power to detect such an association. Among Whites, the only other cognitive performance test that was suggestive of an association between first-visit NfL and cognitive decline over time was BTA, reflecting attention, though this relationship did not survive correction for multiple testing (Ī³11ā€‰=ā€‰āˆ’0.072ā€‰Ā±ā€‰0.040, pā€‰<ā€‰0.10, Model 1).

More generally, our study detected few associations between plasma NfL and cognitive decline compared with other studies, due to several possible reasons. First, our sample consisted of middle-aged adults, while most other studies were conducted among older adults aged over 60ā€‰y at baseline. This would result in a less steep decline in cognition over time in our sample compared to others of older mean age at baseline, which in turn would reduce the statistical power to detect an association between exposure and change in cognition over time, keeping exposure variability the same across samples. However, younger age also results in less variability in the plasma NfL exposures, further reducing statistical power. Second, our sample consisted of a diverse group of middle-aged adults, whereas most other studies recruited middle to upper-middle-class White older adults. This difference in age group, racial, and SES composition is expected to yield diverging findings between our study and those of others, mainly due to differing baseline exposure and outcome levels. Finally, we have adjusted for a large number of potential confounders, including body mass index, and cardio-metabolic risk factors, some of which were shown to be associated with plasma NfL in previous studies [47, 48]. We also accounted for literacy, depressive symptoms, and other important factors that most other studies have not controlled for.

Strengths and limitations

Our study has several notable strengths. First, it is one of the largest longitudinal studies to examine plasma NfL levels in relation to cognition, using data from a community-based population, and the first to do so among middle-aged adults. In addition, plasma NfL was detected and quantified in non-demented individuals, which adds value to utilizing this biomarker as an early marker to monitor cognitive decline over time. Second, we had access to an extensive battery of cognitive tests that spanned the main domains of cognition, as well as measuring global mental status. Test scores had mostly two repeats, as did the main exposure of interest, plasma NfL. Third, the well-balanced sampling of HANDLS allowed for stratification of our analyses by race, sex, age group, and poverty status. Fourth, we used advanced statistical techniques, including mixed-effects linear regression models, multiple imputations, and 2-stage Heckman selection to test our key hypotheses, while reducing confounding and selection biases. The availability of two concurrent repeats of exposures and outcomes, allowed us to examine relationships in a detailed and bi-directional manner, though mainly focusing on the potential impact of NfL on cognition, rather than the reverse direction. Nevertheless, our study also has some limitations. First, our study sample was relatively young with a low mean NfL at baseline, when compared to previous studies that examined these questions in older adults. In addition, cognitive decline was limited in that age group, and was only evident above the age of 50ā€‰y. This may have reduced our ability to detect an association between NfL at v1 and change in cognitive function in the overall population. However, our results among Whites and the older group, suggest that NfL at v1 may be a predictor of decline in global mental status in middle-age in those groups who have a high performance on the MMSE at baseline and are prone to decline over a period of ~5ā€‰y.

Conclusions

In summary, first-visit NfL was primarily associated with the global mental status decline among Whites, while exhibiting inconsistent relationships in some exploratory analyses. More comparable longitudinal studies are needed among middle-aged adults to determine the utility of plasma NfL both at baseline and as a marker of change over time in relationship to cognitive performance and decline.

Disclaimer

The views expressed in this article are those of the authors and do not necessarily reflect the official policy or position of Fort Belvoir Community Hospital, the Defense Health Agency, Department of Defense, or U.S. Government. Reference to any commercial products within this publication does not create or imply any endorsement by Fort Belvoir Community Hospital, theDefense Health Agency, Department of Defense, or U.S. Government.