spiral.imperial.ac.uk  · web viewd bioclinica, inc., newark, ca, usa. e bioclinica, inc., lyon,...

28
Perspective: Clinical relevance of the dichotomous classification of Alzheimer’s disease biomarkers: Should there be a “grey zone”? Kevin McRae-McKee a , Chinedu T. Udeh-Momoh b , Geraint Price b , Sumali Bajaj a , Celeste A. de Jager b , David Scott d , Christoforos Hadjichrysanthou a , Emily McNaughton a , Luc Bracoud e , Sara Ahmadi-Abhari b , Frank de Wolf a , Roy M. Anderson a , Lefkos T. Middleton b,c, †, for the Alzheimer’s Disease Neuroimaging Initiative* Affiliations a Department of Infectious Disease Epidemiology, School of Public Health, Imperial College London, London, United Kingdom b Ageing Epidemiology (AGE) Research Unit, School of Public Health, Imperial College London, London, United Kingdom c Directorate of Public Health, Imperial College Healthcare NHS Trust d Bioclinica, Inc., Newark, CA, USA e Bioclinica, Inc., Lyon, France † Corresponding author. Tel.: +44 20 3311 0216; E-mail address: [email protected] * Data used in preparation of this article were obtained from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database (adni.loni.usc.edu ). As such, the investigators 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32

Upload: others

Post on 27-Jan-2021

0 views

Category:

Documents


0 download

TRANSCRIPT

Perspective: Clinical relevance of the dichotomous classification of Alzheimer’s disease biomarkers: Should there be a “grey zone”?

Kevin McRae-McKeea, Chinedu T. Udeh-Momohb, Geraint Priceb, Sumali Bajaja, Celeste A. de Jagerb, David Scottd, Christoforos Hadjichrysanthoua, Emily McNaughtona, Luc Bracoude, Sara Ahmadi-Abharib, Frank de Wolfa, Roy M. Andersona, Lefkos T. Middletonb,c,†, for the Alzheimer’s Disease Neuroimaging Initiative*

Affiliations

a Department of Infectious Disease Epidemiology, School of Public Health, Imperial College London, London, United Kingdom

b Ageing Epidemiology (AGE) Research Unit, School of Public Health, Imperial College London, London, United Kingdom

c Directorate of Public Health, Imperial College Healthcare NHS Trust

d Bioclinica, Inc., Newark, CA, USA

e Bioclinica, Inc., Lyon, France

† Corresponding author. Tel.: +44 20 3311 0216; E-mail address: [email protected]

* Data used in preparation of this article were obtained from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database (adni.loni.usc.edu). As such, the investigators within the ADNI contributed to the design and implementation of ADNI and/or provided data but did not participate in analysis or writing of this report. A complete listing of ADNI investigators can be found at:

http://adni.loni.usc.edu/wp-content/uploads/how_to_apply/ADNI_Acknowledgement_List.pdf

Abstract word count: 150

Manuscript word count: 4,181

Research in Context

Systematic Review

We reviewed the literature to identify articles that considered the classification of biomarkers associated with Alzheimer’s disease (AD), articles that focused on cognitively healthy individuals with a range of AD biomarker levels and articles that assessed longitudinal cognitive scores relative to AD biomarker classification.

Interpretation

We introduce the concept of a “grey zone” around defined thresholds of AD-related biomarkers in cognitively healthy individuals, within which the ability to distinguish between subsequent trajectories of cognitive decline is lost. Our analysis of cortical amyloid PET levels and subsequent cognitive decline places this work in the context of informing the selection and inclusion criteria of participants into observational and clinical trial studies of cognitive ageing.

Future Directions

A unified effort across research groups is required to further address the issues we describe. Data from biomarker-enriched prospective studies with long follow-up must be pooled to better delineate and characterise the peri-threshold “grey zone” across studies.

Abstract

The 2018 NIA-AA research framework recently redefined Alzheimer’s disease (AD) as a biological construct, based on in vivo biomarkers reflecting key neuropathologic features. Combinations of normal/abnormal levels of three biomarker categories, based on single thresholds, form the AD signature profile that defines the biological disease state as a continuum, independent of clinical symptomatology. While single thresholds may be useful in defining the biological signature profile, we provide evidence that their use in studies with cognitive outcomes merits further consideration. Using data from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) with a focus on cortical amyloid binding, we discuss the limitations of applying the biological definition of disease status as a tool to define the increased likelihood of the onset of the Alzheimer’s clinical syndrome, and the effects that this may have on trial study design. We also suggest potential research objectives going forward, and what the related data requirements would be.

Keywords

NIA-AA; AT(N); Biomarker thresholds; Peri-threshold; Amyloid PET; SUVR; Alzheimer’s clinical syndrome; Cognitive decline; Alzheimer’s Disease Neuroimaging Initiative; ADNI.

1. Introduction

In 2018, the National Institute on Aging and the Alzheimer’s Association (NIA-AA) published a research framework (RF) for the biological definition of Alzheimer’s disease (AD), for clinical research purposes [1]. While the RF provides a purely biologically based description of AD that was derived independently of clinical symptomology, the authors expect more severe pathologic staging to indicate a greater risk of short-term cognitive decline [1], [2]. The dichotomous classification system on which the RF relies may be useful and necessary to describe the biological signature profile of AD, given our current understanding of the disease. However, it is important to understand the limitations and downstream consequences of using this classification system as a predictive or diagnostic tool in relation to the Alzheimer’s clinical syndrome. As an increasing number of clinical trials are utilising this dichotomous approach as part of the selection criteria for trial participants, and subsequently assess change in cognition as the primary study outcome, we argue that high priority should be given to refining the application of this dichotomous classification with respect to the Alzheimer’s clinical syndrome. This is particularly relevant in the preclinical stages of the AD continuum as these stages are increasingly seen as the optimal timepoint for pharmacological and non-pharmacological interventions.

We provide evidence which suggests that the use of a single classification threshold for an AD-related biomarker is overly precise, or strict, when the primary measure of interest (e.g. as a trial outcome) is based on measures of cognition. We do so by demonstrating the existence of a potential “grey zone” of a biomarker, defined as the interval around the threshold within which we lose the ability to differentiate between the subsequent cognitive trajectories of biomarker normal/abnormal individuals. By doing so, we hope to initiate an important discussion around, and further research into, this issue.

We begin by providing a general overview of the origins and evolutions of AD diagnostic criteria and the current use of dichotomous thresholds in AD, and medical research more widely. We discuss some of the practical limitations of the current classification in clinical research and make clear how the concept of the “grey zone” neither criticises nor seeks to improve upon the biological classification system. Instead, we demonstrate the effects of these limitations in practice, using evidence from our analysis of data from the Alzheimer’s Disease Neuroimaging Initiative (ADNI), with a focus on A levels in individuals that are cognitively healthy (CH) at baseline. We propose that going forward, research within the field should focus on individuals near the threshold, what the data requirements for doing so would be, and how harmonisation of the ways in which thresholds are defined and utilised across studies should be given high priority.

2. Background

The last thirty years have seen various iterations of methods for defining AD nosology with each variation utilising combinations of neuropathological features, clinical outcomes and biological profiles to classify individuals according to disease status. It is important to first understand the origins and evolution of AD diagnostic criteria, in order to assess the precision of the current criteria in a practical manner.

During the 80s, much progress was made in defining criteria for Alzheimer’s disease and Alzheimer’s dementia, relating to the pathological diagnosis and clinical syndrome (“possibly” or “probably” due to Alzheimer’s disease), respectively. Khachaturian and colleagues, in 1985, first introduced the concept that Alzheimer’s disease represents a biological construct, with key neuropathological features of increased burden of amyloid and tau, associated with evidence of neurodegeneration and brain atrophy [3]. Subsequently, standardised criteria for the pathological diagnosis of Alzheimer’s disease were defined by the Consortium to Establish a Registry for Alzheimer’s Disease (CERAD) [3]–[5]. This was further refined and updated in 2012 by Hyman and colleagues [6]. With respect to the clinical diagnosis of AD, the National Institute of Neurological and Communicative Disorders and Stroke – Alzheimer’s Disease and Related Disorders Association (NINCDS-ADRDA) published diagnostic criteria relating to the Alzheimer’s clinical syndrome, whereby AD definitive diagnosis required the post-mortem confirmation of Alzheimer’s typical neuropathology [7]. An in vivo diagnosis of AD could only be assigned the term “probable” in the presence of an established Alzheimer’s dementia syndrome, combining an advanced stage of memory and cognitive decline with functional disabilities and evidence that excluded all other possible aetiologies of dementia, through the use of blood, cerebrospinal fluid (CSF) and then available brain imaging tools of computed tomography and, subsequently, magnetic response imaging (MRI). Thus, the definition of Alzheimer’s disease was confined to a clinical dementia syndrome and the use of in vivo biomarkers was intended to provide evidence of exclusion criteria, rather than confirmatory evidence of AD diagnosis.

The relatively recent development of accurate imaging and fluid biomarkers brought along the first attempt, by an International Working Group (IWG) in 2007, to use biomarkers as translational in vivo tools to reflect AD neuropathology [8]. The authors suggested that the diagnosis of AD could be made in vivo using CSF, positron emission tomography (PET) imaging, and structural MRI, in conjunction with a distinct episodic memory phenotype, thought to be AD-specific. Subsequently, the NIA-AA published, in 2011, a series of three papers on diagnostic criteria of each of the disease stages: dementia due to AD [9]; mild cognitive impairment (MCI) due to AD [10]; and finally, recommendations toward the definition of preclinical AD stages [11]. AD was therefore defined as a continuum, from preclinical stages to advanced dementia stages. The 2011 criteria provided both clinical and biomarker-based criteria, the latter including evidence of amyloid burden and neurodegeneration. For the first time, the NIA-AA applied single thresholds for a dichotomous classification of AD-related biomarkers. Subsequently, the IWG revised their 2007 diagnostic criteria to incorporate atypical and mixed dementia phenotypes with supportive biomarker evidence of elevated cortical amyloid and tau loads (IWG2), considering MRI and fluorodeoxyglucose (FDG) PET biomarkers as downstream topographical evidence of regional distribution of pathology [12]. The same authors subsequently revised their IWG2 criteria to differentiate the at-risk of AD status (either abnormal amyloid burden or neurodegeneration) from preclinical AD (both abnormal) [13].

The 2018 NIA-AA RF proposed definition of AD was based on underlying neuropathology that can be translated in vivo using biomarkers related to abnormal levels of aggregated A (defined as “A”, including CSF A42 and cortical amyloid PET), fibrillar tau (defined as “T”, including CSF phosphorylated tau (P-tau) and cortical tau PET), as well as evidence of neurodegeneration or neuronal injury (defined as “(N)”, including CSF total tau (T-tau), FDG PET, and MRI) – the [AT(N)] scheme. Repositioning AD as a biological construct was a departure from the prevailing, primarily clinical (with or without biomarker profiling), classification of AD into one derived from all three AT(N) biomarker categories [1]. As clearly stated by the authors, their main objective was to provide an operational definition and staging of AD across the disease continuum to facilitate clinical research, including trials, and to standardise the reporting of findings of such studies. The RF was intended to be widely used in both interventional randomised clinical trials across the disease continuum and in biomarker-enriched observational prospective studies. Such studies would optimally involve much larger datasets that would, in turn, serve for the qualification and validation of the proposed RF, its key variables, and related methodologies and thresholds. The authors clearly stated that this is a fundamental and necessary step prior to its introduction into medical practice, as diagnostic criteria for AD in clinical diagnosis and management, or in secondary prevention in asymptomatic at-risk individuals [1], [14], [15].

Notably, the AT(N) scheme requires the dichotomous classification of related biomarkers in such a way that an asymptomatic individual would be defined as being on the AD continuum on the basis of a biomarker-related signature profile in the presence of abnormal levels of A markers, with T and (N) biomarkers providing insights into AD diagnosis and non-AD-specific pathologic changes, respectively. Thus, in parallel to adopting a holistic approach to viewing AD as a continuum, the RF’s recommendation relies on single thresholds to dichotomise the levels of each biomarker as either normal, or abnormal – a concept that has been widely used since the 2011 NIA criteria were published. Furthermore, the authors do not specify universal thresholds but describe several methodologies for how thresholds should be adapted by individual research groups to their research goals and environment [1]. While it is clear that the RF defines the AD disease state based on the [AT(N)] signature profile irrespective of the Alzheimer’s clinical syndrome or dementia, more work is now needed to better understand how the current RF subsequently relates to cognitive outcomes given the continued heavy focus on cognitive performance in practice (e.g. as primary outcome measures in clinical trials).

3. Dichotomous thresholds of continuous variables

The field of medical research is no stranger to using thresholds of continuous variables, aimed at identifying at-risk groups of individuals. Examples include, the utility of high/low systolic and diastolic blood pressure levels, and/or high/low cholesterol levels for characterising, within a given period of time, the risk of cardiovascular disease [16]. In the case of hypertension, thresholds were established based on decades of clinical practice and related research, using data from millions of individuals [17]. Such measures have also been used in combination with others to derive risk scores [18], which may in turn be applied as a means to group individuals based on their risk of disease progression, mainly for clinical management purposes.

For AD, where the asymptomatic stage is now believed to be the optimal time for pharmacological and non-pharmacological intervention, defining a set of biomarkers to identify at-risk individuals is still work in progress. This is in part due to the multifactorial nature of the disease, the scarcity of available (and well-powered) longitudinally structured epidemiological data from patients with sufficient numbers of outcomes relative to other diseases, and significant gaps in our understanding of the precise biological mechanisms underpinning disease causation and progression [1], [14], [19]–[22]. In attempting to overcome these obstacles, the NIA-AA RF has provided the research community with a set of tools to begin to disentangle the biological components of the disease, including the dichotomous classification of AT(N) biomarkers into what are considered to be normal and abnormal levels.

The consideration of thresholds was initially introduced in the 2011 NIA-AA research criteria, when the concept of a biomarker-based definition of pre-clinical AD was discussed [11]. Nonetheless, single thresholds come with a number of limitations, in terms of: sensitivity to longitudinal changes in biomarker levels over time [19], [23], [24], sensitivity with respect to choices of region(s) of interest relative to the reference region in the context of imaging markers [19], [24], [25], and sensitivity to often undefined measurement error [24], [26]. These limitations have been acknowledged and efforts are underway to address them – for example, the investigation into sub-threshold amyloid accumulation over time [27], [28], the topology of tracer uptake [27], [29], [30], among others. In addition to the research that is underway to improve the accuracy with which we measure biomarker levels in vivo, is research investigating and discussing possible alternatives to the dichotomous threshold-based system. Alternative methodologies have been discussed previously [1], [31]–[33], most notably the potential for applying a two-threshold approach, one lenient and one strict, which would in essence result in an “intermediary” group.

The perspective presented here is primarily in the context of Florbetapir PET within a clinical population. However, most arguments could, in theory, extend to other amyloid radiotracers [34] in studies that use different radiotracers and image processing methods [34], [35], and other continuous AD biomarkers, where the data are available. Amyloid cortical PET ligand binding, commonly assessed as the standardised uptake value ratio (SUVR), is currently one of the most commonly used biomarker of AD status and risk, along with measures of CSF A42 and/or the A42/A40 ratio. This is a result of the widespread belief of the role of A in the disease aetiology and symptom development despite the, as yet, unproven causal link between amyloid deposition and disease onset/progression [20]–[22], [36]. Nevertheless, amyloid PET is a key input within the NIA-AA 2018 RF and IWG2 for AD. In particular, it is used for determining whether or not an individual is on the AD continuum [1] or at risk of progression [13]. Furthermore, the consideration of these issues in the context of amyloid helps to address the immediate need to better understand what is going wrong in recent failed clinical trials.

4. Does there exist a peri-threshold “grey zone”?

4.1 The concept

To introduce the concept of having a region of uncertainty around a biomarker threshold, it is important to first place AD in the group of long-term chronic diseases of the elderly. This makes it difficult to disentangle the degenerative biological, pathological, and cognitive decline processes underlying natural brain ageing from the neurodegenerative processes associated with disease onset and progression. Unlike most infectious diseases, where the agent and onset of infection is known (or is assumed to be known), the moment at which AD is “triggered” and consequently affects cognitive decline, is not yet understood. There does not currently exist a defined healthy/diseased boundary for AD like the pre-/post-infection boundaries that exist for most infectious diseases. AD progression seems to be a long process with dynamic biological processes and biomarker levels that gradually go from being normal to abnormal and subsequently lead to worsening cognition by some as yet undetermined mechanisms. The gradual manifestation of abnormal biomarker levels, in contrast to the rapid biomarker changes seen in infectious diseases (for example, CD4 counts in HIV), may lend credibility to the concept of a more seamless continuum for AD and subsequent cognitive decline – something which cannot be accounted for using strict dichotomous thresholds. This does not preclude the consideration of thresholds to improve our understanding of the stratification of risk. However, it should be a concept that is kept in mind when trying to differentiate natural from disease-related neurodegenerative ageing.

Although these limitations apply to the entire population, they have a larger impact when it comes to peri-threshold individuals. It is likely that current dichotomous thresholds do well at distinguishing between individuals at either end of the distribution of biomarker measurements, as all derived thresholds (by definition) should, however the question of how well they distinguish between individuals that are near the threshold (or not so near, for that matter) with respect to the onset of the clinical syndrome seems not to have been addressed at present.

The dynamic nature of biomarker level changes within individuals over time and their association with cognitive decline is one reason why more research focusing on peri-threshold individuals is needed. Recent studies by Landau, and Farrell, and colleagues [27], [28] address this important issue by demonstrating that individuals with sub-threshold levels of cortical amyloid binding at baseline may be liable to develop abnormal amyloid pathology, and decline cognitively, at some point in the near, or not so near, future. Until longitudinal measures of amyloid become more accessible, a “grey area” around the threshold may be appropriate in order to better evaluate the cognitive trajectories of sub-threshold individuals that are accumulating over time.

Alternatively, as discussed above, the use of two thresholds that would allow for three groups (i.e. low, intermediate and high) has been proposed to account for some of the biological uncertainty in the exact causes of AD. Such approaches are already in clinical use in other common complex diseases, such as diabetes (pre-diabetes) or in hypertension, where sub-threshold levels are defined as “elevated” as opposed to “abnormal” [1]. Understanding the difference between defining an intermediary group using two thresholds and the concept of defining a “grey zone” is paramount to be able to understand the practical implications of the latter. The proposed three-group/two-threshold alternatives approach the issue biologically by, for example, using a lenient (lower) threshold in order to safely include all “true positive” cases or, vice versa, by using a strict (higher) threshold to ensure that only a limited number of “true negative” healthy controls are misclassified [33]. On the other hand, for purposes of risk prediction of cognitive decline rather than understanding causation, different approaches may be applicable. A “grey zone” approach accepts the dichotomised signature profile for the biological definition of disease status but represents an interval around its threshold within which more data is needed to better disentangle and define the risk of cognitive decline and the occurrence of the clinical syndrome. More specifically, the boundaries of a “grey zone” should be defined be in relation to the biomarker profile’s ability to distinguish between differences in the outcome of interest – in this case, cognitive decline.

For these reasons, the strict dichotomous classification of disease state based on single measurements from CH individuals may be sub-optimal for the purposes of prospective investigations into the risk of cognitive decline. Until we fully understand the biological relationship between AD-related biomarkers and the progression of clinical symptomology, a flexible approach may be more appropriate to assigning individuals to at-risk of cognitive decline groups [22], [37]. Such an approach would not only improve recruitment into randomised clinical trials but would also refrain from excluding those at-risk sub-threshold individuals from prospective longitudinal studies that carry the potential of providing more detailed information pertaining to the issues raised in this perspective piece.

4.2. The Alzheimer’s Disease Neuroimaging Initiative: a case study

Given the repeated failures of randomised clinical trials testing anti-amyloid medications [22], [38]–[40], we examined these issues in the context of amyloid deposition with longitudinal measures of cognition as the study outcome. Relevant data from preclinical individuals with measured amyloid PET were obtained from the ADNI study database, focussing on individuals that were CH at baseline, with and without subjective complaints. Figure 1 depicts the differences in cognitive decline over time, as assessed with the Preclinical Alzheimer Cognitive Composite (PACC) [41] modified to include the Trail-Making Test B. It illustrates the differences in cognitive decline between individuals with normal/abnormal levels of Florbetapir SUVR measured via PET (amyloid abnormality defined as whole-cerebellum SUVR above 1.11 [36], which has become the commonly accepted threshold using ADNI data) in a sample of individuals who were diagnosed as cognitively normal or with subjective memory concerns. Longitudinal PACC scores were modelled using linear mixed-effect regression with a random intercept and slope. The analyses were adjusted for relevant covariates and included a quadratic term for time since baseline (see Figure 1 caption for more details). The various components in Figure 1 illustrate the differences, or lack thereof, for decreasing numbers of “extreme” SUVR levels (i.e. measurements at either end of the overall distribution), between progressively smaller sample subsets of individuals. Subsets were defined as being within a given distance of the established SUVR threshold of 1.11 in either positive or negative directions. The distances from the threshold of +/- 0.20, 0.10 and 0.05 were chosen to represent a minimal decrease in overall sample size, to reflect the random variation in SUVR levels over time accounting for various factors, and to retain the proportion of the sample classified as abnormal (Figure 2). Trajectories of cognitive decline have been modelled using similar methods split by amyloid positivity/negativity [42], [43] and extended to include neurodegenerative markers and suspected non-Alzheimer’s disease pathophysiology (SNAP) [2], [44].

By iteratively focusing on individuals that are closer to the threshold, we can begin to unravel the point where the threshold loses its ability to describe differences in subsequent patterns of cognitive decline. Figure 1C demonstrates how, despite including a wide range of values around a SUVR threshold of 1.11 relative to the distribution in the sample (+/- SUVR 0.10), we lose the ability to differentiate between the subsequent trajectories of cognitive decline. This concept can be illustrated by comparing the overlapping credible intervals (CIs) after 5 years for individuals within 0.10 of the threshold (SUVR 1.01 – 1.21) in Figure 1C (positive: -0.55 [-1.93, 0.90]; negative: 0.04 [-1.01, 1.03]) with the non-overlapping CIs for individuals within 0.20 of the threshold (SUVR 0.91 – 1.31) in Figure 1B (positive: -1.63 [-2.61, -0.63]; negative: 0.18 [-0.54, 0.88]) after the same period of time. This specific finding implies that the cognitive trajectories of individuals within at least 0.10 of the SUVR threshold of 1.11 may be indistinguishable within 5 years. Of note, approximately 40% of individuals within our sample had baseline PET levels within 0.10 of the SUVR threshold.

This case study is meant to be illustrative of the issues surrounding the cognitive trajectories of peri-threshold individuals. Further work in this area may consider clinically meaningful differences in cognitive decline as opposed to the divergence of cognitive trajectories.

5. Suggestions going forward

5.1 Understanding the “grey zone”, not redefining thresholds

In light of these observations, the question arises whether the current classification based on a single threshold value needs to be revised, in particular with respect to clinical trials involving cognitively unaffected peri-threshold individuals that we have shown may carry the same risk of cognitive decline over time. The large proportion of peri-threshold individuals in ADNI highlights the importance of addressing these questions.

If we accept that there is an “unknown” instance around the threshold within which absolute measurements of a single biomarker are unable, in isolation, to distinguish between biomarker-based normal and abnormal individuals with respect to risk of cognitive decline over time, then the classification procedures for individuals whose measurements lie within this range will need to be supplemented using other data sources. This does not mean simply further stratifying peri-threshold positive/negative amyloid individuals into smaller, more precise groups [2]. An example of this would be stratification into AT(N) groups. It means taking this population as an amyloid homogenous peri-threshold group and defining risk of cognitive decline using additional data sources – ideally commonly collected measures.

Given the current state of knowledge and understanding of disease aetiology and pathophysiology, we envisage that a more integrative multidimensional approach would prove useful in defining the at-risk of cognitive decline in a group of CH peri-threshold individuals. Such a multi-dimensional approach may include the AT(N) markers but would extend beyond to include measures of other known, but not necessarily AD-specific, risk factors, such as age, genetics (e.g. APOE or APOE/TOMM40 allele carriage status [45]), polygenic risk scores [46], [47] or others, such as the CAIDE (Cardiovascular Risk Factors, Aging, and Incidence of Dementia) Dementia Risk Score [18], used in the Finnish Geriatric Intervention Study to Prevent Cognitive Impairment and Disability (FINGER) randomised control trial [48]. Metrics related to cognitive reserve, brain reserve and brain maintenance [49] would also be relevant in this context due to the focus on cognitive outcomes. The development and precise composition of such a composite measure would be based on emerging evidence from biomarker-enriched prospective studies from large datasets. Such a composite measure would allow to dissect, understand and better classify peri-threshold individuals, in terms of risk of cognitive decline and progression to the Alzheimer’s clinical syndrome, for interventional clinical research studies.

5.2 Requirements going forward

Unlike in the case of hypertension referenced earlier, AD researchers do not yet have the luxury of biomarker-enriched studies of sufficiently large sample sizes, frequent measurements, and long follow-up periods to adequately address these complex issues. Collaboration between research groups, in sharing their respective cohort data, would accelerate this process and would provide a tool for the validation of findings, such as those presented here. A unified effort is now required, perhaps under the auspices of the Alzheimer’s Association, that brings together data from multiple biomarker-enriched prospective studies with long follow-up, and failed clinical trials, to begin the difficult task of trying to harmonise the definition of thresholds across studies (as is underway with the calibration of different amyloid PET ligands along the common Centiloid scale [50]). Such an effort to harmonise should focus both on the biological disease signature and, in cognitive healthy individuals, the risk of cognitive decline with the concept of the “grey zone” in mind. This may lead to universally adopted thresholds and a better delineation of the peri-threshold “grey zone” across studies.

6. Summary

Discussions around issues such as the ones presented here are a necessity going forward. We argue that high priority should be given to moving beyond and improving upon the current utilisation of the dichotomous classification system in clinical research settings, given its limitations. Overall, the 2018 NIA-AA RF has accelerated the field of biomarker research and has provided a means to standardise the reporting of research findings. Nevertheless, the link between the biological profile aimed at defining disease status and the future risk profile of cognitive decline needs to be better understood. We suggest that a key focus of further research should be on how to begin to differentiate between the cognitive trajectories of peri-threshold individuals, while in parallel continuing to address the biological challenges mentioned previously (sub-threshold accumulation, image processing, regions of interest, clear definition of measurement error, among others). The only way to do so is through a unified effort across multiple research groups. The overarching aim of the perspective presented here is to stimulate further discussion and research on this topic. The challenge going forward will be finding a balance between accuracy, clinical usefulness, and practicality.

List of figures

Figure 1: Cognitive decline by amyloid status and peri-threshold interval

Mean Preclinical Alzheimer’s Cognitive Composite (PACC) score over time for a range of data subsets: (A) full sample; (B) within 0.20 of the threshold; (C) within 0.10 of the threshold; (D) within 0.05 of the threshold. Models for each subset were fit based on complete data from 356, 308, 150 and 72 individuals, respectively. Baseline values SUVR values were imputed prior to dichotomisation in cases where individuals had follow-up standard uptake value ratio (SUVR) measurements but no baseline measurement. Amyloid positivity was defined as Florbetapir whole-cerebellum SUVR greater than 1.11. Linear mixed-effect regression with random intercept and slopes was used to model longitudinal PACC scores controlling for amyloid status and baseline covariates. The modelling was conducted in a Bayesian framework with diffuse/non-informative priors. Time since baseline was modelled as a quadratic term. Results presented above are for mean age and ventricular volume (adjusted for intracranial volume) at baseline, APOE4 positive, female, mean years of education. Solid lines represent the expected PACC score over time with associated shaded regions representing 95% credible intervals. Inset histograms present the distribution of individuals by amyloid status included in each analysis. To verify that the trajectories presented here are robust to the imbalance in amyloid status, amyloid positive individuals were bootstrapped such that sample sizes were equally between both groups.

Figure 2: Proportion of total sample and amyloid positivity by peri-threshold interval

The grey line represents the proportion of the total sample size that would be included as a function of the width of the interval around the current threshold value of 1.11. The blue line represents the proportion of the included sample that would classified as amyloid positive (note: the proportion of the total sample that is amyloid positive is 0.33) as a function of the width of the interval. Dotted lines (B), (C) and (D) correspond with the one-sided widths of intervals of 0.20, 0.10 and 0.05 respectively and are aligned with those from Figure 1.

Abbreviations

Aβ – β-amyloid

AD – Alzheimer’s disease

ADNI – Alzheimer’s Disease Neuroimaging Initiative

CAIDE – Cardiovascular Risk Factors, Aging, and Incidence of Dementia

CERAD – Consortium to Establish a Registry for Alzheimer’s Disease

CI – Credible interval

CH – Cognitively healthy

CSF – Cerebrospinal fluid

FDG – Fluorodeoxyglucose

FINGER – Finnish Geriatric Intervention Study to Prevent Cognitive Impairment and Disability

IWG(2) – International Working Group (2)

MCI – Mild cognitive impairment

MRI – Magnetic response imaging

NIA-AA – National Institute on Aging and the Alzheimer’s Association

NINCDS-ADRDA – National Institute of Neurological and Communicative Disorders and Stroke – Alzheimer’s Disease and Related Disorders Association

P-tau – Phosphorylated tau

PACC – Preclinical Alzheimer Cognitive Composite

PET – Positron emission tomography

RF – NIA-AA 2018 Research Framework

SNAP – Suspected non-Alzheimer’s Pathophysiology

SUVR – Standardised uptake value ratio

T-tau – Total tau

Disclosure of potential conflicts of interest and funding

Conflict of interest: Prof Roy M. Anderson is a non-executive board member of GlaxoSmithKline (GSK). GSK played no part in this research, its funding or the preparation of the manuscript. Prof Lefkos T. Middleton has consultancy agreements with Eli Lilly, Astra Zeneca, Novartis and Takeda; He is National (UK) Coordinator for the TOMMORROW, Amaranth and Generation I&II Clinical Trials and the Janssen Chariot PRO studies, has received research funding for his Imperial team from Janssen, Takeda, AstraZeneca, Novartis and UCB Pharmaceuticals; and does not hold any agreement with any of the funders in relation to patents, products in development relevant to this study or marketed products. Prof Frank de Wolf was employed by the Janssen Prevention Center until February 2019. The Janssen Prevention Center played no part in this research, its funding, or the preparation of the manuscript.

Funding: DPUK provided infrastructure for this project through MRC grant reference MR/L023784/2. We acknowledge joint Centre funding from the UK Medical Research Council and Department for International Development through grant ref' MR/R015600/1. Affiliation: MRC Centre for Global Infectious Disease Analysis.

Research involving human participants and/or animals

Ethical approval (humans): As per ADNI protocols, all procedures performed in studies involving human participants were in accordance with the ethical standards of the institutional and/or national research committee and with the 1964 Helsinki declaration and its later amendments or comparable ethical standards. More details can be found at adni.loni.usc.edu. (This article does not contain any studies with human participants performed by any of the authors)

Ethical approval (animals): This article does not contain any studies with animals performed by any of the authors.

Protocol approval, ethics approval and consent to participate

The study was approved by the Institutional Review Boards of all the participating institutions and informed written consent was obtained from all participants at each site. One such institution is the Office for the Protection of Research Subjects at the University of Southern California. More details can be found at adni.loni.usc.edu.

Consent for publication

Consent for publication has been granted by ADNI administrators.

Availability of data and material

Data used in preparation of this article were obtained from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database (adni.loni.usc.edu). The primary goal of ADNI has been to test whether serial MRI, PET, other biological markers, and clinical and neuropsychological assessment can be combined to measure the progression of mild cognitive impairment and early Alzheimer’s disease. The Principal Investigator of ADNI is Michael W. Weiner, MD (email: [email protected]).

Authors’ contributions

Wrote the manuscript (KMM, CU, DS, LM); designed the study (KMM, CdJ, SB, LB, LM); conducted the statistical analysis (KMM, SB, LB) and interpreted the results (KMM, SB, CdJ, CH, LB, FdW, LM); critically reviewed and revised the manuscript (KMM, CU, GP, SB, CdJ, DS, CH, EM, LB, SAA, FdW, RA, LM); supervised the study (RA, LM). All authors approved this manuscript for publication.

Acknowledgements

The investigators within the ADNI contributed to the design and implementation of ADNI and/or provided data but did not participate in analysis or writing of this report. A complete listing of ADNI investigators and affiliations can be found at:

http://adni.loni.usc.edu/wp-content/uploads/how_to_apply/ADNI_Acknowledgement_List.pdf.

ADNI was launched in 2003 as public-private partnership, led by Principal Investigator Michael W. Weiner, MD. ADNI is funded by the National Institute on Aging, the National Institute of Biomedical Imaging and Bioengineering, and through generous contributions from the following: AbbVie, Alzheimer’s Association; Alzheimer’s Drug Discovery Foundation; Araclon Biotech; BioClinica, Inc.; Biogen; Bristol-Myers Squibb Company; CereSpir, Inc.; Cogstate; Eisai Inc.; Elan Pharmaceuticals, Inc.; Eli Lilly and Company; EuroImmun; F. Hoffmann-La Roche Ltd and its affiliated company Genentech, Inc.; Fujirebio; GE Healthcare; IXICO Ltd.; Janssen Alzheimer Immunotherapy Research & Development, LLC.; Johnson & Johnson Pharmaceutical Research & Development LLC.; Lumosity; Lundbeck; Merck & Co., Inc.; Meso Scale Diagnostics, LLC.; NeuroRx Research; Neurotrack Technologies; Novartis Pharmaceuticals Corporation; Pfizer Inc.; Piramal Imaging; Servier; Takeda Pharmaceutical Company; and Transition Therapeutics. The Canadian Institutes of Health Research is providing funds to support ADNI clinical sites in Canada. Private sector contributions are facilitated by the Foundation for the National Institutes of Health (www.fnih.org). The grantee organization is the Northern California Institute for Research and Education, and the study is coordinated by the Alzheimer’s Therapeutic Research Institute at the University of Southern California. ADNI data are disseminated by the Laboratory for Neuro Imaging at the University of Southern California.

References

[1]C. R. Jack et al., “NIA-AA Research Framework: Toward a biological definition of Alzheimer’s disease,” Alzheimers Dement. J. Alzheimers Assoc., vol. 14, no. 4, pp. 535–562, 2018.

[2]C. R. Jack et al., “Associations of Amyloid, Tau, and Neurodegeneration Biomarker Profiles With Rates of Memory Decline Among Individuals Without Dementia,” JAMA, vol. 321, no. 23, pp. 2316–2325, Jun. 2019.

[3]Z. S. Khachaturian, “Diagnosis of Alzheimer’s Disease,” Arch. Neurol., vol. 42, no. 11, pp. 1097–1105, Nov. 1985.

[4]B. T. M. Hyman and J. Q. M. Trojanowski, “Editorial on Consensus Recommendations for the Postmortem Diagnosis of Alzheimer Disease from the National Institute on Aging and the Reagan Institute Working Group on Diagnostic Criteria for the Neuropathological Assessment of Alzheimer Disease,” J. Neuropathol., vol. 56, no. 10, pp. 1095–1097, Oct. 1997.

[5]S. S. Mirra et al., “The Consortium to Establish a Registry for Alzheimer’s Disease (CERAD). Part II. Standardization of the neuropathologic assessment of Alzheimer’s disease,” Neurology, vol. 41, no. 4, pp. 479–486, Apr. 1991.

[6]B. T. Hyman et al., “National Institute on Aging-Alzheimer’s Association guidelines for the neuropathologic assessment of Alzheimer’s disease,” Alzheimers Dement. J. Alzheimers Assoc., vol. 8, no. 1, pp. 1–13, Jan. 2012.

[7]G. McKhann, D. Drachman, M. Folstein, R. Katzman, D. Price, and E. M. Stadlan, “Clinical diagnosis of Alzheimer’s disease: report of the NINCDS-ADRDA Work Group under the auspices of Department of Health and Human Services Task Force on Alzheimer’s Disease,” Neurology, vol. 34, no. 7, pp. 939–944, Jul. 1984.

[8]B. Dubois et al., “Research criteria for the diagnosis of Alzheimer’s disease: revising the NINCDS-ADRDA criteria,” Lancet Neurol., vol. 6, no. 8, pp. 734–746, Aug. 2007.

[9]G. M. McKhann et al., “The diagnosis of dementia due to Alzheimer’s disease: recommendations from the National Institute on Aging-Alzheimer’s Association workgroups on diagnostic guidelines for Alzheimer’s disease,” Alzheimers Dement. J. Alzheimers Assoc., vol. 7, no. 3, pp. 263–269, May 2011.

[10]M. S. Albert et al., “The diagnosis of mild cognitive impairment due to Alzheimer’s disease: recommendations from the National Institute on Aging-Alzheimer’s Association workgroups on diagnostic guidelines for Alzheimer’s disease,” Alzheimers Dement. J. Alzheimers Assoc., vol. 7, no. 3, pp. 270–279, May 2011.

[11]R. A. Sperling et al., “Toward defining the preclinical stages of Alzheimer’s disease: recommendations from the National Institute on Aging-Alzheimer’s Association workgroups on diagnostic guidelines for Alzheimer’s disease,” Alzheimers Dement. J. Alzheimers Assoc., vol. 7, no. 3, pp. 280–292, May 2011.

[12]B. Dubois et al., “Advancing research diagnostic criteria for Alzheimer’s disease: the IWG-2 criteria,” Lancet Neurol., vol. 13, no. 6, pp. 614–629, Jun. 2014.

[13]B. Dubois et al., “Preclinical Alzheimer’s disease: Definition, natural history, and diagnostic criteria,” Alzheimers Dement. J. Alzheimers Assoc., vol. 12, no. 3, pp. 292–323, Mar. 2016.

[14]A. S. Khachaturian et al., “Future prospects and challenges for Alzheimer’s disease drug development in the era of the NIA-AA Research Framework,” Alzheimers Dement. J. Alzheimers Assoc., vol. 14, no. 4, pp. 532–534, 2018.

[15]N. Silverberg, C. Elliott, L. Ryan, E. Masliah, and R. Hodes, “NIA commentary on the NIA-AA Research Framework: Towards a biological definition of Alzheimer’s disease,” Alzheimers Dement. J. Alzheimers Assoc., vol. 14, no. 4, pp. 576–578, 2018.

[16]British Cardiac Society, British Hypertension Society, Diabetes UK, HEART UK, Primary Care Cardiovascular Society, and Stroke Association, “JBS 2: Joint British Societies’ guidelines on prevention of cardiovascular disease in clinical practice,” Heart Br. Card. Soc., vol. 91 Suppl 5, pp. v1-52, Dec. 2005.

[17]P. K. Whelton et al., “2017 ACC/AHA/AAPA/ABC/ACPM/AGS/APhA/ASH/ASPC/NMA/PCNA Guideline for the Prevention, Detection, Evaluation, and Management of High Blood Pressure in Adults: Executive Summary: A Report of the American College of Cardiology/American Heart Association Task Force on Clinical Practice Guidelines,” Circulation, vol. 138, no. 17, pp. e426–e483, Oct. 2018.

[18]M. Kivipelto, T. Ngandu, T. Laatikainen, B. Winblad, H. Soininen, and J. Tuomilehto, “Risk score for the prediction of dementia risk in 20 years among middle aged people: a longitudinal, population-based study,” Lancet Neurol., vol. 5, no. 9, pp. 735–741, Sep. 2006.

[19]A. D. Cohen, S. M. Landau, B. E. Snitz, W. E. Klunk, K. Blennow, and H. Zetterberg, “Fluid and PET biomarkers for amyloid pathology in Alzheimer’s disease,” Mol. Cell. Neurosci., Dec. 2018.

[20]S. Gauthier et al., “Why has therapy development for dementia failed in the last two decades?,” Alzheimers Dement. J. Alzheimers Assoc., vol. 12, no. 1, pp. 60–64, Jan. 2016.

[21]W. Jagust et al., “‘Alzheimer’s disease’ is neither ‘Alzheimer’s clinical syndrome’ nor ‘dementia,’” Alzheimers Dement. J. Alzheimers Assoc., vol. 15, no. 1, pp. 153–157, Jan. 2019.

[22]G. P. Morris, I. A. Clark, and B. Vissel, “Questions concerning the role of amyloid-β in the definition, aetiology and diagnosis of Alzheimer’s disease,” Acta Neuropathol. (Berl.), vol. 136, no. 5, pp. 663–689, Nov. 2018.

[23]P. S. Insel et al., “Time to Amyloid Positivity and Preclinical Changes in Brain Metabolism, Atrophy, and Cognition: Evidence for Emerging Amyloid Pathology in Alzheimer’s Disease,” Front. Neurosci., vol. 11, p. 281, 2017.

[24]S. M. Landau, “Optimizing Longitudinal Amyloid-β PET Measurement: The Challenges of Intensity Normalization,” J. Nucl. Med. Off. Publ. Soc. Nucl. Med., vol. 59, no. 10, pp. 1581–1582, Oct. 2018.

[25]M. J. Grothe et al., “In vivo staging of regional amyloid deposition,” Neurology, vol. 89, no. 20, pp. 2031–2038, Nov. 2017.

[26]M. E. Schmidt et al., “The influence of biological and technical factors on quantitative analysis of amyloid PET: Points to consider and recommendations for controlling variability in longitudinal data,” Alzheimers Dement. J. Alzheimers Assoc., vol. 11, no. 9, pp. 1050–1068, Sep. 2015.

[27]M. E. Farrell, X. Chen, M. M. Rundle, M. Y. Chan, G. S. Wig, and D. C. Park, “Regional amyloid accumulation and cognitive decline in initially amyloid-negative adults,” Neurology, vol. 91, no. 19, pp. e1809–e1821, Nov. 2018.

[28]S. M. Landau, A. Horng, W. J. Jagust, and Alzheimer’s Disease Neuroimaging Initiative, “Memory decline accompanies subthreshold amyloid accumulation,” Neurology, vol. 90, no. 17, pp. e1452–e1460, Apr. 2018.

[29]S. Palmqvist et al., “Earliest accumulation of β-amyloid occurs within the default-mode network and concurrently affects brain connectivity,” Nat. Commun., vol. 8, no. 1, p. 1214, 31 2017.

[30]S. Villeneuve et al., “Existing Pittsburgh Compound-B positron emission tomography thresholds are too high: statistical and pathological evaluation,” Brain J. Neurol., vol. 138, no. Pt 7, pp. 2020–2033, Jul. 2015.

[31]W. Klunk et al., “Why we need two cutoffs for amyloid imaging: Early versus Alzheimer’s-like amyloid-positivity,” Alzheimers Dement., vol. 8, no. 4, Supplement, pp. P453–P454, Jul. 2012.

[32]C. R. Jack et al., “A/T/N: An unbiased descriptive classification scheme for Alzheimer disease biomarkers,” Neurology, vol. 87, no. 5, pp. 539–547, Aug. 2016.

[33]C. R. Jack et al., “Defining imaging biomarker cut points for brain aging and Alzheimer’s disease,” Alzheimers Dement. J. Alzheimers Assoc., vol. 13, no. 3, pp. 205–216, Mar. 2017.

[34]S. M. Landau et al., “Amyloid-β imaging with Pittsburgh compound B and florbetapir: comparing radiotracers and quantification methods,” J. Nucl. Med. Off. Publ. Soc. Nucl. Med., vol. 54, no. 1, pp. 70–77, Jan. 2013.

[35]A. D. Joshi et al., “Performance characteristics of amyloid PET with florbetapir F 18 in patients with alzheimer’s disease and cognitively normal subjects,” J. Nucl. Med. Off. Publ. Soc. Nucl. Med., vol. 53, no. 3, pp. 378–384, Mar. 2012.

[36]S. M. Landau et al., “Amyloid deposition, hypometabolism, and longitudinal cognitive decline,” Ann. Neurol., vol. 72, no. 4, pp. 578–586, Oct. 2012.

[37]B. D. James and D. A. Bennett, “Causes and Patterns of Dementia: An Update in the Era of Redefining Alzheimer’s Disease,” Annu. Rev. Public Health, Jan. 2019.

[38]R. M. Anderson, C. Hadjichrysanthou, S. Evans, and M. M. Wong, “Why do so many clinical trials of therapies for Alzheimer’s disease fail?,” Lancet Lond. Engl., vol. 390, no. 10110, pp. 2327–2329, Nov. 2017.

[39]S. Evans, K. McRae-McKee, M. M. Wong, C. Hadjichrysanthou, F. De Wolf, and R. Anderson, “The importance of endpoint selection: How effective does a drug need to be for success in a clinical trial of a possible Alzheimer’s disease treatment?,” Eur. J. Epidemiol., vol. 33, no. 7, pp. 635–644, Jul. 2018.

[40]L. S. Honig et al., “Trial of Solanezumab for Mild Dementia Due to Alzheimer’s Disease,” N. Engl. J. Med., vol. 378, no. 4, pp. 321–330, 25 2018.

[41]M. C. Donohue et al., “The preclinical Alzheimer cognitive composite: measuring amyloid-related decline,” JAMA Neurol., vol. 71, no. 8, pp. 961–970, Aug. 2014.

[42]M. E. Farrell et al., “Association of Longitudinal Cognitive Decline With Amyloid Burden in Middle-aged and Older Adults: Evidence for a Dose-Response Relationship,” JAMA Neurol., vol. 74, no. 7, pp. 830–838, 01 2017.

[43]M. C. Donohue et al., “Association Between Elevated Brain Amyloid and Subsequent Cognitive Decline Among Cognitively Normal Persons,” JAMA, vol. 317, no. 22, pp. 2305–2316, 13 2017.

[44]S. C. Burnham et al., “Clinical and cognitive trajectories in cognitively healthy elderly individuals with suspected non-Alzheimer’s disease pathophysiology (SNAP) or Alzheimer’s disease pathology: a longitudinal study,” Lancet Neurol., vol. 15, no. 10, pp. 1044–1053, 2016.

[45]M. W. Lutz, D. G. Crenshaw, A. M. Saunders, and A. D. Roses, “Genetic variation at a single locus and age of onset for Alzheimer’s disease,” Alzheimers Dement. J. Alzheimers Assoc., vol. 6, no. 2, pp. 125–131, Mar. 2010.

[46]V. Escott-Price et al., “Common polygenic variation enhances risk prediction for Alzheimer’s disease,” Brain J. Neurol., vol. 138, no. Pt 12, pp. 3673–3684, Dec. 2015.

[47]S. J. van der Lee et al., “The effect of APOE and other common genetic variants on the onset of Alzheimer’s disease and dementia: a community-based cohort study,” Lancet Neurol., vol. 17, no. 5, pp. 434–444, 2018.

[48]M. Kivipelto et al., “The Finnish Geriatric Intervention Study to Prevent Cognitive Impairment and Disability (FINGER): study design and progress,” Alzheimers Dement. J. Alzheimers Assoc., vol. 9, no. 6, pp. 657–665, Nov. 2013.

[49]Y. Stern et al., “Whitepaper: Defining and investigating cognitive reserve, brain reserve, and brain maintenance,” Alzheimers Dement. J. Alzheimers Assoc., Sep. 2018.

[50]W. E. Klunk et al., “The Centiloid Project: standardizing quantitative amyloid plaque estimation by PET,” Alzheimers Dement. J. Alzheimers Assoc., vol. 11, no. 1, pp. 1-15.e1–4, Jan. 2015.