Background: Although recent diagnostic criteria for Alzheimer's disease propose the use of biomarkers, validation of these biomarkers by diagnostic test accuracy studies is a necessary first step, followed by the synthesis of the evidence from these studies in systematic reviews and meta-analyses. The quality of the resulting evidence depends on the number and size of the primary studies, their quality, and the adequacy of their reporting. This systematic review assesses the weight and quality of the evidence available from primary diagnostic test accuracy studies. Methods: A MEDLINE search was performed in August 2011 to identify all potentially relevant publications relating to the biomarkers β-amyloid, tau, positron emission tomography ( 18F-fluorodeoxyglucose or ligands for amyloid), or magnetic resonance imaging (MRI). The reporting and methodology were assessed using the Standards for Reporting of Diagnostic Accuracy and Quality Assessment of Diagnostic Accuracy Studies assessment tools, respectively. Because clinical progression to dementia is the most commonly used reference standard, this review focuses on participants with objective cognitive impairment but no dementia at baseline. Results: Of the 19,104 published references identified by the search, 142 longitudinal studies relating to the biomarkers of interest were identified, which included subjects who had objective cognitive impairment but no dementia at baseline. The highest number of studies (n = 70) and of participants (n = 4722) related to structural MRI. MRI also yielded the highest number of studies with extractable data for meta-analysis (n = 32 [46% of all structural MRI studies]), followed by cerebrospinal fluid tau (n = 24 [73%]). There were few studies on positron emission tomography ligands for amyloid having suitable data for meta-analysis (n = 4). There was considerable variation across studies in reporting outcomes, methods of blinding and selection, means of accounting for indeterminate or missing values, the interval between the test and assessments, and the determination of test thresholds. Conclusions: The body of evidence for biomarkers is not large and is variable across the different types of biomarkers. Important information is missing from many study reports, highlighting the need for standardization of methodology and reporting to improve the rigor of biomarker validation. © 2013 The Alzheimer's Association. All rights reserved.