Scolaris Content Display Scolaris Content Display

First trimester serum tests for Down's syndrome screening

Collapse all Expand all

Background

Down's syndrome occurs when a person has three, rather than two copies of chromosome 21; or the specific area of chromosome 21 implicated in causing Down's syndrome. It is the commonest congenital cause of mental disability and also leads to numerous metabolic and structural problems. It can be life‐threatening, or lead to considerable ill health, although some individuals have only mild problems and can lead relatively normal lives. Having a baby with Down’s syndrome is likely to have a significant impact on family life.

Noninvasive screening based on biochemical analysis of maternal serum or urine, or fetal ultrasound measurements, allows estimates of the risk of a pregnancy being affected and provides information to guide decisions about definitive testing. However, no test can predict the severity of problems a person with Down’s syndrome will have.

Objectives

The aim of this review was to estimate and compare the accuracy of first trimester serum markers for the detection of Down’s syndrome in the antenatal period, both as individual markers and as combinations of markers. Accuracy is described by the proportion of fetuses with Down’s syndrome detected by screening before birth (sensitivity or detection rate) and the proportion of women with a low risk (normal) screening test result who subsequently had a baby unaffected by Down's syndrome (specificity).

Search methods

We conducted a sensitive and comprehensive literature search of MEDLINE (1980 to 25 August 2011), Embase (1980 to 25 August 2011), BIOSIS via EDINA (1985 to 25 August 2011), CINAHL via OVID (1982 to 25 August 2011), The Database of Abstracts of Reviews of Effectiveness (The Cochrane Library 25 August 2011), MEDION (25 August 2011), The Database of Systematic Reviews and Meta‐Analyses in Laboratory Medicine (25 August 2011), The National Research Register (Archived 2007), Health Services Research Projects in Progress database (25 August 2011). We did forward citation searching ISI citation indices, Google Scholar and PubMed ‘related articles’. We did not apply a diagnostic test search filter. We also searched reference lists and published review articles.  

Selection criteria

We included studies in which all women from a given population had one or more index test(s) compared to a reference standard (either chromosomal verification or macroscopic postnatal inspection). Both consecutive series and diagnostic case‐control study designs were included. Randomised trials where individuals were randomised to different screening strategies and all verified using a reference standard were also eligible for inclusion. Studies in which test strategies were compared head‐to‐head either in the same women, or between randomised groups were identified for inclusion in separate comparisons of test strategies. We excluded studies if they included less than five Down's syndrome cases, or more than 20% of participants were not followed up.

Data collection and analysis

We extracted data as test positive or test negative results for Down's and non‐Down's pregnancies allowing estimation of detection rates (sensitivity) and false positive rates (1‐specificity). We performed quality assessment according to QUADAS (Quality Assessment of Diagnostic Accuracy Studies) criteria. We used hierarchical summary ROC meta‐analytical methods or random‐effects logistic regression methods to analyse test performance and compare test accuracy as appropriate. Analyses of studies allowing direct and indirect comparisons between tests were undertaken.

Main results

We included 56 studies (reported in 68 publications) involving 204,759 pregnancies (including 2113 with Down's syndrome). Studies were generally of good quality, although differential verification was common with invasive testing of only high‐risk pregnancies. We evaluated 78 test combinations formed from combinations of 18 different tests, with or without maternal age; ADAM12 (a disintegrin and metalloprotease), AFP (alpha‐fetoprotein), inhibin, PAPP‐A (pregnancy‐associated plasma protein A, ITA (invasive trophoblast antigen), free βhCG (beta human chorionic gonadotrophin), PlGF (placental growth factor), SP1 (Schwangerschafts protein 1), total hCG, progesterone, uE3 (unconjugated oestriol), GHBP (growth hormone binding protein), PGH (placental growth hormone), hyperglycosylated hCG, ProMBP (proform of eosinophil major basic protein), hPL (human placental lactogen), (free αhCG, and free ßhCG to AFP ratio. Direct comparisons between two or more tests were made in 27 studies.

Meta‐analysis of the nine best performing or frequently evaluated test combinations showed that a test strategy involving maternal age and a double marker combination of PAPP‐A and free ßhCG significantly outperformed the individual markers (with or without maternal age) detecting about seven out of every 10 Down's syndrome pregnancies at a 5% false positive rate (FPR). Limited evidence suggested that marker combinations involving PAPP‐A may be more sensitive than those without PAPP‐A.

Authors' conclusions

Tests involving two markers in combination with maternal age, specifically PAPP‐A, free βhCG and maternal age are significantly better than those involving single markers with and without age. They detect seven out of 10 Down's affected pregnancies for a fixed 5% FPR. The addition of further markers (triple tests) has not been shown to be statistically superior; the studies included are small with limited power to detect a difference.

The screening blood tests themselves have no adverse effects for the woman, over and above the risks of a routine blood test. However some women who have a ‘high risk’ screening test result, and are given amniocentesis or chorionic villus sampling (CVS) have a risk of miscarrying a baby unaffected by Down’s. Parents will need to weigh up this risk when deciding whether or not to have an amniocentesis or CVS following a ‘high risk’ screening test result.

Screening tests for Down’s syndrome in first three months of pregnancy

Background
Down's syndrome (also known as Down's or Trisomy 21) is an incurable genetic disorder that causes significant physical and mental health problems, and disabilities. However, there is wide variation in how Down's affects people. Some individuals are severely affected whilst others have mild problems and are able to lead relatively normal lives. There is no way of predicting how badly a baby might be affected.

Expectant parents are given the choice to be tested for Down’s during pregnancy to assist them in making decisions. If a mother is carrying a baby with Down’s, then there is the decision about whether to terminate or continue with the pregnancy. The information offers parents the opportunity to plan for life with a Down’s child.

The most accurate tests for Down’s involve testing fluid from around the baby (amniocentesis) or tissue from the placenta (chorionic villus sampling (CVS)) for the abnormal chromosomes associated with Down’s. Both these tests involve inserting needles through the mother's abdomen and are known to increase the risk of miscarriage. Thus, the tests are not suitable for offering to all pregnant women. Rather, tests that measure markers in the mother’s blood, urine or on ultrasound scans of the baby are used for screening. These screening tests are not perfect, they can miss cases of Down’s and also give a ‘high risk’ test result to a number of women whose babies are not affected by Down’s. Thus, pregnancies identified as ‘high risk’ using these screening tests require further testing using amniocentesis or CVS to confirm a diagnosis of Down’s.

What we did
The aim of this review was to find out which of the blood screening tests done during the first three months of pregnancy are the most accurate at predicting the risk of a pregnancy being affected by Down's. We looked at 18 different blood markers that can be used alone or in combination, taken before 14 weeks gestation, thus creating 78 screening tests fro Down’s. We found 56 studies, involving 204,759 pregnancies of which 2113 had pregnancies affected by Down's.

What we found
For the first 14 weeks of pregnancy, the evidence supports the use of the double test of two blood markers; pregnancy‐associated plasma protein A (PAPP‐A) and free beta human chorionic gonadotrophin (βhCG), in combination with the mother's age. This test detects around seven out of every 10 (68%) pregnancies affected by Down's. It is common practice to offer amniocentesis or CVS to women with a high risk test result. About one in 20 women (5%) having this test will have a ‘high risk’ result but most of these women will not be carrying a baby with Down’s. We found for tests in the first 14 weeks of pregnancy, there is little evidence to support the use of serum tests made up of more than two blood markers.

Other important information to consider
The blood tests themselves have no adverse effects for the woman, over and above the risks of a routine blood test. However some women who have a ‘high risk’ screening test result, and are given amniocentesis or CVS have a risk of miscarrying a baby unaffected by Down’s. Parents will need to weigh up this risk when deciding whether or not to have an amniocentesis or CVS following a ‘high risk’ screening test result.

Authors' conclusions

Implications for practice

The evidence supports the use of the first trimester double test comprised of PAPP‐A, free βhCG and maternal age, there is little evidence to recommend the use of first trimester serum tests with three or more markers, however the data available on these tests are limited, and based on generally small populations of women. We would not recommend that these tests be introduced into wider clinical practice without careful consideration of cost.

The review has shown that tests involving two or three markers in combination with maternal age are significantly better than those involving one marker. We would therefore recommend that one marker tests are not used for Down's syndrome screening. The choice of multiple markers will depend on the availability of certain assays in local laboratories. On the basis of this review we would recommend the combination of PAPP‐A, free βhCG and maternal age, as it significantly outperforms free βhCG, AFP and maternal age, and is widely available. The data for other test combinations limits our ability to make any other recommendations about specific test combinations. Alternative screening methods should also be considered (i.e. use of ultrasound markers in the first trimester) when making policy decisions, and are the subject of other reviews in this suite.

The screening blood tests themselves have no adverse effects for the woman, over and above the risks of a routine blood test. However some women who have a ‘high risk’ screening test result, and are given amniocentesis or CVS have a risk of miscarrying a baby unaffected by Down’s. Parents will need to weigh up this risk when deciding whether or not to have an amniocentesis or CVS following a ‘high risk’ screening test result.

Implications for research

Further evaluations of test combinations involving three or more markers are required to determine whether their apparent advantages are not chance findings.

Future studies should ensure that adequate sample sizes are recruited, and take opportunities to make comparisons of test performance testing several alternative test combinations on the same serum samples. Such direct comparison removes issues of confounding when making test comparisons, and allows a clear focus on testing the incremental benefit of increasingly complex and expensive testing strategies. The reporting of studies of test accuracy can be improved and more closely adhere to the standards for the reporting of diagnostic accuracy studies (STARD) guideline. Three key aspects of this are 1) formally testing the statistical significance of differences in test performance in direct comparisons and estimating incremental changes in detection rates (together with confidence intervals), 2) clearly reporting the number of mothers studied and their results, and 3) reporting the numbers of women who are lost to follow‐up. Many authors reported results of extrapolating findings to age‐standardised national cohorts to demonstrate the performance of the test, and failed to report the actual numbers studied and evaluated.

For the purposes of meta‐analysis and to allow for comparisons to be made between different tests and combinations, we would recommend the publication of consensus standard algorithms for estimating risk, and reporting of test performance at a standard set of thresholds. This would be difficult to achieve and implement, but an attempt at consensus should be made.

Summary of findings

Open in table viewer
Summary of findings 1. Performance of first trimester serum test strategies with or without maternal age

Review Question

What is the accuracy of serum‐based markers for Down's syndrome screening in the first trimester?

Population

Pregnant women at less than 14 weeks' gestation confirmed by ultrasound, who had not undergone previous testing for Down’s syndrome. Most studies were undertaken in women identified to be high risk based on maternal age

Settings

All settings

Numbers of studies, pregnancies and Down's syndrome cases

56 studies (68 publications) involving 204,759 pregnancies of which 2113 were Down's syndrome pregnancies.

Index tests

18 serum markers (ADAM12, AFP, inhibin, PAPP‐A, ITA, free βhCG, PlGF, SP1, total hCG, progesterone, uE3, GHBP, PGH, hyperglycosylated hCG, ProMBP, hPL, free αhCG, and free ßhCG to AFP ratio) singly or in combination with or without maternal age.

Reference standards

Chromosomal verification (amniocentesis and CVS undertaken during pregnancy, and postnatal karyotyping) and postnatal macroscopic inspection.

Study limitations

35 studies used selective chromosomal verification during pregnancy, and were at risk of under‐ascertainment of Down's syndrome cases due loss of the pregnancy to miscarriage between the serum test and the reference standard.

Tests with at least 70% sensitivity and at least 95% specificity

Test strategy

Studies

Women (cases)

Sensitivity (95% CI)

Specificity (95% CI)

Test*

Test strategies (with or without maternal age) evaluated by a single study

Without maternal age

Double tests

PAPP‐A and AFP

1

96 (16)

81 (54 to 96)

95 (88 to 99)

PAPP‐A and ITA

1

344 (24)

71 (49 to 87)

95 (92 to 97)

Triple tests

PAPP‐A, free ßhCG and ITA

1

344 (24)

75 (53 to 90)

95 (92 to 97)

PlGF, PAPP‐A and free ßhCG

1

699 (90)

72 (62 to 81)

95 (93 to 97)

With maternal age

Double tests

Free ßhCG and SP1

1

60 (14)

71 (42 to 92)

96 (85 to 99)

PAPP‐A and Hyperglycosylated hCG

1

10775 (23)

74 (52 to 90)

95 (95 to 95)

Triple tests

PAPP‐A, free ßhCG and Inhibin

1

1110 (85)

74 (63 to 83)

95 (94 to 96)

PAPP‐A, SP1 and ProMBP

1

192 (15)

73 (45 to 92)

95 (91 to 98)

hPL, PAPP‐A and free ßhCG (1:250 risk)

1

183 (47)

77 (62 to 88)

95 (90 to 98)

Quadruple tests

GHBP, PGH, PAPP‐A and free ßhCG (1:250 risk)

1

335 (74)

76 (64 to 85)

95 (91 to 97)

Quintuple tests

PAPP‐A, free ßhCG, AFP, uE3 and Inhibin

1

1110 (85)

78 (67 to 86)

95 (94 to 96)

PAPP‐A, total hCG, AFP, uE3 and Inhibin

1

1110 (85)

73 (62 to 82)

95 (94 to 96)

Test strategies (with or without maternal age) evaluated by at least two studies

Free ßhCG

4

4280 (390)

25 (18 to 34)

95 (94 to 96)

P < 0.001

PAPP‐A

4

2837 (325)

52 (39 to 65)

95 (94 to 96)

Age, free ßhCG

7

5893 (460)

42 (36 to 48)

95 (94 to 96)

Age, PAPP‐A

5

3491 (359)

55 (46 to 63)

95 (94 to 96)

Age, free ßhCG and AFP

3

2992 (157)

49 (39 to 60)

95 (94 to 96)

Age, PAPP‐A and free ßhCG

17

49827 (1037)

68 (65 to 71)

95 (95 to 95)

Age, PAPP‐A, free ßhCG and AFP

2

2705 (116)

74 (65 to 81)

95 (94 to 96)

Age, ADAM 12, PAPP‐A and free ßhCG

2

1222 (74)

74 (63 to 83)

95 (94 to 96)

Age, PlGF, PAPP‐A and free ßhCG

2

1144 (160)

76 (69 to 82)

95 (93 to 96)

*Likelihood ratio test for the difference in sensitivity between the nine test strategies that were formally compared in a single meta‐analytic model.

ADAM12: a disintegrin and metalloprotease; AFP: alpha‐fetoprotein; αhCG: alpha human chorionic gonadotrophin; ßhCG: beta human chorionic gonadotrophin; CI: confidence interval; CVS: chorionic villus sampling; GHBP: growth hormone binding protein; hCG: human chorionic gonadotrophin; hPL: human placental lactogen; ITA: invasive trophoblast antigen; PAPP‐A: pregnancy‐associated plasma protein A; PGH: placental growth hormone; PIGF: placental growth factor; PROMBP: proform of eosinophil major basic protein; SPI: Schwangerschafts protein 1; uE3: unconjugated oestriol

Open in table viewer
Summary of findings 2. Performance of other first trimester serum test strategies with or without maternal age

Test strategy

Studies

Women (cases)

Sensitivity (95% CI)

Specificity (95% CI)

Threshold

Without maternal age

Single tests

AFP

2

2248 (104)

10 (4 to 21)

95

5% FPR

ADAM 12

1

579 (17)

41 (18 to 67)

95 (93 to 97)

5% FPR

Free ßhCG to AFP ratio

1

476 (9)

11 (0 to 48)

98 (96 to 99)

0.25 MoM

Inhibin

3

2098 (184)

19 (4 to 58)

95

5% FPR

PlGF

1

699 (90)

28 (19 to 38)

95 (93 to 97)

5% FPR

Total hCG

3

2098 (184)

19 (4 to 58)

95

5% FPR

SP1

3

1080 (53)

32 (1 to 96)

95

5% FPR

uE3

1

1110 (85)

13 (7 to 22)

95 (94 to 96)

5% FPR

Double tests

Free ßhCG and AFP

1

1138 (19)

16 (3 to 40)

95 (94 to 96)

5% FPR

Free ßhCG and Inhibin

1

876 (76)

30 (20 to 42)

95 (93 to 96)

5% FPR

PAPP‐A and free ßhCG

2

795 (106)

64 (50 to 76)

95

5% FPR

Triple tests

Total hCG, free αhCG and progesterone

1

129 (17)

53 (28 to 77)

96 (90 to 99)

0.34 MoM

With maternal age

Single tests

ADAM 12

2

703 (46)

67 (46 to 83)

91 (87 to 94)

1:400 risk

AFP

2

1397 (126)

33 (23 to 46)

95

5% FPR

Free αhCG

1

512 (12)

25 (5 to 57)

89 (86 to 91)

1:384 risk

GHBP

1

335 (74)

27 (17 to 39)

95 (91 to 97)

1:250 risk

hPL

1

183 (47)

45 (30 to 60)

93 (88 to 97)

1:250 risk

Inhibin

1

1110 (85)

32 (22 to 43)

95 (94 to 96)

5% FPR

ITA

1

278 (54)

48 (34 to 62)

95 (91 to 98)

5% FPR

PGH

1

335 (74)

41 (29 to 53)

94 (91 to 97)

1:250 risk

PlGF

1

699 (90)

43 (33 to 54)

95 (93 to 97)

5% FPR

ProMBP

1

181 (25)

36 (18 to 57)

94 (89 to 97)

1:250 risk

SP1

2

804 (29)

38 (22 to 56)

95

5% FPR

Total hCG

1

512 (12)

33 (10 to 65)

94 (92 to 96)

1:384 risk

uE3

1

512 (12)

33 (10 to 65)

86 (83 to 89)

1:384 risk

Double tests

ADAM 12 and PAPP‐A

1

691 (46)

61 (45 to 75)

95 (93 to 97)

5% FPR

AFP and free αhCG

1

512 (12)

33 (10 to 65)

87 (83 to 89)

1:384 risk

AFP and total hCG

1

512 (12)

33 (10 to 65)

93 (90 to 95)

1:384 risk

AFP and uE3

1

512 (12)

42 (15 to 72)

87 (84 to 90)

1:384 risk

Free ßhCG and free αhCG

1

512 (12)

42 (15 to 72)

94 (91 to 96)

1:384 risk

Free ßhCG and Inhibin

1

1110 (85)

44 (33 to 55)

95 (94 to 96)

5% FPR

Free ßhCG and total hCG

1

512 (12)

25 (5 to 57)

93 (90 to 95)

1:384 risk

Free ßhCG and uE3

1

287 (41)

61 (45 to 76)

95 (92 to 97)

5% FPR

GHBP and free ßhCG

1

335 (74)

61 (49 to 72)

92 (88 to 95)

1:250 risk

GHBP and PAPP‐A

1

335 (74)

66 (54 to 77)

93 (89 to 96)

1:250 risk

GHBP and PGH

1

335 (74)

47 (36 to 59)

93 (90 to 96)

1:250 risk

hPL and free ßhCG

1

183 (47)

68 (53 to 81)

94 (89 to 97)

1:250 risk

hPL and PAPP‐A

1

183 (47)

55 (40 to 70)

94 (89 to 97)

1:250 risk

PAPP‐A and AFP

2

2705 (116)

63 (50 to 74)

95

5% FPR

PAPP‐A and Inhibin

1

1110 (85)

68 (57 to 78)

95 (94 to 96)

5% FPR

PAPP‐A and ITA

2

622 (78)

62 (46 to 75)

95

5% FPR

PGH and free ßhCG

1

335 (74)

64 (52 to 74)

93 (89 to 96)

1:250 risk

PGH and PAPP‐A

1

335 (74)

65 (53 to 76)

93 (89 to 96)

1:250 risk

Total hCG and free αhCG

1

512 (12)

42 (15 to 72)

92 (89 to 94)

1:384 risk

Total hCG and Inhibin

1

1110 (85)

34 (24 to 45)

95 (94 to 96)

5% FPR

Total hCG and PAPP‐A

2

4327 (133)

66 (54 to 76)

95

5% FPR

Total hCG and uE3

1

512 (12)

42 (15 to 72)

92 (89 to 94)

1:384 risk

uE3 and free αhCG

1

512 (12)

33 (10 to 65)

89 (86 to 91)

1:384 risk

Triple tests

AFP, free αhCG and uE3

1

512 (12)

58 (28 to 85)

82 (79 to 85)

1:384 risk

Free ßhCG, AFP and uE3

1

287 (41)

66 (49 to 80)

95 (92 to 97)

5% FPR

GHBP, PAPP‐A and free ßhCG

1

335 (74)

76 (64 to 85)

94 (91 to 97)

1:250 risk

PAPP‐A, total hCG and Inhibin

1

1110 (85)

69 (58 to 79)

95 (94 to 96)

5% FPR

PGH, PAPP‐A and free ßhCG

1

335 (74)

76 (64 to 85)

94 (91 to 97)

1:250 risk

Total hCG, AFP and uE3

1

512 (12)

42 (15 to 72)

91 (88 to 94)

1:384 risk

Quadruple tests

Free ßhCG, total hCG, AFP and uE3

1

512 (12)

50 (21 to 79)

92 (89 to 94)

1:384 risk

Total hCG, AFP, uE3 and free αhCG

1

512 (12)

50 (21 to 79)

90 (87 to 92)

1:384 risk

Quintuple tests

Free ßhCG, total hCG, AFP, uE3 and free αhCG

1

512 (12)

33 (10 to 65)

90 (87 to 92)

1:384 risk

AFP: alpha‐fetoprotein; αhCG: alpha human chorionic gonadotrophin; ßhCG: beta human chorionic gonadotrophin; CI: confidence interval; FPR: false positive rate; GHBP: growth hormone binding protein; hCG: human chorionic gonadotrophin; hPL: human placental lactogen; ITA: invasive trophoblast antigen; PAPP‐A: pregnancy‐associated plasma protein A; PGH: placental growth hormone; PIGF: placental growth factor; PROMBP: proform of eosinophil major basic protein; SPI: Schwangerschafts protein 1; uE3: unconjugated oestriol

Background

This is one of a series of reviews on antenatal screening for Down's syndrome following a generic protocol (Alldred 2010) ‐ see Published notes for more details.

Target condition being diagnosed

Down’s syndrome

Down’s syndrome affects approximately one in 800 live‐born babies (Cuckle 1987a).  It results from a person having three, rather than two, copies of chromosome 21, or the specific area of chromosome 21 implicated in causing Down's syndrome, as a result of trisomy or translocation. If not all cells are affected, the pattern is described as 'mosaic'. Down’s syndrome can cause a wide range of physical and mental problems. It is the commonest cause of mental disability, and is also associated with a number of congenital malformations, notably affecting the heart. There is also an increased risk of cancers such as leukaemia, and numerous metabolic problems including diabetes and thyroid disease. Some of these problems may be life‐threatening, or lead to considerable ill health, while some individuals with Down’s syndrome have only mild problems and can lead a relatively normal life. 

There is no cure for Down’s syndrome, and antenatal diagnosis allows for preparation for the birth and subsequent care of a baby with Down’s syndrome, or for the offer of a termination of pregnancy. Having a baby with Down’s syndrome is likely to have a significant impact on family and social life, relationships and parents’ work. Special provisions may need to be made for education and care of the child, as well as accommodating the possibility of periods of hospitalisation.

Definitive invasive tests (amniocentesis and chorionic villus sampling (CVS)) exist that allow the diagnosis of Down's syndrome before birth, but carry a risk of miscarriage. No test can predict the severity of problems a person with Down’s syndrome will have. Noninvasive screening tests based on biochemical analysis of maternal serum or urine, or fetal ultrasound measurements, allow an estimate of the risk of a pregnancy being affected and provide parents with information to enable them to make choices about definitive testing. Such screening tests are used during the first and second trimester of pregnancy.

Screening tests for Down's syndrome

Initially, screening was determined solely by using maternal age to classify a pregnancy as high or low risk for trisomy 21, as it was known that older women had a higher chance of carrying a baby with Down’s syndrome (Penrose 1933). 

Further advances in screening were made in the early 1980s, when Merkatz et al. investigated the possibility that low maternal serum alpha‐fetoprotein (AFP), obtained from maternal blood in the second trimester of pregnancy could be associated with chromosomal abnormalities in the fetus. Their retrospective case‐control study showed a statistically significant relationship between fetal trisomy, such as Down’s syndrome, and lowered maternal serum AFP (Merkatz 1984). This was further explored by Cuckle et al in a larger retrospective trial using data collected as part of a neural tube defect (NTD) screening project (Cuckle 1984). This work was followed by calculation of risk estimates using maternal serum AFP values and maternal age, which ultimately led to the introduction of the two screening parameters in combination (Alfirevic 2004).

In 1987, in a small case‐control study of women carrying fetuses with known chromosomal abnormalities, Bogart and colleagues investigated maternal serum levels of human chorionic gonadotrophin (hCG) as a possible screening tool for chromosomal abnormalities in the second trimester (Bogart 1987). This followed the observations that low hCG levels were associated with miscarriages, which are commonly associated with fetal chromosomal abnormalities. They concluded that high hCG levels were associated with Down’s syndrome and because hCG levels plateau at 18 to 24 weeks, that this would be the most appropriate time for screening. Later work suggested that the ß subunit of hCG was a more effective marker than total hCG (Macri 1990; Macri 1993).

Second trimester unconjugated oestriol (uE3), produced by the fetal adrenals and the placenta, was also evaluated as a potential screening marker. In another retrospective case‐control study, uE3 was shown to be lower in Down’s syndrome pregnancies compared with unaffected pregnancies. When used in combination with AFP and maternal age, it appeared to identify more pregnancies affected by Down’s syndrome than AFP and age alone (Canick 1988).

Further work suggested that all three serum markers (AFP, hCG and uE3) showed even higher detection rates when combined with maternal age (Wald 1988a; Wald 1988b) and appeared to be a cost‐effective screening strategy (Wald 1992a).

Two other serum markers, produced by the placenta, have been linked with Down’s syndrome, namely pregnancy associated plasma protein A or PAPP‐A, and first trimester Inhibin A. PAPP‐A has been shown to be reduced in the first trimester of Down’s syndrome pregnancies, with its most marked reduction in the early first trimester (Bersinger 1995). Inhibin A is high in the second trimester in pregnancies affected by Down’s syndrome (Cuckle 1995; Wallace 1995). There are some issues concerning the biological stability and hence reliability of this marker, and the effect this will have on individual risk. 

Screening and parental choice

Antenatal screening is used for several reasons (Alfirevic 2004), but the most important is to enable parental choice regarding pregnancy management and outcome. Before a woman and her partner opt to have a screening test, they need to be fully informed about the risks, benefits and possible consequences of such a test. This includes the choices they may have to face should the result show that the woman has a high risk of carrying a baby with Down’s syndrome and implications of both false positive and false negative screening tests. They need to be informed of the risk of a miscarriage due to invasive diagnostic testing, and the possibility that a miscarried fetus may be chromosomally normal. If, following invasive diagnostic testing, the fetus is shown to have Down’s syndrome, further decisions need to be made about continuation or termination of the pregnancy, the possibility of adoption and finally, preparation for parenthood. Equally, if a woman has a test that shows she is at a low risk of carrying a fetus with Down’s syndrome, it does not necessarily mean that the baby will be born with a normal chromosomal make up. This possibility can only be excluded by an invasive diagnostic test (Alfirevic 2003).The decisions that may be faced by expectant parents inevitably engender a high level of anxiety at all stages of the screening process, and the outcomes of screening can be associated with considerable physical and psychological morbidity. No screening test can predict the severity of problems a person with Down's syndrome will have.

Index test(s)

This review examined serum screening tests used in the first trimester of pregnancy (up to 14 weeks' gestation) comprised of the following 18 individual markers; a disintegrin and metalloprotease 12 (ADAM12), AFP, inhibin, PAPP‐A, invasive trophoblast antigen (ITA), free βhCG, placental growth factor (PlGF), Schwangerschafts protein 1 (SP1), total hCG, progesterone, uE3, growth hormone binding protein (GHBP), placental growth hormone (PGH), hyperglycosylated hCG, proform of eosinophil major basic protein (ProMBP), human placental lactogen (hPL), free alpha human chorionic gonadotrophin (αhCG), and free ßhCG to AFP ratio. These markers can be used individually, in combination with age, and can also be used in combination with each other. The risks are calculated by comparing a woman's test result for each marker with values for an unaffected population, and multiplying this with her age‐related risk. Where several markers are combined, risks are computed using risk equations (often implemented in commercial software) that take into account the correlational relationships between the different markers and marker distributions in affected and unaffected populations.

Alternative test(s)

Down’s syndrome can be detected during pregnancy with invasive diagnostic tests such as amniocentesis or CVS, with or without prior screening. The ability to determine fetal chromosomal make up (also known as a karyotype) from amniotic fluid samples was demonstrated in 1966 by Steele and Breg (Steele 1966), and the first antenatal diagnosis of Down’s syndrome was made in 1968 (Vaklenti 1968). Amniocentesis is an invasive procedure that involves taking a small sample of the amniotic fluid (liquor) surrounding the baby, using a needle which goes through the abdominal wall into the uterus, and is usually performed after 15 weeks' gestation. CVS involves taking a sample of the placental tissue using a needle which goes through the abdominal wall and uterus or a cannula through the cervix. It is usually performed between 10 and 13 weeks' gestation. Amniocentesis and CVS are both methods of obtaining fetal chromosomal material, which are then used to diagnose Down’s syndrome. Both tests use ultrasound scans to guide placement of the needle. Amniocentesis carries a risk of miscarriage in the order of 1%; transabdominal CVS may carry a similar risk (Alfirevic 2003).

Rationale

This is one of a suite of Cochrane reviews, the aim of which is to identify all screening tests for Down's syndrome used in clinical practice, or evaluated in the research setting, in order to try to identify the most accurate test(s) available, and to provide clinicians, policy‐makers and women with robust and balanced evidence on which to base decisions about interpreting test results and implementing screening policies to triage the use of invasive diagnostic testing.

There are many different screening tests which are available and offered which will be the subject of additional Cochrane reviews (currently in preparation or published (Alldred 2012)), and there are other reviews looking at this area. Tests to be assessed in Cochrane reviews include second trimester serum tests; urine tests; first trimester ultrasound markers; tests that combine serum and ultrasound markers; and tests that combine markers from the first trimester with markers from the second trimester. Second trimester ultrasound markers have been assessed in a previous systematic review (Smith‐Bindman 2001).

The topic has been split into several different reviews to allow for greater ease of reading and greater accessibility of data, and also to allow the reader to focus on separate groups of tests, for example, first trimester serum tests alone, first trimester serum and ultrasound, second trimester serum alone, first and second trimester serum combinations, with or without ultrasound markers; and urine markers alone. An overview review will compare the best tests, focusing on commonly used strategies and the best tests from each of these categories. This review is written with the global perspective in mind, rather than to conform with any specific local or national policy, as not all tests will be available in all areas where screening for Down's syndrome is carried out.

A systematic review of second trimester ultrasound markers in the detection of Down’s syndrome fetuses was published in 2001 that concluded that nuchal fold thickening may be useful in detecting Down’s syndrome, but that it was not sensitive enough to use as a screening test. The review concluded that the other second trimester ultrasound markers did not usefully distinguish between Down’s syndrome and pregnancies without Down’s syndrome (Smith‐Bindman 2001). There has yet to be a systematic review and meta‐analysis of the observed data on serum, urine and first trimester ultrasound markers, in order to draw rigorous and robust conclusions about the diagnostic accuracy of available Down’s syndrome screening tests.

Objectives

The aim of this review was to estimate and compare the accuracy of first trimester serum markers for the detection of Down’s syndrome in the antenatal period, both as individual markers and as combinations of markers. Accuracy is described by the proportion of fetuses with Down’s syndrome detected by screening before birth (sensitivity or detection rate), and the proportion of women with a low risk (normal) screening test result who subsequently had a baby unaffected by Down's syndrome (specificity).

Investigation of sources of heterogeneity

We planned to investigate whether a uniform screening test is suitable for all women, or whether different screening methods are more applicable to different groups, defined by advanced maternal age, ethnic groups and aspects of the pregnancy and medical history such as multiple pregnancy, diabetes and family history of Down's syndrome. We also considered whether there existed evidence of overestimation of test accuracy in studies evaluating risk equations in the derivation sample rather than in a separate validation sample.

Methods

Criteria for considering studies for this review

Types of studies

We included studies in which all women from a given population had one or more index test(s) compared to a reference standard. Both consecutive series and diagnostic case‐control study designs were included. Randomised trials where individuals were randomised to different screening strategies and all verified using a reference standard were also eligible for inclusion. Studies in which test strategies were compared head‐to‐head, either in the same women, or between randomised groups were identified for inclusion in separate comparisons of test strategies. Studies were excluded if they included less than five Down's syndrome cases, or more than 20% of participants were not followed up.

Participants

Pregnant women at less than 14 weeks' gestation confirmed by ultrasound, who had not undergone previous testing for Down’s syndrome in their pregnancy were eligible. Studies were included if the pregnant women were unselected, or if they represented groups with increased risk of Down’s syndrome, or difficulty with conventional screening tests including maternal age greater than 35 years old, multiple pregnancy, diabetes mellitus and family history of Down’s syndrome.

Index tests

The following 18 index tests were examined; ADAM12, AFP, inhibin, PAPP‐A, ITA, free βhCG, PlGF, SP1, total hCG, progesterone, uE3, GHBP, PGH, hyperglycosylated hCG, ProMBP, hPL, free αhCG, and free ßhCG to AFP ratio, and combinations of these markers combined with maternal age.

We looked at comparisons of tests used in isolation and in 78 various combinations. These included single (one marker), double (two markers), triple (three markers), quadruple (four markers) and quintuple (five markers) tests, some of which were adjusted for maternal age.

Where tests were used in combination, we looked at the performance of test combinations according to predicted probabilities computed using risk equations and dichotomised into high risk and low risk.

Target conditions

Down's syndrome in the fetus due to trisomy, translocation or mosaicism.

Reference standards

We considered several reference standards, involving chromosomal verification and postnatal macroscopic inspection. Chromosomal verification is considered preferential but because of the risks involved, often not feasible. Where macroscopic inspection or examination raises a question about the possibility of an individual being affected by Down's syndrome, in clinical practice this is usually confirmed or refuted by formal karyotyping.

Amniocentesis and CVS are invasive chromosomal verification tests undertaken during pregnancy. They are highly accurate but the process carries a 1% miscarriage rate, and therefore they are only used in pregnancies considered to be high risk of Down's, or on the mother's request. All other types of testing (postnatal examination, postnatal karyotyping, birth registers and Down’s syndrome registers) are based on information available at the end of pregnancy. For the purposes of meta‐analysis they are considered equivalent. The greatest concern is not their accuracy, but the loss of the pregnancy to miscarriage between the timing of serum testing and the reference standard. Miscarriage with cytogenetic testing of the fetus is included in the reference standard where available.

We anticipated that older studies, and studies undertaken in older women were more likely to have used invasive chromosomal verification tests in all women. Studies undertaken in younger women and more recent studies were likely to use differential verification as they often only used prenatal karyotypic testing on fetuses considered screen positive or high risk according to the screening test; the reference standard for most unaffected infants is likely to be observation of a phenotypically normal baby. Although the accuracy of this combined reference standard is considered high, it is methodologically a weaker approach because pregnancies that miscarry between the index test and birth are likely to be lost from the analysis, and miscarriage is more likely to occur in Down’s than normal pregnancies.

Search methods for identification of studies

We used one generic search strategy to identify studies for all reviews in this series

Electronic searches

We applied a sensitive search strategy to search the following databases using the text words and MeSH terms detailed in Appendix 1, adapting the search strategy for each different database.

Databases searched included:

  • MEDLINE via OVID (1980 to 25 August 2011)

  • Embase via Dialog Datastar (1980 to 25 August 2011)

  • BIOSIS via EDINA (1985 to 25 August 2011)

  • CINAHL via OVID (1982 to 25 August 2011)

  • The Database of Abstracts of Reviews of Effectiveness (25 August 2011)

  • MEDION (25 August 2011)

  • The Database of Systematic Reviews and Meta‐Analyses in Laboratory Medicine (www.ifcc.org/) (25 August 2011)

  • The National Research Register (Archived 2007)

  • Health Services Research Projects in Progress database (HSRPROJ) (25 August 2011)

The search strategy combined three sets of search terms (see Appendix 1). The first set was made up of named tests, general terms used for screening/diagnostic tests and statistical terms. Note that the statistical terms were used to increase sensitivity and were not used as a methodological filter to increase specificity. The second set was made up of terms that encompass Down's syndrome and the third set made up of terms to limit the testing to pregnant women. All terms within each set were combined with the Boolean operator OR and then the three sets were combined using AND. The terms used were a combination of subject headings and free text terms. The search strategy was adapted to suit each database searched.

We attempted to identify cumulative papers which reported data from the same data set, and contacted authors to obtain clarification of the overlap between data presented in these papers, in order to prevent data from the same women being analysed more than once.

Searching other resources

In addition, we examined references cited in studies identified as being potentially relevant, and those cited by previous reviews. We contacted authors of studies where further information was required.

We carried out forward citation searching of relevant items, using the search strategy in ISI citation indices, Google scholar and Pubmed ‘related articles’.

We did not apply language restrictions to the search.

Data collection and analysis

Selection of studies

Two review authors screened the titles and abstracts (where available) of all studies identified by the search strategy. We obtained full‐text versions of studies identified as being potentially relevant and two review authors independently assessed these for inclusion, using a study eligibility screening pro forma according to the pre‐specified inclusion criteria. Any disagreement between the two review authors was settled by consensus, or where necessary, by a third party.

Data extraction and management

A data extraction form was developed and piloted using a subset of 20 identified studies. Two review authors independently extracted data, and where disagreement or uncertainty existed, a third review author validated the information extracted.

Data on each marker were extracted as binary test positive/test negative results for Down's and non‐Down's pregnancies, with a high‐risk result—as defined by each individual study—being regarded as test positive (suggestive or diagnostic of Down's syndrome), and a low‐risk result being regarded as test negative (suggestive of absence of Down's syndrome). Where results were reported at several thresholds, we extracted data at each threshold. 

We made a note of those in special groups that posed either increased risk of Down’s syndrome or difficulty with conventional screening tests, including maternal age greater than 35 years old, multiple pregnancy, diabetes mellitus and family history of Down’s syndrome.

Assessment of methodological quality

We used a modified version of the QUADAS (Quality Assessment of Diagnostic Accuracy Studies) tool (Whiting 2003), a quality assessment tool for use in systematic reviews of diagnostic accuracy studies, to assess the methodological quality of included studies. We anticipated that a key methodological issue would be the potential for bias arising from the differential use of invasive testing and follow‐up for the reference standard according to index test results, bias arising due to higher loss to miscarriage if false negatives than true negatives. We chose to code this issue as originating from differential verification in the QUADAS tool: we are aware that it could also be coded under delay in obtaining the reference standard, and reporting of withdrawals. We omitted the QUADAS item assessing quality according to length of time between index and reference tests, as Down's syndrome is either present or absent rather than a condition that evolves and resolves, and disregarding the differential reference standard issue, thus any length of delay is acceptable. Two review authors assessed each included study separately.  Any disagreement between the two authors was settled by consensus, or where necessary, by a third party. Each item in the QUADAS tool was be marked as ‘yes’, ‘no’ or ‘unclear’, and scores are presented graphically and in tables. We did not use a summary quality score. See Appendix 3 for QUADAS questionnaire.

Statistical analysis and data synthesis

We initially examined each test or test strategy at each of the common risk thresholds used to define test positivity by plotting estimates of sensitivity and specificity from each study on forest plots and in receiver operating characteristic (ROC) space. Test strategies were selected for further investigation if they were evaluated in four or more studies or, if there were two or three studies, but the individual study results indicated performance likely to be superior to a sensitivity of 70% and specificity of 90%.

Estimation of average sensitivity and specificity

The analysis for each test strategy was undertaken first restricting to studies that reported a common threshold to estimate average sensitivity and specificity for each test at each threshold. Although data on all thresholds were extracted, we present only key common thresholds from the literature, close to risks of 1:384, 1:250 and the 5% false positive rate (FPR), unless other thresholds were more commonly reported. Where combinations of tests were used in a risk score we extracted the result for the test combination using the risk score and not the individual components that made up the test.

We undertook meta‐analyses using hierarchical summary ROC (HSROC) models, which included estimation of random‐effects in accuracy and threshold parameters when there were four or more studies. Where there were fewer than four studies and the studies reported test performance at a common threshold, we computed average sensitivity and specificity values by using univariate fixed‐effect or random‐effects logistic regression models to average logit sensitivity and logit specificity separately because of insufficient number of studies to reliably estimate all the parameters in the HSROC model. It is common in this field for studies to report sensitivity for a fixed specificity (usually a 5% FPR). This removes the requirement to account for the correlation between sensitivity and specificity across studies by using a bivariate meta‐analytical method since all specificities are the same value. Thus, at a fixed specificity value, logit sensitivities were pooled using a univariate random‐effects logistic regression model. This model was further simplified to a fixed‐effect model when there were only two or three studies and heterogeneity was not observed on the SROC plot. All analyses were undertaken using the NLMIXED procedure in SAS (version 9.2; SAS Institute, Cary, NC) and the xtmelogit command in Stata version 11.2 (Stata‐Corp, College Station, TX, USA).

Comparisons between tests

We made comparisons between tests, first by utilising all available studies, selecting one threshold from each study to estimate a summary ROC curve without restricting to a common threshold, and second, by making pair‐wise comparisons using studies that compared tests in the same mothers (direct head‐to‐head comparison). The threshold was chosen for each study according to the following order of preference a) the risk threshold closest to 1 in 250; b) a multiples of the median (MoM) or presence/absence threshold; c) the performance closest to a 5% FPR or 95th percentile. The 5% FPR was chosen as a cut‐off point as this is the cut‐off most commonly reported in the literature.

For the analysis that included data from all studies, we compared test strategies in a single HSROC model, including two indicator terms for each test to allow for differences in accuracy and threshold. There was no indication of differing SROC curve shape between tests and so a single SROC shape parameter was included in the model, such that the fitted SROC curves did not cross. The initial meta‐analyses of individual test strategies indicated there were differences in the variability of the accuracy parameter such that the assumption of equal variances may not be justifiable. We attempted to fit a model with separate variance terms for each test strategy for the accuracy parameter but the model did not converge. We therefore restricted the meta‐analysis that compared the accuracy of the different test strategies to only studies that used a 5% FPR threshold so that we could fit a univariate random effects logistic regression model that allowed for a separate variance term for the random‐effects of logit sensitivity for each test. Using nonlinear combinations of the parameter estimates from this model, we derived ratios of sensitivities for each pair of tests included in the model and obtained their corresponding 95% confidence interval (CI) by using the delta method. We used likelihood ratio tests to assess the statistical significance of differences in sensitivity between tests.

For direct comparisons between each pair of tests at the 5% FPR threshold, we used a separate model for each pair‐wise comparison and pooled logit sensitivities using a univariate random‐effects model. As studies rarely reported data cross‐classified by both tests for Down's and normal pregnancies, the analytical method did not take full account of the pairing of test results, but the restriction to direct head‐to‐head comparisons should have removed the potential confounding of test comparisons with other features of the studies. The strength of evidence for differences in performance of test strategies relied on evidence from both the direct and indirect comparisons.

Investigations of heterogeneity

We planned to undertake investigations of heterogeneity if there were 10 or more studies available for a test. We planned to investigate the effect of a covariate by adding covariate terms to the HSROC model to assess differences in accuracy and threshold.

Sensitivity analyses

Mothers with pregnancies identified as high risk for Down's syndrome by serum testing are often offered immediate definitive testing by amniocentesis, whereas those considered low risk are assessed for Down's syndrome by inspection at birth. Such delayed and differential verification will introduce bias most likely through there being greater loss to miscarriage in the Down's syndrome pregnancies that were not detected by the serum testing (the false negative diagnoses). Testing and detection of miscarriages is impractical in many situations, and no clear data are available on the magnitude of these miscarriage rates.

To account for the possible bias introduced by such a mechanism, we planned to perform sensitivity analyses by increasing the percentage of false negatives in studies where delayed verification in test negatives occurred (Mol 1999). We planned to incrementally increase the percentage from 10% to 50%, the final value representing a scenario where a third or more Down's pregnancies than normal pregnancies were likely to miscarry, thought to be higher than the likely value. We intended to conduct the sensitivity analyses on the analysis investigating the effect of maternal age on test sensitivity.

Results

Results of the search

The search for the whole suite of reviews identified a total of 15,394 papers, once the results from each bibliographic database were combined and duplicates were removed. After screening out obviously inappropriate papers based on their title and abstract, 1145 papers remained and we obtained full‐text copies for formal assessment of eligibility. From these, a total of 269 papers were deemed eligible and were included in the suite of reviews. We included a total of 56 studies (reported in 68 publications) in this review of first trimester serum screening, involving 204,759 pregnancies, of which 2113 were Down's syndrome pregnancies.

A total of 78 different test strategies or combinations, at one or more thresholds, were evaluated in the 56 studies. These tests were produced from combinations of 18 different serum tests with and without maternal age; ADAM12, AFP, inhibin, PAPP‐A, ITA, free βhCG, PlGF, SP1, total hCG, progesterone, uE3, GHBP, PGH, hyperglycosylated hCG, ProMBP, hPL, free αhCG, and free ßhCG to AFP ratio. Strategies evaluated included three quintuple tests, three quadruple tests, 12 triple tests, 27 double tests and 15 single tests in combination with maternal age, and three triple tests, five double tests and 10 single tests without maternal age.

The following combinations evaluated included four or more studies.

Double tests with maternal age

  1. Free ßhCG, AFP and maternal age (five studies; 5160 women including 174 Down's syndrome pregnancies)

  2. Free ßhCG, PAPP‐A and maternal age (31 studies; 158,878 women including 1430 Down's syndrome pregnancies)

Single tests with maternal age

  1. Free ßhCG and maternal age (nine studies; 16,656 women including 549 Down's syndrome pregnancies)

  2. PAPP‐A and maternal age(six studies; 13,742 women including 409 Down's syndrome pregnancies)

Single tests without maternal age

  1. Free ßhCG (four studies; 4280 women including 390 Down's syndrome pregnancies)

  2. PAPP‐A (six studies; 25,510 women including 430 Down's syndrome pregnancies)

Of the remaining test combinations, seven were evaluated in three studies, 17 were evaluated in two studies and the remainder were evaluated in single studies only.

Methodological quality of included studies

We judged the methodological quality of the studies to be high in most categories (Figure 1). Due to the nature of testing for Down's syndrome screening and the potential side effects of invasive testing, differential verification is almost universal in the general screening population, as most women whose screening test result is defined as low risk will have their screening test verified at birth, rather than by invasive diagnosis in the antenatal period. Additionally, it was not always possible to ascertain from the included studies whether or not the results of index tests and reference standards were blinded. It would be difficult to blind clinicians performing invasive diagnostic tests (reference standards) to the index test result, unless all women received the same reference standard, which would not be appropriate in most scenarios. Any biases secondary to a lack of clinician blinding are likely to be minimal.


Methodological quality graph: review authors' judgements about each methodological quality item presented as percentages across all included studies.

Methodological quality graph: review authors' judgements about each methodological quality item presented as percentages across all included studies.

Where details of completeness of follow‐up were poorly reported, most studies seemed to assume 100% follow‐up. However, there will inevitably be losses to follow‐up due to women moving out of area, for example. Studies sometimes accounted for these and it is unlikely that there were enough losses to follow‐up to have introduced significant bias. There was likely under‐ascertainment of miscarriage, and very few papers accounted for miscarriage, or performed tissue karyotyping in pregnancies resulting in miscarriage. Some studies attempted to adjust for predicted miscarriage rate and the incidence of Down's syndrome in this specific population, but most did not. We have not attempted to adjust for expected miscarriage rate in this review. There is a higher natural miscarriage rate in the first trimester, however this will be uniform across studies and therefore unlikely to introduce significant bias.

Some studies that provided estimates of risk using multivariable equations used the same data set to evaluate performance of the risk equation as was used to derive the equation. This is often thought to lead to over‐estimation of test performance.

Findings

The findings of the 21 most common and/or best performing test strategies are given in summary of findings Table 1. The remaining 57 strategies are briefly summarised in summary of findings Table 2. The test strategies evaluated by four or more studies are detailed below.

1) Free ßhCG, PAPP‐A and maternal age (double test)

Results for this double test were derived from 31 studies (Biagiotti 1998; Brambati 1994; Christiansen 2005; Christiansen 2007a; Christiansen 2009; Christiansen 2010; Cowans 2010; Crossley 2002a; De Graaf 1999a; Forest 1997; Gyselaers 2005; Haddow 1998; Kagan 2009; Kozlowski 2007 GC; Kozlowski 2007 PC; Krantz 2000; Muller 2003a; Niemimaa 2001a; O'Leary 2006; Orlandi 1997; Sahota 2010; Schaelike 2009; Scott 2004; Spencer 1999a; Torring 2010; Tsukerman 1999; Valinen 2007; Wald 2003a; Wapner 2003; Wojdemann 2005; Zaragoza 2009), and included 158,878 women in whom 1430 pregnancies were known to be affected by Down's syndrome. Seven studies contributed over 10,000 pregnancies each to the data (Crossley 2002a; Gyselaers 2005; Kagan 2009; Krantz 2000; O'Leary 2006; Sahota 2010; Schaelike 2009). Studies presented data for cut‐points of 5% FPR (Biagiotti 1998; Brambati 1994; Cowans 2010; De Graaf 1999a; Forest 1997; Haddow 1998; Kagan 2009; Sahota 2010; Spencer 1999a;Sahota 2010; Torring 2010; Tsukerman 1999; Wald 2003a; Wapner 2003; Zaragoza 2009), 1:250 risk (Christiansen 2005; Christiansen 2007a; Christiansen 2009; Christiansen 2010; Crossley 2002a; Kagan 2009; Muller 2003a; Niemimaa 2001a; Torring 2010; Valinen 2007; Wojdemann 2005), and 1:300 risk (Kozlowski 2007 GC; Kozlowski 2007 PC; Schaelike 2009). At a cut‐point of 5% FPR (17 studies), the sensitivity wa s estimated as 68% (95% confidence interval (CI) 65 to 71) and the specificity at 95% (95% CI 95 to 95). At a cut‐point of 1:250 FPR (11 studies), the sensitivity was estimated as 73% (95% CI 67 to 79) and the specificity as 93% (95% CI 91 to 94).

2) Free ßhCG, AFP and maternal age (double test)

Results for this double test were derived from five studies (Benattar 1999; Biagiotti 1995; Forest 1995; Tsukerman 1999; Wald 2003a), and included 5160 women in whom 174 pregnancies were known to be affected by Down's syndrome. Two contributed over 1000 pregnancies each to the data (Benattar 1999; Tsukerman 1999. Studies presented data for cut‐points of 5% FPR (Biagiotti 1995; Tsukerman 1999; Wald 2003a), 1:250 risk (Benattar 1999) and 1:384 risk (Forest 1995). At a cut‐point of 5% FPR (three studies), the sensitivity was estimated as 49% (95% CI 39 to 60) and the specificity as 95% (95% CI 94 to 96).

3) PAPP‐A and maternal age (single test)

Results for this single test were derived from six studies (Biagiotti 1998; Brambati 1993; Forest 1997; Krantz 2000; Spencer 1999a; Wald 2003a), and included 13,742 women in whom 409 pregnancies were known to be affected by Down's syndrome. Krantz 2000 was the largest study, contributing over 10,000 pregnancies to the data. Studies presented data for cut‐points of 5% FPR (Biagiotti 1998; Brambati 1993; Forest 1997; Spencer 1999a; Wald 2003a) and 1:105 risk (Krantz 2000). At a cut‐point of 5% FPR (five studies), the sensitivity was estimated as 55% (95% CI 46 to 63) and the specificity as 95% (95% CI 94 to 96).

4) Free ßhCG and maternal age (single test)

Results for this single test were derived from nine studies (Biagiotti 1995; Biagiotti 1998; Brambati 1994; Forest 1995; Forest 1997; Krantz 2000; Noble 1995; Spencer 1999a; Wald 2003a), and included 16,656 women in whom 549 pregnancies were known to be affected by Down's syndrome. Krantz 2000 contributed over 10,000 pregnancies to the data. Studies presented data for cut‐points of 5% FPR (Biagiotti 1995; Biagiotti 1998; Brambati 1994; Forest 1997, Noble 1995; Spencer 1999a; Wald 2003a), 1:384 risk (Forest 1995) and 1:105 risk (Krantz 2000). At a cut‐point of 5% FPR (seven studies), the sensitivity was estimated as 42% (95% CI 36 to 48) and the specificity as 95% (95% CI 94 to 96).

5) PAPP‐A alone (single test without maternal age)

Results for this single test were derived from six studies (Brambati 1993; Brameld 2008; Brizot 1994; Casals 1996; Spencer 1999a; Wald 2003a), and included 25,510 women in whom 430 pregnancies were known to be affected by Down's syndrome. Brameld 2008 was the largest study contributing over 20,000 pregnancies to the data. Studies presented data for cut‐points of 5% FPR (Brambati 1993; Brizot 1994; Casals 1996; Spencer 1999a; Wald 2003a) and ≤ 5th percentile (Brameld 2008). At a cut‐point of 5% FPR (four studies), the sensitivity was estimated as 52% (95% CI 39 to 65) and the specificity as 95% (95% CI 94 to 96).

6) Free ßhCG alone (single test without maternal age)

Results for this single test were derived from four studies (Casals 1996; Noble 1997; Spencer 1999a; Wald 2003a), and included 4280 women in whom 390 pregnancies were known to be affected by Down's syndrome. Studies were all of a similar size. Studies presented data at a 5% FPR. At this cut‐point, the sensitivity was estimated as 25% (95% CI 18 to 34) and the specificity as 95% (95% CI 94 to 96).

7) Other test combinations

Of the 73 test combinations evaluated in three or fewer studies, several test combinations demonstrated estimated sensitivities of more than 70% and estimated specificities of more than 90%. Twelve of these were evaluated in single studies (summary of findings Table 2), however, three test combinations were evaluated in two or more studies.

  1. A triple test ofPAPP‐A, free ßhCG, AFP and maternal age was evaluated in three studies (Muller 2003a; Tsukerman 1999; Wald 2003a), had an estimated sensitivity of 74% (95% CI 65 to 81) at a cut‐point of 5% FPR.

  2. A triple test of ADAM 12, PAPP‐A, free ßhCG and maternal age was evaluated in three studies (Christiansen 2010; Torring 2010; Valinen 2009), had an estimated sensitivity of 74% (95% CI 63 to 83) at a cut‐point of 5% FPR.

  3. A triple test of PlGF, PAPP‐A, free ßhCG and maternal age was evaluated in two studies (Cowans 2010; Zaragoza 2009), had an estimated sensitivity of 76% (95% CI 69 to 82) at a cut‐point of 5% FPR.

Comparative analysis of the nine selected test strategies

We chose to estimate detection rates at a 5% FPR, in common with much of the literature. Figure 2 shows point estimates of detection rates for a 5% FPR based on all available data for all nine test combinations described above, and the confidence intervals at a fixed 5% FPR. For example, the plot shows that for the double test with a marker combination of free βhCG, AFP and maternal age, the estimated detection rate at a 5% FPR was 49% (95% CI 39 to 60) based on data from three studies with 157 affected cases and 2992 total participants. The test combinations in Figure 2 are ordered according to decreasing detection rates. The single test strategies with and without maternal age (PAPP‐A alone; free βhCG alone, PAPP‐A and maternal age, and free βhCG and maternal age) have the worst performance, whereas, the triple test strategies (ADAM 12, PAPP‐A, free βhCG and maternal age; PAPP‐A, free βhCG, AFP and maternal age) have the highest performance. In between lie the double tests (free βhCG, PAPP‐A and maternal age; free βhCG, AFP and maternal age). However, it should be noted that the confidence intervals on these estimates are wide and overlap for the lower performing five strategies, suggesting that any of the differences observed may be explicable by chance.


Detection rates (sensitivity) at a 5% false positive rate for the nine selected test strategies. Each circle represents the summary sensitivity for a test strategy and the size of each circle is proportional to the number of Down's cases. The estimates are shown with 95% confidence intervals. The test strategies are ordered on the plot according to decreasing detection rate. The number of studies, cases and women included for each test strategy are shown on the horizontal axis.A = Age, PlGF, PAPP‐A and free ßhCG; B = Age, PAPP‐A, free ßhCG and AFP; C = Age, ADAM 12, PAPP‐A and free ßhCG; D = Age, PAPP‐A and free ßhCG ; E = Age, PAPP‐A; F = PAPP‐A; G = Age, free ßhCG and AFP ; H = Age, free ßhCG; I = Free ßhCG

Detection rates (sensitivity) at a 5% false positive rate for the nine selected test strategies. Each circle represents the summary sensitivity for a test strategy and the size of each circle is proportional to the number of Down's cases. The estimates are shown with 95% confidence intervals. The test strategies are ordered on the plot according to decreasing detection rate. The number of studies, cases and women included for each test strategy are shown on the horizontal axis.

A = Age, PlGF, PAPP‐A and free ßhCG; B = Age, PAPP‐A, free ßhCG and AFP; C = Age, ADAM 12, PAPP‐A and free ßhCG; D = Age, PAPP‐A and free ßhCG ; E = Age, PAPP‐A; F = PAPP‐A; G = Age, free ßhCG and AFP ; H = Age, free ßhCG; I = Free ßhCG

Table 1 shows pair‐wise direct comparisons (head‐to‐head) where studies were available. Such comparisons are regarded as providing the strongest evidence as they compare tests within pregnancies and are thus unconfounded. The table shows the ratios of sensitivities with 95% CIs and P values (P < 0.05 being considered a statistically significant difference) for each test comparison, the number of studies (K) for which data were available. The table shows that the sensitivity of the single test combinations (PAPP‐A alone, free βhCG alone, PAPP‐A and maternal age, and free βhCG and maternal age) tended to be significantly worse (P < 0.05) than the double and triple tests where data are available. The double test comprised of PAPP‐A, free βhCG and maternal age appears to have significantly better (P = 0.004) test accuracy than the double test comprised of free βhCG, AFP and maternal age. Otherwise, there was no strong evidence of significant improvements in sensitivity with the addition of a third marker. However, most comparisons in this table are based on only single studies and are unlikely to be powered to detect differences in detection rates.

Open in table viewer
Table 1. Direct comparisons of the sensitivity of nine test strategies at the 5% false positive rate

Ratio of sensitivity

(95% CI), P value for comparison (studies)

Free ßhCG

PAPP‐A

Age, free ßhCG

Age, PAPP‐A

Age, PAPP‐A , free ßhCG

Age, free ßhCG, AFP

Age, ADAM 12, PAPP‐A, free ßhCG

Age, PAPP‐A, free ßhCG, AFP

PAPP‐A

1.78 (1.10 to 2.88), P = 0.02

(2)

Age, free ßhCG

1.67 (1.11 to 2.50). P = 0.013

(2)

0.94 (0.68 to 1.29), P = 0.70

(2)

Age, PAPP‐A

2.15 (1.37 to 3.38), P = 0.001

(2)

1.20 (0.86 to 1.67), P = 0.29

(3)

1.26 (1.02 to 1.57), P = 0.034

(4)

Age, PAPP‐A , free ßhCG

2.62 (1.77 to 3.87), P < 0.001

(2)

1.47 (1.09 to 2.00), P = 0.012

(2)

1.61 (1.31 tp 1.98), P < 0.001

(5)

1.26 (1.04 to 1.52), P = 0.02

(4)

Age, free ßhCG, AFP

2.19 (1.31 to 3.64), P = 0.002

(1)

0.71 (0.52 to 0.98), P = 0.03

(1)

1.08 (0.80 to 1.46), P = 0.62

(2)

0.61 (0.46 to 0.82), P < 0.001

(1)

0.63 (0.47 to 0.86), P = 0.004

(2)

Age, ADAM 12, PAPP‐A, free ßhCG

1.04 (0.85 to 1.26), P = 0.71

(2)

Age, PAPP‐A, free ßhCG, AFP

3.94 (2.49 to 6.23), P < 0.001

(1)

1.29 (1.03 to 1.60), P = 0.024

(1)

1.91 (1.42 to 2.56), P < 0.001

(1)

1.11 (0.91 to 1.34), P = 0.31

(1)

1.02 (0.88 to 1.20), P = 0.77

(2)

1.62 (1.19 to 2.19), P = 0.002

(2)

Age, PlGF, PAPP‐A, free ßhCG

1.03 (0.91 to 1.17), P = 0.61

(2)

— indicates that no comparative study was available for the pair of tests.

Direct comparisons were made only using data from studies which compared each pair of tests on the same women. Where there were at least two studies, meta‐analysis was performed to summarise and compare the sensitivities. The ratio of sensitivities was computed by division of the sensitivity for the column by the sensitivity for the row. If the ratio of sensitivity is greater than one then the sensitivity of the test for the column is higher than that for the row, if less than one the sensitivity of the test in the row is higher than in the column. All test comparisons that were evaluated by only one study were from Wald 2003. The ratio of the sensitivities for test comparisons from a single study were calculated as a ratio of two proportions.

ADAM12: a disintegrin and metalloprotease; AFP: alpha‐fetoprotein; ßhCG: beta human chorionic gonadotrophin; CI: confidence interval; PAPP‐A: pregnancy‐associated plasma protein A; PIGF: placental growth factor; PROMBP: proform of eosinophil major basic protein

Table 2 shows the same comparisons made using all available data (as used to create Figure 2). Results are in agreement with the direct comparisons, and in addition, showed that the triple test comprised of PlGF, PAPP‐A, free ßhCG and maternal age is significantly better (P = 0.024) than the double test comprised of PAPP‐A, free βhCG and maternal age. However, these comparisons are potentially confounded by differences between the studies, and are based on small numbers of studies.

Open in table viewer
Table 2. Indirect comparisons of the sensitivity of nine test strategies at the 5% false positive rate

Ratio of sensitivity

(95% CI), P value for comparison

Free ßhCG

PAPP‐A

Age, free ßhCG

Age, PAPP‐A

Age, PAPP‐A, free ßhCG

Age, free ßhCG, AFP

Age, ADAM 12, PAPP‐A, free ßhCG

Age, PAPP‐A, free ßhCG, AFP

Studies (cases/ women)

4

(390/4280)

4

(325/2837)

7

(460/5893)

5

(359/3491)

17

(1037/49827)

3

(157/2992)

2

(74/1222)

2

(116/2705)

Studies (cases/ women)

Sensitivity % (95% CI)

25 (18 to 34)

52 (39 to 65)

42 (36 to 48)

55 (46 to 63)

68 (65 to 71)

49 (39 to 60)

74 (63 to 83)

74 (65 to 81)

PAPP‐A

4

(325/2837)

52 (39 to 65)

2.05

(1.37 to 3.09), P = 0.001

Age, free ßhCG

7

(460/5893)

42 (36 to 48)

1.66

(1.17 to 2.36), P = 0.004

0.81

(0.61 to 1.08), P = 0.15

Age, PAPP‐A

5

(359/3491)

55 (46 to 63)

2.16

(1.51 to 3.10), P < 0.001

1.05

(0.78 to 1.42), P = 0.73

1.30

(1.05 to 1.61), P = 0.015

Age, PAPP‐A, free ßhCG

17

(1037/49827)

68 (65 to 71)

2.70

(1.95 to 3.73), P < 0.001

1.31

(1.02 to 1.70), P = 0.037

1.62

(1.40 to 1.88), P < 0.001

1.25

(1.05 to 1.47), P = 0.01

Age, free ßhCG, AFP

3

(157/2992)

49 (39 to 60)

1.95

(1.33 to 2.86), P = 0.001

0.95

(0.69 to 1.32), P = 0.76

1.18

(0.92 to 1.51), P = 0.20

0.90

(0.69 to 1.17), P = 0.45

0.72

(0.59 to 0.89), P = 0.003

Age, ADAM 12, PAPP‐A, free ßhCG

2

(74/1222)

74 (63 to 83)

2.94

(2.07 to 4.16), P < 0.001

1.43

(1.07 to 1.90), P = 0.014

1.77

(1.46 to 2.14), P < 0.001

1.36

(1.10 to 1.67), P = 0.004

1.09

(0.95 to 1.25), P = 0.24

1.50

(1.17 to 1.92), P = 0.001

Age, PAPP‐A, free ßhCG, AFP

2

(116/2705)

74 (65 to 81)

2.93

(2.09 to 4.11), P < 0.001

1.43

(1.08 to 1.88), P = 0.011

1.76

(1.48 to 2.10), P < 0.001

1.35

(1.11 to 1.64), P = 0.002

1.09

(0.97 to 1.22), P = 0.16

1.50

(1.19, to 1.89).

P = 0.001

1.00

(0.84 to 1.18), P = 0.98

Age, PlGF, PAPP‐A, free ßhCG

2

(160/1144)

76 (69 to 82)

3.01

(2.16 to 4.20), P < 0.001

1.47

(1.12 to 1.91), P = 0.005

1.81

(1.54 to 2.14), P < 0.001

1.39

(1.16 to 1.67), P < 0.001

1.12

(1.01 to 1.23), P = 0.024

1.54

(1.23 to 1.93), P < 0.001

1.03

(0.87 to 1.20), P = 0.75

1.03

(0.90 to 1.18), P = 0.7

Ratio of sensitivities were computed by division of the sensitivity for the column by the sensitivity for the row. If the ratio of sensitivity is greater than one then the sensitivity of the test for the column is higher than that for the row, if less than one the sensitivity of the test in the row is higher than in the column.

AFP: alpha‐fetoprotein; αhCG: alpha human chorionic gonadotrophin; ßhCG: beta human chorionic gonadotrophin; CI: confidence interval; PAPP‐A: Pregnancy‐associated plasma protein A

Investigation of heterogeneity and sensitivity analyses

The key characteristics of the 56 included studies is summarised in Table 3 with further details available in the Characteristics of included studies table. Only one test combination— PAPP‐A, free ßhCG and maternal age (17 studies) was evaluated by 10 or more studies but there were no data for investigation of the effect of maternal age or any other potential source of heterogeneity. The planned sensitivity analyses were also not possible.

Open in table viewer
Table 3. Summary of study characteristics

Study

PAPP‐A, free ßhCG and age*

Maternal age (years)

Reference standard

Population

Study design

Study location

Baviera 2010

Mean 35.3 for Down's cases, 30.4 for control

Amniocentesis or follow‐up to birth

Routine screening

Case‐control

Italy

Benattar 1999

Mean 32 (16‐46), 8.3% > 35

Amniocentesis due to maternal age > 38 years (6.1% or women). Karyotyping encouraged for women with positive result on one or more index test. No details of reference standard for index test negative women

Routine screening

Prospective cohort

France

Biagiotti 1995

Not reported

Amniocentesis or CVS

High‐risk referral for invasive testing

Case‐control

Italy

Biagiotti 1998

X

Unclear (maybe all ≥ 38)

Amniocentesis or CVS

High‐risk referral for invasive testing

Retrospective case‐control

Italy

Brambati 1993

Median 38 (20‐47)

CVS

High‐risk referral for invasive testing

Retrospective cohort

Italy

Brambati 1994

X

Not reported

CVS

High‐risk referral for invasive testing

Case‐control

Italy

Brameld 2008

Median 31 (14‐47), 20% ≥ 35

Karyotyping or follow‐up to birth

Routine screening

Retrospective cohort

Australia

Brizot 1994

Median 38 (22‐45)

Fetal karyotyping

High‐risk referral for invasive testing

Retrospective case‐control

UK

Casals 1996

94.4% > 35

CVS

High‐risk referral for invasive testing

Retrospective case‐control

Spain

Christiansen 1999

Not reported

Karyotyping

High‐risk referral for invasive testing

Case‐control

Denmark

Christiansen 2004

Not reported

CVS (for 120 of cases of Down's) or follow‐up to birth (for 36 of cases of Down's)

Routine screening

Case‐control

Denmark

Christiansen 2005

Not reported

Karyotyping

Screening programmes for syphilis and Down's syndrome

Case‐control

Denmark

Christiansen 2007a

X

Median 37.7 (24‐48) for Down's cases, 36.4 (22‐44) for controls

Karyotyping or follow‐up to birth

Routine screening

Case‐control

Denmark

Christiansen 2009

X

Median 37.5 for Down's cases, 36.4 for controls

Karyotyping or follow‐up to birth

Routine screening

Case‐control

Denmark

Christiansen 2010

X

Median 36 (25‐44) for Down's cases, 29 (17‐45) for controls

Karyotyping or follow‐up to birth

Routine screening

Case‐control

Denmark

Cowans 2010

X

Mean 37.0 (IQR 32.9‐40.5) for Down's cases, 32.4 (IQR 29.0‐35.9) for controls

Karyotyping or follow‐up to birth

Routine screening

Case‐control

UK

Crandall 1993

90% > 35

Amniocentesis

High‐risk referral for invasive testing

Retrospective cohort

USA

Crossley 2002a

Median 29.9, 15.4% ≥ 35

CVS offered where women had high NT measurements. Also amniocentesis or follow‐up to birth

Routine screening

Prospective cohort

UK

De Graaf 1999a

X

Not reported

Amniocentesis or CVS

High‐risk referral for invasive testing

Case‐control

The Netherlands

Forest 1995

Mean 29.1 (SD 4.7), 10.7% ≥ 35

Follow‐up to birth

Routine screening

Case‐control

Canada

Forest 1997

X

Mean 27.9, 10.7% ≥ 35

Follow‐up to birth

Routine screening

Case‐control

Canada

Gyselaers 2005

Not reported

Amniocentesis, CVS and postnatal karyotyping

Routine screening

Prospective cohort

Belgium

Haddow 1998

X

Median 37 (15‐51)

Amniocentesis or CVS

High‐risk referral for invasive testing

Prospective cohort

USA

Kagan 2009

X

Mean 35.4 (14.1‐52.2)

Karyotyping or follow‐up to birth

Routine screening

Prospective cohort

UK

Kornman 1998

Not reported

CVS

High‐risk referral for invasive testing

Case‐control

The Netherlands

Kozlowski 2007 GC

Median 32 (15‐48), 26.4% ≥ 35

Karyotyping or follow‐up to birth

Routine screening

Cohort

Germany

Kozlowski 2007 PC

Median 34 (14‐46), 43.2% ≥ 35

Karyotyping or follow‐up to birth

Routine screening

Cohort

Germany

Krantz 2000

34.7% ≥ 35

Not reported

Routine screening

Prospective cohort

USA

Kratzer 1991

Missing

CVS

High‐risk referral for invasive testing

Case‐control

USA

Laigaard 2003

Not reported

Karyotyping, unclear reference standard for controls

Routine screening

Case‐control

Denmark

Macintosh 1993

Median 38 (27‐40)

CVS

High‐risk referral for invasive testing

Retrospective cohort

UK and Italy

Muller 2003a

Not reported

Invasive testing (offered to women with high NT measurement) or follow‐up to birth

Routine screening

Retrospective cohort

France

Nebiolo 1990

Approximately 75% ≥ 35

CVS

High‐risk referral for invasive testing

Retrospective cohort

Italy

Niemimaa 2001a

17.5% ≥35

Invasive testing (patients considered high‐risk based on NT screening) or follow‐up to birth

Routine screening

Prospective cohort

Finland

Noble 1995

Median 34 (15‐47), 47% ≥ 35

Karyotyping performed (27%), ultrasound examination at 20 weeks (65%), or follow‐up to birth (9%).

Routine screening in a high‐risk population

Prospective cohort

UK

Noble 1997

Median 34 (15‐47)

CVS, follow‐up to birth not reported

Routine screening

Case‐control

UK

O'Leary 2006

Median 31 (14‐47), 20% ≥ 35 years

CVS or amniocentesis (women assessed to be high risk on screening) or follow‐up to birth

Routine screening

Prospective cohort

Australia

Orlandi 1997

Range 15‐46, 35% ≥ 35

Not reported

Routine screening

Prospective cohort

Italy

Palomaki 2007

Mean maternal age 32.3 years (SD 4.6 years)

Karyotyping or follow‐up to birth

Routine screening

Prospective cohort

Canada

Qin 1997

Not reported

CVS, amniocentesis, karyotyping at birth, unclear reference standard for control

Routine screening

Case‐control

Denmark

Sahota 2010

X

Median 33.1, 30.1% ≥ 35

Karyotyping or follow‐up to birth

Routine screening

Prospective cohort

China

Schaelike 2009

31.0% ≥35

Karyotyping or follow‐up to birth

Routine screening

Prospective cohort

Germany

Scott 2004

Median 32 (15‐44), 29% ≥ 35

Invasive testing or follow‐up to birth

Routine screening

Prospective cohort

Australia

Spencer 1999a

X

Median cases 38 (19‐46), controls 36 (15‐47)

Invasive testing (high‐risk women) or follow‐up to birth

Referred for invasive testing or self‐referred for screening

Case‐control

UK

Spencer 2002a

Median cases 36 (20‐44), controls 30 (16‐41)

Not reported

Routine screening

Case‐control

UK

Torring 2010

X

Mean 35 for Down's, 31 for controls

Karyotyping or follow‐up to birth

Routine screening

Case‐control

Denmark

Tsukerman 1999

X

Not reported

Karyotyping, karyotyping at birth, follow‐up to birth not reported

Routine screening

Case‐control

Belarus

Valinen 2007

Mean 29.6, 18.6% ≥ 35

Karyotyping or follow‐up to birth

Routine screening

Retrospective cohort

Finland

Valinen 2009

Not reported

Karyotyping or follow‐up to birth

Routine screening

Case‐control

Finland

Van Lith 1992

Not reported

CVS

High‐risk referral for invasive testing

Case‐control

The Netherlands

Wald 2003a

X

Missing

Invasive testing (following second trimester screening) or follow‐up to birth

Routine screening

Case‐control

UK and Austria

Wallace 1995

Mean 32 (22‐44) for Down's cases, 28 (19‐38) for controls

Not reported

Routine screening

Case‐control

UK

Wapner 2003

X

Mean 35 (SD 4.6), 50% ≥ 35

Invasive testing, miscarriage with cytogenetic testing, follow‐up to birth

Routine screening

Prospective cohort

USA

Weinans 2005

Mean 38 (SD 2.7) for Down's cases, 37 (SD 3.0) for controls

CVS

High‐risk referral for invasive testing

Case‐control

The Netherlands

Wojdemann 2005

Mean 29, 10.8% ≥ 35

Invasive testing (in cases of increased risk) or follow‐up to birth

Routine screening

Prospective cohort

Denmark

Zaragoza 2009

X

Median 37.9 (19.1‐46.5) for Down's cases, 32.7 (16.1‐45.2) for controls

Karyotyping or follow‐up to birth

Routine screening

Case‐control

UK

*The PAPP‐A, free ßhCG and age test combination was the only test evaluated by at least 10 studies. X indicates that the test was evaluated in the study.

CVS: chorionic villus sampling; IQR: interquartile range; SD: standard deviation.

Discussion

Summary of main results

The systematic review found a large number of studies evaluating first trimester Down's syndrome serum screening tests, including studies evaluating the commonly used double test. Few studies were available to evaluate the performance of test strategies involving newer markers, such as ADAM 12, and few studies provided unconfounded comparisons of test strategies by applying and comparing several strategies using the same serum sample, the majority of studies only evaluating a single test combination. A summary of results for the nine most common and best performing strategies is given in summary of findings Table 1, briefer details for the remaining strategies are given in summary of findings Table 2.

Three key findings were noted.

  1. The double test comprised of PAPP‐A, free βhCG and maternal age appears to have significantly better (P < 0.05) test accuracy than the double test comprised of free βhCG, AFP and maternal age, and the single tests (both the markers alone and in combination with maternal age). This test detects around seven out of every 10 Down's affected pregnancies for a fixed 5% FPR. By comparison, the double test comprised of free βhCG, AFP and maternal age, and single tests alone and in combination with maternal age detects between two and five out of every 10 Down's affected pregnancies for a fixed 5% FPR.

  2. Whilst the triple test combinations show the highest detection rates, they were not shown to be statistically superior to the double test comprised of PAPP‐A, free βhCG and maternal age. Whilst some significant differences between these categories of tests were noted in the indirect comparisons, the potential for confounding is of concern. Estimates suggest that triple test combinations may detect between seven and eight out of every 10 Down's syndrome pregnancies at a 5% FPR, however these estimates are based on data from two or three studies evaluating small numbers of women. It is difficult to make strong recommendations on the use of triple tests, as we cannot rule out possible differences due to the limited power there is to detect a difference.

  3. The evidence for higher numbers of markers shows similar detection rates to double and triple markers, but are based on data from one study only, therefore further evaluation of these tests is required. Furthermore, there are other combinations of double markers that show similar detection rates to standard double markers commonly used in clinical practice, which may warrant further study.

Strengths and weaknesses of the review

This review is the first comprehensive review of first trimester serum screening. We examined papers from around the world, covering a wide cross‐section of women in varying populations. We contacted authors to verify data where necessary to give as complete a picture as possible, while trying to avoid replication of data.

There were a number of factors that made meta‐analysis of the data difficult, which we tried to adapt for, in order to allow for comparability of data presented in different studies.

  1. There were many different cut‐points used to define pregnancies as high or low risk for Down's syndrome. This means that direct comparison is more difficult than if all studies used the same cut‐point to dichotomise their populations. This is less of an issue for first trimester serum screening, compared to second trimester serum screening, as the majority of authors chose a cut‐point of 5% FPR.

  2. There were many different risk equations and software applications in use for combination of multiple markers, which were often not described in the papers. This means that risks may be calculated by different formulae and they may not be directly comparable for this reason. It is possible that this is responsible for unexplained heterogeneity in results.

  3. Different laboratories and clinics run different assays and use different machines and methods. This may influence raw results and subsequent risk calculations. Many laboratories have a quality assessment or audit trail, however, this may not necessarily be standard across the board. For example, how many assays are run, how often medians are calculated and adjusted for a given population and how quickly samples are tested from initially being taken.

  4. Few papers made direct comparisons between tests, making it difficult to detect if a real difference exists between tests (i.e. how different tests perform in the same population). There were differences in populations, with assay medians being affected, for example, by race. It is not certain whether it is appropriate to make comparisons between populations which are inherently different.

  5. We were unable to perform the investigations of heterogeneity that we had originally intended to because the data simply were not available. The vast majority of papers looking at pregnancies conceived by IVF, affected by diabetes, multiple gestation or a family history of Down's syndrome involved unaffected pregnancies only.

In addition, the search for this review was last updated in August 2011, and it is possible that new studies may have been published which have not been included. Since the search was completed we have kept a watching brief on outputs and are not aware of any studies with substantial sample sizes which could substantially affect the findings.

Applicability of findings to the review question

Potentially, when planning screening policy or a clinical screening programme, clinicians and policy makers need to make decisions about a finite number of tests or type of tests that can be offered. These policies are often driven by both the needs of a specific population and by financial resources. Economic analysis was considered to be outside of the scope of this review. Many of the tests examined as part of this review are already commercially available and in use in the clinical setting. The studies were carried out on populations of typical pregnant women and therefore, the results should be considered comparable with most pregnant populations encountered in every day clinical practice.

We were unable to extract information about harms of testing, information about miscarriage rates and uptake of definitive testing as the data were not available the majority of the time. Whilst it is unlikely that major differences between the tests evaluated here exist in terms of direct harms of testing, as they are all based on a single blood sample, differences in accuracy may lead to differences in the use of definitive testing and its consequent adverse outcomes.

In some countries with a defined screening policy (i.e. the UK), first trimester screening plays a major role, although usually in combination with first trimester ultrasound scanning. In others however, there may only be a limited range of tests or markers available, often second trimester markers, rather than first trimester markers. The results of this review should be interpreted and applied in the context of test availability and local restrictions, populations or policies.

Methodological quality graph: review authors' judgements about each methodological quality item presented as percentages across all included studies.
Figures and Tables -
Figure 1

Methodological quality graph: review authors' judgements about each methodological quality item presented as percentages across all included studies.

Detection rates (sensitivity) at a 5% false positive rate for the nine selected test strategies. Each circle represents the summary sensitivity for a test strategy and the size of each circle is proportional to the number of Down's cases. The estimates are shown with 95% confidence intervals. The test strategies are ordered on the plot according to decreasing detection rate. The number of studies, cases and women included for each test strategy are shown on the horizontal axis.A = Age, PlGF, PAPP‐A and free ßhCG; B = Age, PAPP‐A, free ßhCG and AFP; C = Age, ADAM 12, PAPP‐A and free ßhCG; D = Age, PAPP‐A and free ßhCG ; E = Age, PAPP‐A; F = PAPP‐A; G = Age, free ßhCG and AFP ; H = Age, free ßhCG; I = Free ßhCG
Figures and Tables -
Figure 2

Detection rates (sensitivity) at a 5% false positive rate for the nine selected test strategies. Each circle represents the summary sensitivity for a test strategy and the size of each circle is proportional to the number of Down's cases. The estimates are shown with 95% confidence intervals. The test strategies are ordered on the plot according to decreasing detection rate. The number of studies, cases and women included for each test strategy are shown on the horizontal axis.

A = Age, PlGF, PAPP‐A and free ßhCG; B = Age, PAPP‐A, free ßhCG and AFP; C = Age, ADAM 12, PAPP‐A and free ßhCG; D = Age, PAPP‐A and free ßhCG ; E = Age, PAPP‐A; F = PAPP‐A; G = Age, free ßhCG and AFP ; H = Age, free ßhCG; I = Free ßhCG

1T PAPP‐A, 5% FPR.
Figures and Tables -
Test 1

1T PAPP‐A, 5% FPR.

1T PAPP‐A, ≤5th percentile.
Figures and Tables -
Test 2

1T PAPP‐A, ≤5th percentile.

1T PAPP‐A, mixed cut‐points.
Figures and Tables -
Test 3

1T PAPP‐A, mixed cut‐points.

1T free ßhCG, 5% FPR.
Figures and Tables -
Test 4

1T free ßhCG, 5% FPR.

1T total hCG, 5FPR.
Figures and Tables -
Test 5

1T total hCG, 5FPR.

1T AFP, 5% FPR.
Figures and Tables -
Test 6

1T AFP, 5% FPR.

1T AFP, mixed cut‐points.
Figures and Tables -
Test 10

1T AFP, mixed cut‐points.

1T Inhibin, 5FPR.
Figures and Tables -
Test 11

1T Inhibin, 5FPR.

1T ADAM 12, 5FPR.
Figures and Tables -
Test 12

1T ADAM 12, 5FPR.

1T SP1, 5% FPR.
Figures and Tables -
Test 13

1T SP1, 5% FPR.

ba_hcg_ratio, 0.25MoM.
Figures and Tables -
Test 17

ba_hcg_ratio, 0.25MoM.

1T uE3, 5% FPR.
Figures and Tables -
Test 18

1T uE3, 5% FPR.

1T PlGF, 5FPR.
Figures and Tables -
Test 19

1T PlGF, 5FPR.

1T PAPP‐A and 1T free ßhCG, 5% FPR.
Figures and Tables -
Test 20

1T PAPP‐A and 1T free ßhCG, 5% FPR.

1T PAPP‐A and 1T free ßhCG, mixed cut‐points.
Figures and Tables -
Test 21

1T PAPP‐A and 1T free ßhCG, mixed cut‐points.

1T PAPP‐A and 1T AFP, 5% FPR.
Figures and Tables -
Test 22

1T PAPP‐A and 1T AFP, 5% FPR.

1T PAPP‐A and 1T ITA, 3% FPR.
Figures and Tables -
Test 23

1T PAPP‐A and 1T ITA, 3% FPR.

1T PAPP‐A and 1T ITA, 5% FPR.
Figures and Tables -
Test 24

1T PAPP‐A and 1T ITA, 5% FPR.

1T free ßhCG and 1T Inhibin, 5% FPR.
Figures and Tables -
Test 25

1T free ßhCG and 1T Inhibin, 5% FPR.

1T free ßhCG and 1T AFP, 5% FPR.
Figures and Tables -
Test 26

1T free ßhCG and 1T AFP, 5% FPR.

1T PAPP‐A and 1T ITA, 10% FPR.
Figures and Tables -
Test 27

1T PAPP‐A and 1T ITA, 10% FPR.

1T PAPP‐A, 1T free ßhCG and 1T ITA, 5% FPR.
Figures and Tables -
Test 28

1T PAPP‐A, 1T free ßhCG and 1T ITA, 5% FPR.

1T PAPP‐A, 1T free ßhCG and 1T ITA,3% FPR.
Figures and Tables -
Test 29

1T PAPP‐A, 1T free ßhCG and 1T ITA,3% FPR.

1T PAPP‐A, 1T free ßhCG and 1T ITA, 10% FPR.
Figures and Tables -
Test 30

1T PAPP‐A, 1T free ßhCG and 1T ITA, 10% FPR.

1T total hCG, 1T free αhCG and 1T progesterone, 0.34 MoM.
Figures and Tables -
Test 31

1T total hCG, 1T free αhCG and 1T progesterone, 0.34 MoM.

Age, 1T Inhibin, risk 1:100.
Figures and Tables -
Test 32

Age, 1T Inhibin, risk 1:100.

Age, 1T Inhibin, risk 1:250.
Figures and Tables -
Test 33

Age, 1T Inhibin, risk 1:250.

Age, 1T Inhibin, risk 1:400.
Figures and Tables -
Test 34

Age, 1T Inhibin, risk 1:400.

Age, 1T Inhibin, 5FPR.
Figures and Tables -
Test 35

Age, 1T Inhibin, 5FPR.

Age, 1T Inhibin, mixed cut‐points.
Figures and Tables -
Test 36

Age, 1T Inhibin, mixed cut‐points.

Age, 1T PAPP‐A, 5FPR.
Figures and Tables -
Test 37

Age, 1T PAPP‐A, 5FPR.

Age, 1T PAPP‐A, mixed cut‐points.
Figures and Tables -
Test 38

Age, 1T PAPP‐A, mixed cut‐points.

Age, 1T free ßhCG, 5FPR.
Figures and Tables -
Test 39

Age, 1T free ßhCG, 5FPR.

Age, 1T free ßhCG, risk 1:384.
Figures and Tables -
Test 40

Age, 1T free ßhCG, risk 1:384.

Age, 1T free ßhCG, mixed cut‐points.
Figures and Tables -
Test 41

Age, 1T free ßhCG, mixed cut‐points.

Age, 1T total hCG,risk 1:384.
Figures and Tables -
Test 42

Age, 1T total hCG,risk 1:384.

Age, 1T total hCG, mixed cut‐points.
Figures and Tables -
Test 43

Age, 1T total hCG, mixed cut‐points.

Age, 1T AFP, 5FPR.
Figures and Tables -
Test 44

Age, 1T AFP, 5FPR.

Age, 1T AFP, risk1:384.
Figures and Tables -
Test 45

Age, 1T AFP, risk1:384.

Age, 1T AFP,mixed cut‐points.
Figures and Tables -
Test 46

Age, 1T AFP,mixed cut‐points.

Age, 1T uE3, risk 1:384.
Figures and Tables -
Test 47

Age, 1T uE3, risk 1:384.

Age, 1T uE3, mixed cut‐points.
Figures and Tables -
Test 48

Age, 1T uE3, mixed cut‐points.

Age, 1T free αhCG, risk 1:384.
Figures and Tables -
Test 49

Age, 1T free αhCG, risk 1:384.

Age, 1T SP1, 5FPR.
Figures and Tables -
Test 50

Age, 1T SP1, 5FPR.

Age, 1T ProMBP, risk 1:250.
Figures and Tables -
Test 51

Age, 1T ProMBP, risk 1:250.

Age, 1T ITA, 5FPR.
Figures and Tables -
Test 52

Age, 1T ITA, 5FPR.

Age, 1T ADAM 12, risk 1:400.
Figures and Tables -
Test 53

Age, 1T ADAM 12, risk 1:400.

Age, 1T PAPP‐A and 1T free ßhCG, risk 1:250.
Figures and Tables -
Test 54

Age, 1T PAPP‐A and 1T free ßhCG, risk 1:250.

Age, 1T PAPP‐A and 1T free ßhCG, 5FPR.
Figures and Tables -
Test 55

Age, 1T PAPP‐A and 1T free ßhCG, 5FPR.

Age, 1T PAPP‐A and 1T free ßhCG, mixed cut‐points.
Figures and Tables -
Test 56

Age, 1T PAPP‐A and 1T free ßhCG, mixed cut‐points.

Age, 1T PAPP‐A and 1T free ßhCG, mixed cut‐points without 5FPR.
Figures and Tables -
Test 57

Age, 1T PAPP‐A and 1T free ßhCG, mixed cut‐points without 5FPR.

Age, 1T total hCG and 1T PAPP‐A, 5FPR.
Figures and Tables -
Test 58

Age, 1T total hCG and 1T PAPP‐A, 5FPR.

Age, 1T PAPP‐A and 1T Inhibin, risk 1:100.
Figures and Tables -
Test 59

Age, 1T PAPP‐A and 1T Inhibin, risk 1:100.

Age, 1T PAPP‐A and 1T Inhibin, risk 1:250.
Figures and Tables -
Test 60

Age, 1T PAPP‐A and 1T Inhibin, risk 1:250.

Age, 1T PAPP‐A and 1T Inhibin, risk 1:400.
Figures and Tables -
Test 61

Age, 1T PAPP‐A and 1T Inhibin, risk 1:400.

Age, 1T PAPP‐A and 1T Inhibin, 5FPR.
Figures and Tables -
Test 62

Age, 1T PAPP‐A and 1T Inhibin, 5FPR.

Age, 1T PAPP‐A and 1T Inhibin, mixed cut‐points.
Figures and Tables -
Test 63

Age, 1T PAPP‐A and 1T Inhibin, mixed cut‐points.

Age, 1T PAPP‐A and 1T ITA, 5FPR.
Figures and Tables -
Test 64

Age, 1T PAPP‐A and 1T ITA, 5FPR.

Age, 1T PAPP‐A and 1T AFP, 5FPR.
Figures and Tables -
Test 65

Age, 1T PAPP‐A and 1T AFP, 5FPR.

Age, 1T free ßhCG and 1T Inhibin, risk 1:100.
Figures and Tables -
Test 66

Age, 1T free ßhCG and 1T Inhibin, risk 1:100.

Age, 1T free ßhCG and 1T Inhibin, risk 1:250.
Figures and Tables -
Test 67

Age, 1T free ßhCG and 1T Inhibin, risk 1:250.

Age, 1T free ßhCG and 1T Inhibin, risk 1:400.
Figures and Tables -
Test 68

Age, 1T free ßhCG and 1T Inhibin, risk 1:400.

Age, 1T free ßhCG and 1T Inhibin, 5FPR.
Figures and Tables -
Test 69

Age, 1T free ßhCG and 1T Inhibin, 5FPR.

Age, 1T free ßhCG and 1T Inhibin, mixed cut‐points.
Figures and Tables -
Test 70

Age, 1T free ßhCG and 1T Inhibin, mixed cut‐points.

Age, 1T free ßhCG and 1T AFP, 5FPR.
Figures and Tables -
Test 71

Age, 1T free ßhCG and 1T AFP, 5FPR.

Age, 1T free ßhCG and 1T AFP, risk 1:250.
Figures and Tables -
Test 72

Age, 1T free ßhCG and 1T AFP, risk 1:250.

Age, 1T free ßhCG and 1T AFP, risk 1:384.
Figures and Tables -
Test 73

Age, 1T free ßhCG and 1T AFP, risk 1:384.

Age, 1T free ßhCG and 1T AFP, mixed cut‐points.
Figures and Tables -
Test 74

Age, 1T free ßhCG and 1T AFP, mixed cut‐points.

Age, 1T AFP and 1T uE3, risk 1:384.
Figures and Tables -
Test 75

Age, 1T AFP and 1T uE3, risk 1:384.

Age, 1T AFP and 1T free αhCG, risk 1:384.
Figures and Tables -
Test 76

Age, 1T AFP and 1T free αhCG, risk 1:384.

Age, 1T free ßhCG and 1T total hCG, risk 1:384.
Figures and Tables -
Test 77

Age, 1T free ßhCG and 1T total hCG, risk 1:384.

Age, 1T free ßhCG and 1T uE3, risk 1:384.
Figures and Tables -
Test 78

Age, 1T free ßhCG and 1T uE3, risk 1:384.

Age, 1T free ßhCG and 1T uE3, 5FPR.
Figures and Tables -
Test 79

Age, 1T free ßhCG and 1T uE3, 5FPR.

Age, 1T free ßhCG and 1T uE3, mixed cut‐points.
Figures and Tables -
Test 80

Age, 1T free ßhCG and 1T uE3, mixed cut‐points.

Age, 1T free ßhCG and 1T SP1, 5FPR.
Figures and Tables -
Test 81

Age, 1T free ßhCG and 1T SP1, 5FPR.

Age, 1T free ßhCG and 1T SP1 risk 1:250.
Figures and Tables -
Test 82

Age, 1T free ßhCG and 1T SP1 risk 1:250.

Age, 1T AFP and 1T total hCG, 1:384.
Figures and Tables -
Test 83

Age, 1T AFP and 1T total hCG, 1:384.

Age, 1T free ßhCG and 1T free αhCG, risk 1:384.
Figures and Tables -
Test 84

Age, 1T free ßhCG and 1T free αhCG, risk 1:384.

Age, 1T total hCG and 1T uE3, risk 1:384.
Figures and Tables -
Test 85

Age, 1T total hCG and 1T uE3, risk 1:384.

Age, 1T total hCG and 1T Inhibin, 5FPR.
Figures and Tables -
Test 86

Age, 1T total hCG and 1T Inhibin, 5FPR.

Age, 1T total hCG and 1T free αhCG, risk 1:384.
Figures and Tables -
Test 87

Age, 1T total hCG and 1T free αhCG, risk 1:384.

Age, 1T uE3 and 1T free αhCG, risk 1:384.
Figures and Tables -
Test 88

Age, 1T uE3 and 1T free αhCG, risk 1:384.

Age, 1T PAPP‐A, 1T free ßhCG and 1T AFP, 5FPR.
Figures and Tables -
Test 89

Age, 1T PAPP‐A, 1T free ßhCG and 1T AFP, 5FPR.

Age, 1T PAPP‐A, 1T free ßhCG and 1T AFP, mixed cut‐points.
Figures and Tables -
Test 90

Age, 1T PAPP‐A, 1T free ßhCG and 1T AFP, mixed cut‐points.

Age, 1T free ßhCG, 1T AFP and 1T uE3, 5FPR.
Figures and Tables -
Test 91

Age, 1T free ßhCG, 1T AFP and 1T uE3, 5FPR.

Age, 1T free ßhCG, 1T AFP and 1T uE3, risk 1:384.
Figures and Tables -
Test 92

Age, 1T free ßhCG, 1T AFP and 1T uE3, risk 1:384.

Age, 1T free ßhCG, 1T AFP and 1T uE3, mixed cut‐points.
Figures and Tables -
Test 93

Age, 1T free ßhCG, 1T AFP and 1T uE3, mixed cut‐points.

Age, 1T total hCG, 1T AFP and 1T uE3, risk 1:384.
Figures and Tables -
Test 94

Age, 1T total hCG, 1T AFP and 1T uE3, risk 1:384.

Age, 1T total hCG, 1T AFP and 1T uE3, mixed cut‐points.
Figures and Tables -
Test 95

Age, 1T total hCG, 1T AFP and 1T uE3, mixed cut‐points.

Age, 1T AFP, free αhCG and 1T uE3, risk 1:384.
Figures and Tables -
Test 96

Age, 1T AFP, free αhCG and 1T uE3, risk 1:384.

Age, 1T PAPP‐A, 1T free ßhCG and 1T Inhibin, 5FPR.
Figures and Tables -
Test 97

Age, 1T PAPP‐A, 1T free ßhCG and 1T Inhibin, 5FPR.

Age, 1T PAPP‐A, 1T total hCG and 1T Inhibin, 5FPR.
Figures and Tables -
Test 98

Age, 1T PAPP‐A, 1T total hCG and 1T Inhibin, 5FPR.

Age, 1T PAPP‐A, sp1 and 1T ProMBP, 5FPR.
Figures and Tables -
Test 99

Age, 1T PAPP‐A, sp1 and 1T ProMBP, 5FPR.

Age, 1T PAPP‐A, sp1 and 1T ProMBP, risk 1:250.
Figures and Tables -
Test 100

Age, 1T PAPP‐A, sp1 and 1T ProMBP, risk 1:250.

Age, 1T free ßhCG, 1T total hCG, 1T AFP and 1T uE3, risk 1:384.
Figures and Tables -
Test 101

Age, 1T free ßhCG, 1T total hCG, 1T AFP and 1T uE3, risk 1:384.

Age, 1T total hCG, 1T AFP, 1T uE3 and 1T free αhCG, risk 1:384.
Figures and Tables -
Test 102

Age, 1T total hCG, 1T AFP, 1T uE3 and 1T free αhCG, risk 1:384.

Age, 1T PAPP‐A, 1T free ßhCG, 1T AFP, 1T uE3 and 1T Inhibin, 5FPR.
Figures and Tables -
Test 103

Age, 1T PAPP‐A, 1T free ßhCG, 1T AFP, 1T uE3 and 1T Inhibin, 5FPR.

Age, 1T PAPP‐A, 1T total hCG, 1T AFP, 1T uE3 and 1T Inhibin, 5FPR.
Figures and Tables -
Test 104

Age, 1T PAPP‐A, 1T total hCG, 1T AFP, 1T uE3 and 1T Inhibin, 5FPR.

Age, 1T free ßhCG, 1T total hCG, 1T AFP, 1T uE3 and 1T free αhCG, risk 1:384.
Figures and Tables -
Test 105

Age, 1T free ßhCG, 1T total hCG, 1T AFP, 1T uE3 and 1T free αhCG, risk 1:384.

Age, 1T hPL, risk 1:250.
Figures and Tables -
Test 106

Age, 1T hPL, risk 1:250.

Age, 1T hPL, 1T PAPP‐A, risk 1:250.
Figures and Tables -
Test 107

Age, 1T hPL, 1T PAPP‐A, risk 1:250.

Age, 1T hPL, 1T free ßhCG, risk 1:250.
Figures and Tables -
Test 108

Age, 1T hPL, 1T free ßhCG, risk 1:250.

Age, 1T hPL, 1T PAPP‐A, 1T free ßhCG, risk 1:250.
Figures and Tables -
Test 109

Age, 1T hPL, 1T PAPP‐A, 1T free ßhCG, risk 1:250.

Age, 1T PGH, risk 1:250.
Figures and Tables -
Test 110

Age, 1T PGH, risk 1:250.

Age, 1T PGH, 1T PAPP‐A , risk 1:250.
Figures and Tables -
Test 111

Age, 1T PGH, 1T PAPP‐A , risk 1:250.

Age, 1T PGH, 1T free ßhCG , risk 1:250.
Figures and Tables -
Test 112

Age, 1T PGH, 1T free ßhCG , risk 1:250.

Age, 1T PGH, 1T PAPP‐A, 1T free ßhCG , risk 1:250.
Figures and Tables -
Test 113

Age, 1T PGH, 1T PAPP‐A, 1T free ßhCG , risk 1:250.

Age, 1T GHBP, risk 1:250.
Figures and Tables -
Test 114

Age, 1T GHBP, risk 1:250.

Age, 1T GHBP, 1T PAPP‐A, risk 1:250.
Figures and Tables -
Test 115

Age, 1T GHBP, 1T PAPP‐A, risk 1:250.

Age, 1T GHBP, 1T free ßhCG, risk 1:250.
Figures and Tables -
Test 116

Age, 1T GHBP, 1T free ßhCG, risk 1:250.

Age, 1T GHBP, 1T PGH, risk 1:250.
Figures and Tables -
Test 117

Age, 1T GHBP, 1T PGH, risk 1:250.

Age, 1T GHBP, 1T PAPP‐A, 1T free ßhCG , risk 1:250.
Figures and Tables -
Test 118

Age, 1T GHBP, 1T PAPP‐A, 1T free ßhCG , risk 1:250.

Age, 1T GHBP, 1T PGH, 1T PAPP‐A, 1T free ßhCG , risk 1:250.
Figures and Tables -
Test 119

Age, 1T GHBP, 1T PGH, 1T PAPP‐A, 1T free ßhCG , risk 1:250.

Age, 1T ADAM 12, risk 1:250.
Figures and Tables -
Test 120

Age, 1T ADAM 12, risk 1:250.

Age, 1T ADAM 12, 1T PAPP‐A, 1T free ßhCG, risk 1:250.
Figures and Tables -
Test 121

Age, 1T ADAM 12, 1T PAPP‐A, 1T free ßhCG, risk 1:250.

Age, PlGF, 1T PAPP‐A, 1T free ßhCG, 5FPR.
Figures and Tables -
Test 122

Age, PlGF, 1T PAPP‐A, 1T free ßhCG, 5FPR.

Age, 1T PAPP‐A and 1T free ßhCG, risk 1:300.
Figures and Tables -
Test 123

Age, 1T PAPP‐A and 1T free ßhCG, risk 1:300.

Age, 1T PAPP‐A, 1T Hyperglycosylated hCG, 5FPR.
Figures and Tables -
Test 124

Age, 1T PAPP‐A, 1T Hyperglycosylated hCG, 5FPR.

Age, ADAM 12, 1T PAPP‐A, 5FPR.
Figures and Tables -
Test 128

Age, ADAM 12, 1T PAPP‐A, 5FPR.

Age, ADAM 12, 1T PAPP‐A, 1T free ßhCG, 5FPR.
Figures and Tables -
Test 129

Age, ADAM 12, 1T PAPP‐A, 1T free ßhCG, 5FPR.

Age, 1T PlGF, 5FPR.
Figures and Tables -
Test 130

Age, 1T PlGF, 5FPR.

1T PlGF, 1T PAPP‐A, 1T free ßhCG, 5FPR.
Figures and Tables -
Test 131

1T PlGF, 1T PAPP‐A, 1T free ßhCG, 5FPR.

Age, 1T ADAM 12, 1T PAPP‐A, 1T free ßhCG, mixed cut‐points.
Figures and Tables -
Test 132

Age, 1T ADAM 12, 1T PAPP‐A, 1T free ßhCG, mixed cut‐points.

Age, 1T PAPP‐A, 1T free ßhCG and 1T Inhibin, risk 1:250.
Figures and Tables -
Test 133

Age, 1T PAPP‐A, 1T free ßhCG and 1T Inhibin, risk 1:250.

Age, 1T PAPP‐A, 1T free ßhCG, and 1T Inhibin, mixed cut‐points.
Figures and Tables -
Test 134

Age, 1T PAPP‐A, 1T free ßhCG, and 1T Inhibin, mixed cut‐points.

Summary of findings 1. Performance of first trimester serum test strategies with or without maternal age

Review Question

What is the accuracy of serum‐based markers for Down's syndrome screening in the first trimester?

Population

Pregnant women at less than 14 weeks' gestation confirmed by ultrasound, who had not undergone previous testing for Down’s syndrome. Most studies were undertaken in women identified to be high risk based on maternal age

Settings

All settings

Numbers of studies, pregnancies and Down's syndrome cases

56 studies (68 publications) involving 204,759 pregnancies of which 2113 were Down's syndrome pregnancies.

Index tests

18 serum markers (ADAM12, AFP, inhibin, PAPP‐A, ITA, free βhCG, PlGF, SP1, total hCG, progesterone, uE3, GHBP, PGH, hyperglycosylated hCG, ProMBP, hPL, free αhCG, and free ßhCG to AFP ratio) singly or in combination with or without maternal age.

Reference standards

Chromosomal verification (amniocentesis and CVS undertaken during pregnancy, and postnatal karyotyping) and postnatal macroscopic inspection.

Study limitations

35 studies used selective chromosomal verification during pregnancy, and were at risk of under‐ascertainment of Down's syndrome cases due loss of the pregnancy to miscarriage between the serum test and the reference standard.

Tests with at least 70% sensitivity and at least 95% specificity

Test strategy

Studies

Women (cases)

Sensitivity (95% CI)

Specificity (95% CI)

Test*

Test strategies (with or without maternal age) evaluated by a single study

Without maternal age

Double tests

PAPP‐A and AFP

1

96 (16)

81 (54 to 96)

95 (88 to 99)

PAPP‐A and ITA

1

344 (24)

71 (49 to 87)

95 (92 to 97)

Triple tests

PAPP‐A, free ßhCG and ITA

1

344 (24)

75 (53 to 90)

95 (92 to 97)

PlGF, PAPP‐A and free ßhCG

1

699 (90)

72 (62 to 81)

95 (93 to 97)

With maternal age

Double tests

Free ßhCG and SP1

1

60 (14)

71 (42 to 92)

96 (85 to 99)

PAPP‐A and Hyperglycosylated hCG

1

10775 (23)

74 (52 to 90)

95 (95 to 95)

Triple tests

PAPP‐A, free ßhCG and Inhibin

1

1110 (85)

74 (63 to 83)

95 (94 to 96)

PAPP‐A, SP1 and ProMBP

1

192 (15)

73 (45 to 92)

95 (91 to 98)

hPL, PAPP‐A and free ßhCG (1:250 risk)

1

183 (47)

77 (62 to 88)

95 (90 to 98)

Quadruple tests

GHBP, PGH, PAPP‐A and free ßhCG (1:250 risk)

1

335 (74)

76 (64 to 85)

95 (91 to 97)

Quintuple tests

PAPP‐A, free ßhCG, AFP, uE3 and Inhibin

1

1110 (85)

78 (67 to 86)

95 (94 to 96)

PAPP‐A, total hCG, AFP, uE3 and Inhibin

1

1110 (85)

73 (62 to 82)

95 (94 to 96)

Test strategies (with or without maternal age) evaluated by at least two studies

Free ßhCG

4

4280 (390)

25 (18 to 34)

95 (94 to 96)

P < 0.001

PAPP‐A

4

2837 (325)

52 (39 to 65)

95 (94 to 96)

Age, free ßhCG

7

5893 (460)

42 (36 to 48)

95 (94 to 96)

Age, PAPP‐A

5

3491 (359)

55 (46 to 63)

95 (94 to 96)

Age, free ßhCG and AFP

3

2992 (157)

49 (39 to 60)

95 (94 to 96)

Age, PAPP‐A and free ßhCG

17

49827 (1037)

68 (65 to 71)

95 (95 to 95)

Age, PAPP‐A, free ßhCG and AFP

2

2705 (116)

74 (65 to 81)

95 (94 to 96)

Age, ADAM 12, PAPP‐A and free ßhCG

2

1222 (74)

74 (63 to 83)

95 (94 to 96)

Age, PlGF, PAPP‐A and free ßhCG

2

1144 (160)

76 (69 to 82)

95 (93 to 96)

*Likelihood ratio test for the difference in sensitivity between the nine test strategies that were formally compared in a single meta‐analytic model.

ADAM12: a disintegrin and metalloprotease; AFP: alpha‐fetoprotein; αhCG: alpha human chorionic gonadotrophin; ßhCG: beta human chorionic gonadotrophin; CI: confidence interval; CVS: chorionic villus sampling; GHBP: growth hormone binding protein; hCG: human chorionic gonadotrophin; hPL: human placental lactogen; ITA: invasive trophoblast antigen; PAPP‐A: pregnancy‐associated plasma protein A; PGH: placental growth hormone; PIGF: placental growth factor; PROMBP: proform of eosinophil major basic protein; SPI: Schwangerschafts protein 1; uE3: unconjugated oestriol

Figures and Tables -
Summary of findings 1. Performance of first trimester serum test strategies with or without maternal age
Summary of findings 2. Performance of other first trimester serum test strategies with or without maternal age

Test strategy

Studies

Women (cases)

Sensitivity (95% CI)

Specificity (95% CI)

Threshold

Without maternal age

Single tests

AFP

2

2248 (104)

10 (4 to 21)

95

5% FPR

ADAM 12

1

579 (17)

41 (18 to 67)

95 (93 to 97)

5% FPR

Free ßhCG to AFP ratio

1

476 (9)

11 (0 to 48)

98 (96 to 99)

0.25 MoM

Inhibin

3

2098 (184)

19 (4 to 58)

95

5% FPR

PlGF

1

699 (90)

28 (19 to 38)

95 (93 to 97)

5% FPR

Total hCG

3

2098 (184)

19 (4 to 58)

95

5% FPR

SP1

3

1080 (53)

32 (1 to 96)

95

5% FPR

uE3

1

1110 (85)

13 (7 to 22)

95 (94 to 96)

5% FPR

Double tests

Free ßhCG and AFP

1

1138 (19)

16 (3 to 40)

95 (94 to 96)

5% FPR

Free ßhCG and Inhibin

1

876 (76)

30 (20 to 42)

95 (93 to 96)

5% FPR

PAPP‐A and free ßhCG

2

795 (106)

64 (50 to 76)

95

5% FPR

Triple tests

Total hCG, free αhCG and progesterone

1

129 (17)

53 (28 to 77)

96 (90 to 99)

0.34 MoM

With maternal age

Single tests

ADAM 12

2

703 (46)

67 (46 to 83)

91 (87 to 94)

1:400 risk

AFP

2

1397 (126)

33 (23 to 46)

95

5% FPR

Free αhCG

1

512 (12)

25 (5 to 57)

89 (86 to 91)

1:384 risk

GHBP

1

335 (74)

27 (17 to 39)

95 (91 to 97)

1:250 risk

hPL

1

183 (47)

45 (30 to 60)

93 (88 to 97)

1:250 risk

Inhibin

1

1110 (85)

32 (22 to 43)

95 (94 to 96)

5% FPR

ITA

1

278 (54)

48 (34 to 62)

95 (91 to 98)

5% FPR

PGH

1

335 (74)

41 (29 to 53)

94 (91 to 97)

1:250 risk

PlGF

1

699 (90)

43 (33 to 54)

95 (93 to 97)

5% FPR

ProMBP

1

181 (25)

36 (18 to 57)

94 (89 to 97)

1:250 risk

SP1

2

804 (29)

38 (22 to 56)

95

5% FPR

Total hCG

1

512 (12)

33 (10 to 65)

94 (92 to 96)

1:384 risk

uE3

1

512 (12)

33 (10 to 65)

86 (83 to 89)

1:384 risk

Double tests

ADAM 12 and PAPP‐A

1

691 (46)

61 (45 to 75)

95 (93 to 97)

5% FPR

AFP and free αhCG

1

512 (12)

33 (10 to 65)

87 (83 to 89)

1:384 risk

AFP and total hCG

1

512 (12)

33 (10 to 65)

93 (90 to 95)

1:384 risk

AFP and uE3

1

512 (12)

42 (15 to 72)

87 (84 to 90)

1:384 risk

Free ßhCG and free αhCG

1

512 (12)

42 (15 to 72)

94 (91 to 96)

1:384 risk

Free ßhCG and Inhibin

1

1110 (85)

44 (33 to 55)

95 (94 to 96)

5% FPR

Free ßhCG and total hCG

1

512 (12)

25 (5 to 57)

93 (90 to 95)

1:384 risk

Free ßhCG and uE3

1

287 (41)

61 (45 to 76)

95 (92 to 97)

5% FPR

GHBP and free ßhCG

1

335 (74)

61 (49 to 72)

92 (88 to 95)

1:250 risk

GHBP and PAPP‐A

1

335 (74)

66 (54 to 77)

93 (89 to 96)

1:250 risk

GHBP and PGH

1

335 (74)

47 (36 to 59)

93 (90 to 96)

1:250 risk

hPL and free ßhCG

1

183 (47)

68 (53 to 81)

94 (89 to 97)

1:250 risk

hPL and PAPP‐A

1

183 (47)

55 (40 to 70)

94 (89 to 97)

1:250 risk

PAPP‐A and AFP

2

2705 (116)

63 (50 to 74)

95

5% FPR

PAPP‐A and Inhibin

1

1110 (85)

68 (57 to 78)

95 (94 to 96)

5% FPR

PAPP‐A and ITA

2

622 (78)

62 (46 to 75)

95

5% FPR

PGH and free ßhCG

1

335 (74)

64 (52 to 74)

93 (89 to 96)

1:250 risk

PGH and PAPP‐A

1

335 (74)

65 (53 to 76)

93 (89 to 96)

1:250 risk

Total hCG and free αhCG

1

512 (12)

42 (15 to 72)

92 (89 to 94)

1:384 risk

Total hCG and Inhibin

1

1110 (85)

34 (24 to 45)

95 (94 to 96)

5% FPR

Total hCG and PAPP‐A

2

4327 (133)

66 (54 to 76)

95

5% FPR

Total hCG and uE3

1

512 (12)

42 (15 to 72)

92 (89 to 94)

1:384 risk

uE3 and free αhCG

1

512 (12)

33 (10 to 65)

89 (86 to 91)

1:384 risk

Triple tests

AFP, free αhCG and uE3

1

512 (12)

58 (28 to 85)

82 (79 to 85)

1:384 risk

Free ßhCG, AFP and uE3

1

287 (41)

66 (49 to 80)

95 (92 to 97)

5% FPR

GHBP, PAPP‐A and free ßhCG

1

335 (74)

76 (64 to 85)

94 (91 to 97)

1:250 risk

PAPP‐A, total hCG and Inhibin

1

1110 (85)

69 (58 to 79)

95 (94 to 96)

5% FPR

PGH, PAPP‐A and free ßhCG

1

335 (74)

76 (64 to 85)

94 (91 to 97)

1:250 risk

Total hCG, AFP and uE3

1

512 (12)

42 (15 to 72)

91 (88 to 94)

1:384 risk

Quadruple tests

Free ßhCG, total hCG, AFP and uE3

1

512 (12)

50 (21 to 79)

92 (89 to 94)

1:384 risk

Total hCG, AFP, uE3 and free αhCG

1

512 (12)

50 (21 to 79)

90 (87 to 92)

1:384 risk

Quintuple tests

Free ßhCG, total hCG, AFP, uE3 and free αhCG

1

512 (12)

33 (10 to 65)

90 (87 to 92)

1:384 risk

AFP: alpha‐fetoprotein; αhCG: alpha human chorionic gonadotrophin; ßhCG: beta human chorionic gonadotrophin; CI: confidence interval; FPR: false positive rate; GHBP: growth hormone binding protein; hCG: human chorionic gonadotrophin; hPL: human placental lactogen; ITA: invasive trophoblast antigen; PAPP‐A: pregnancy‐associated plasma protein A; PGH: placental growth hormone; PIGF: placental growth factor; PROMBP: proform of eosinophil major basic protein; SPI: Schwangerschafts protein 1; uE3: unconjugated oestriol

Figures and Tables -
Summary of findings 2. Performance of other first trimester serum test strategies with or without maternal age
Table 1. Direct comparisons of the sensitivity of nine test strategies at the 5% false positive rate

Ratio of sensitivity

(95% CI), P value for comparison (studies)

Free ßhCG

PAPP‐A

Age, free ßhCG

Age, PAPP‐A

Age, PAPP‐A , free ßhCG

Age, free ßhCG, AFP

Age, ADAM 12, PAPP‐A, free ßhCG

Age, PAPP‐A, free ßhCG, AFP

PAPP‐A

1.78 (1.10 to 2.88), P = 0.02

(2)

Age, free ßhCG

1.67 (1.11 to 2.50). P = 0.013

(2)

0.94 (0.68 to 1.29), P = 0.70

(2)

Age, PAPP‐A

2.15 (1.37 to 3.38), P = 0.001

(2)

1.20 (0.86 to 1.67), P = 0.29

(3)

1.26 (1.02 to 1.57), P = 0.034

(4)

Age, PAPP‐A , free ßhCG

2.62 (1.77 to 3.87), P < 0.001

(2)

1.47 (1.09 to 2.00), P = 0.012

(2)

1.61 (1.31 tp 1.98), P < 0.001

(5)

1.26 (1.04 to 1.52), P = 0.02

(4)

Age, free ßhCG, AFP

2.19 (1.31 to 3.64), P = 0.002

(1)

0.71 (0.52 to 0.98), P = 0.03

(1)

1.08 (0.80 to 1.46), P = 0.62

(2)

0.61 (0.46 to 0.82), P < 0.001

(1)

0.63 (0.47 to 0.86), P = 0.004

(2)

Age, ADAM 12, PAPP‐A, free ßhCG

1.04 (0.85 to 1.26), P = 0.71

(2)

Age, PAPP‐A, free ßhCG, AFP

3.94 (2.49 to 6.23), P < 0.001

(1)

1.29 (1.03 to 1.60), P = 0.024

(1)

1.91 (1.42 to 2.56), P < 0.001

(1)

1.11 (0.91 to 1.34), P = 0.31

(1)

1.02 (0.88 to 1.20), P = 0.77

(2)

1.62 (1.19 to 2.19), P = 0.002

(2)

Age, PlGF, PAPP‐A, free ßhCG

1.03 (0.91 to 1.17), P = 0.61

(2)

— indicates that no comparative study was available for the pair of tests.

Direct comparisons were made only using data from studies which compared each pair of tests on the same women. Where there were at least two studies, meta‐analysis was performed to summarise and compare the sensitivities. The ratio of sensitivities was computed by division of the sensitivity for the column by the sensitivity for the row. If the ratio of sensitivity is greater than one then the sensitivity of the test for the column is higher than that for the row, if less than one the sensitivity of the test in the row is higher than in the column. All test comparisons that were evaluated by only one study were from Wald 2003. The ratio of the sensitivities for test comparisons from a single study were calculated as a ratio of two proportions.

ADAM12: a disintegrin and metalloprotease; AFP: alpha‐fetoprotein; ßhCG: beta human chorionic gonadotrophin; CI: confidence interval; PAPP‐A: pregnancy‐associated plasma protein A; PIGF: placental growth factor; PROMBP: proform of eosinophil major basic protein

Figures and Tables -
Table 1. Direct comparisons of the sensitivity of nine test strategies at the 5% false positive rate
Table 2. Indirect comparisons of the sensitivity of nine test strategies at the 5% false positive rate

Ratio of sensitivity

(95% CI), P value for comparison

Free ßhCG

PAPP‐A

Age, free ßhCG

Age, PAPP‐A

Age, PAPP‐A, free ßhCG

Age, free ßhCG, AFP

Age, ADAM 12, PAPP‐A, free ßhCG

Age, PAPP‐A, free ßhCG, AFP

Studies (cases/ women)

4

(390/4280)

4

(325/2837)

7

(460/5893)

5

(359/3491)

17

(1037/49827)

3

(157/2992)

2

(74/1222)

2

(116/2705)

Studies (cases/ women)

Sensitivity % (95% CI)

25 (18 to 34)

52 (39 to 65)

42 (36 to 48)

55 (46 to 63)

68 (65 to 71)

49 (39 to 60)

74 (63 to 83)

74 (65 to 81)

PAPP‐A

4

(325/2837)

52 (39 to 65)

2.05

(1.37 to 3.09), P = 0.001

Age, free ßhCG

7

(460/5893)

42 (36 to 48)

1.66

(1.17 to 2.36), P = 0.004

0.81

(0.61 to 1.08), P = 0.15

Age, PAPP‐A

5

(359/3491)

55 (46 to 63)

2.16

(1.51 to 3.10), P < 0.001

1.05

(0.78 to 1.42), P = 0.73

1.30

(1.05 to 1.61), P = 0.015

Age, PAPP‐A, free ßhCG

17

(1037/49827)

68 (65 to 71)

2.70

(1.95 to 3.73), P < 0.001

1.31

(1.02 to 1.70), P = 0.037

1.62

(1.40 to 1.88), P < 0.001

1.25

(1.05 to 1.47), P = 0.01

Age, free ßhCG, AFP

3

(157/2992)

49 (39 to 60)

1.95

(1.33 to 2.86), P = 0.001

0.95

(0.69 to 1.32), P = 0.76

1.18

(0.92 to 1.51), P = 0.20

0.90

(0.69 to 1.17), P = 0.45

0.72

(0.59 to 0.89), P = 0.003

Age, ADAM 12, PAPP‐A, free ßhCG

2

(74/1222)

74 (63 to 83)

2.94

(2.07 to 4.16), P < 0.001

1.43

(1.07 to 1.90), P = 0.014

1.77

(1.46 to 2.14), P < 0.001

1.36

(1.10 to 1.67), P = 0.004

1.09

(0.95 to 1.25), P = 0.24

1.50

(1.17 to 1.92), P = 0.001

Age, PAPP‐A, free ßhCG, AFP

2

(116/2705)

74 (65 to 81)

2.93

(2.09 to 4.11), P < 0.001

1.43

(1.08 to 1.88), P = 0.011

1.76

(1.48 to 2.10), P < 0.001

1.35

(1.11 to 1.64), P = 0.002

1.09

(0.97 to 1.22), P = 0.16

1.50

(1.19, to 1.89).

P = 0.001

1.00

(0.84 to 1.18), P = 0.98

Age, PlGF, PAPP‐A, free ßhCG

2

(160/1144)

76 (69 to 82)

3.01

(2.16 to 4.20), P < 0.001

1.47

(1.12 to 1.91), P = 0.005

1.81

(1.54 to 2.14), P < 0.001

1.39

(1.16 to 1.67), P < 0.001

1.12

(1.01 to 1.23), P = 0.024

1.54

(1.23 to 1.93), P < 0.001

1.03

(0.87 to 1.20), P = 0.75

1.03

(0.90 to 1.18), P = 0.7

Ratio of sensitivities were computed by division of the sensitivity for the column by the sensitivity for the row. If the ratio of sensitivity is greater than one then the sensitivity of the test for the column is higher than that for the row, if less than one the sensitivity of the test in the row is higher than in the column.

AFP: alpha‐fetoprotein; αhCG: alpha human chorionic gonadotrophin; ßhCG: beta human chorionic gonadotrophin; CI: confidence interval; PAPP‐A: Pregnancy‐associated plasma protein A

Figures and Tables -
Table 2. Indirect comparisons of the sensitivity of nine test strategies at the 5% false positive rate
Table 3. Summary of study characteristics

Study

PAPP‐A, free ßhCG and age*

Maternal age (years)

Reference standard

Population

Study design

Study location

Baviera 2010

Mean 35.3 for Down's cases, 30.4 for control

Amniocentesis or follow‐up to birth

Routine screening

Case‐control

Italy

Benattar 1999

Mean 32 (16‐46), 8.3% > 35

Amniocentesis due to maternal age > 38 years (6.1% or women). Karyotyping encouraged for women with positive result on one or more index test. No details of reference standard for index test negative women

Routine screening

Prospective cohort

France

Biagiotti 1995

Not reported

Amniocentesis or CVS

High‐risk referral for invasive testing

Case‐control

Italy

Biagiotti 1998

X

Unclear (maybe all ≥ 38)

Amniocentesis or CVS

High‐risk referral for invasive testing

Retrospective case‐control

Italy

Brambati 1993

Median 38 (20‐47)

CVS

High‐risk referral for invasive testing

Retrospective cohort

Italy

Brambati 1994

X

Not reported

CVS

High‐risk referral for invasive testing

Case‐control

Italy

Brameld 2008

Median 31 (14‐47), 20% ≥ 35

Karyotyping or follow‐up to birth

Routine screening

Retrospective cohort

Australia

Brizot 1994

Median 38 (22‐45)

Fetal karyotyping

High‐risk referral for invasive testing

Retrospective case‐control

UK

Casals 1996

94.4% > 35

CVS

High‐risk referral for invasive testing

Retrospective case‐control

Spain

Christiansen 1999

Not reported

Karyotyping

High‐risk referral for invasive testing

Case‐control

Denmark

Christiansen 2004

Not reported

CVS (for 120 of cases of Down's) or follow‐up to birth (for 36 of cases of Down's)

Routine screening

Case‐control

Denmark

Christiansen 2005

Not reported

Karyotyping

Screening programmes for syphilis and Down's syndrome

Case‐control

Denmark

Christiansen 2007a

X

Median 37.7 (24‐48) for Down's cases, 36.4 (22‐44) for controls

Karyotyping or follow‐up to birth

Routine screening

Case‐control

Denmark

Christiansen 2009

X

Median 37.5 for Down's cases, 36.4 for controls

Karyotyping or follow‐up to birth

Routine screening

Case‐control

Denmark

Christiansen 2010

X

Median 36 (25‐44) for Down's cases, 29 (17‐45) for controls

Karyotyping or follow‐up to birth

Routine screening

Case‐control

Denmark

Cowans 2010

X

Mean 37.0 (IQR 32.9‐40.5) for Down's cases, 32.4 (IQR 29.0‐35.9) for controls

Karyotyping or follow‐up to birth

Routine screening

Case‐control

UK

Crandall 1993

90% > 35

Amniocentesis

High‐risk referral for invasive testing

Retrospective cohort

USA

Crossley 2002a

Median 29.9, 15.4% ≥ 35

CVS offered where women had high NT measurements. Also amniocentesis or follow‐up to birth

Routine screening

Prospective cohort

UK

De Graaf 1999a

X

Not reported

Amniocentesis or CVS

High‐risk referral for invasive testing

Case‐control

The Netherlands

Forest 1995

Mean 29.1 (SD 4.7), 10.7% ≥ 35

Follow‐up to birth

Routine screening

Case‐control

Canada

Forest 1997

X

Mean 27.9, 10.7% ≥ 35

Follow‐up to birth

Routine screening

Case‐control

Canada

Gyselaers 2005

Not reported

Amniocentesis, CVS and postnatal karyotyping

Routine screening

Prospective cohort

Belgium

Haddow 1998

X

Median 37 (15‐51)

Amniocentesis or CVS

High‐risk referral for invasive testing

Prospective cohort

USA

Kagan 2009

X

Mean 35.4 (14.1‐52.2)

Karyotyping or follow‐up to birth

Routine screening

Prospective cohort

UK

Kornman 1998

Not reported

CVS

High‐risk referral for invasive testing

Case‐control

The Netherlands

Kozlowski 2007 GC

Median 32 (15‐48), 26.4% ≥ 35

Karyotyping or follow‐up to birth

Routine screening

Cohort

Germany

Kozlowski 2007 PC

Median 34 (14‐46), 43.2% ≥ 35

Karyotyping or follow‐up to birth

Routine screening

Cohort

Germany

Krantz 2000

34.7% ≥ 35

Not reported

Routine screening

Prospective cohort

USA

Kratzer 1991

Missing

CVS

High‐risk referral for invasive testing

Case‐control

USA

Laigaard 2003

Not reported

Karyotyping, unclear reference standard for controls

Routine screening

Case‐control

Denmark

Macintosh 1993

Median 38 (27‐40)

CVS

High‐risk referral for invasive testing

Retrospective cohort

UK and Italy

Muller 2003a

Not reported

Invasive testing (offered to women with high NT measurement) or follow‐up to birth

Routine screening

Retrospective cohort

France

Nebiolo 1990

Approximately 75% ≥ 35

CVS

High‐risk referral for invasive testing

Retrospective cohort

Italy

Niemimaa 2001a

17.5% ≥35

Invasive testing (patients considered high‐risk based on NT screening) or follow‐up to birth

Routine screening

Prospective cohort

Finland

Noble 1995

Median 34 (15‐47), 47% ≥ 35

Karyotyping performed (27%), ultrasound examination at 20 weeks (65%), or follow‐up to birth (9%).

Routine screening in a high‐risk population

Prospective cohort

UK

Noble 1997

Median 34 (15‐47)

CVS, follow‐up to birth not reported

Routine screening

Case‐control

UK

O'Leary 2006

Median 31 (14‐47), 20% ≥ 35 years

CVS or amniocentesis (women assessed to be high risk on screening) or follow‐up to birth

Routine screening

Prospective cohort

Australia

Orlandi 1997

Range 15‐46, 35% ≥ 35

Not reported

Routine screening

Prospective cohort

Italy

Palomaki 2007

Mean maternal age 32.3 years (SD 4.6 years)

Karyotyping or follow‐up to birth

Routine screening

Prospective cohort

Canada

Qin 1997

Not reported

CVS, amniocentesis, karyotyping at birth, unclear reference standard for control

Routine screening

Case‐control

Denmark

Sahota 2010

X

Median 33.1, 30.1% ≥ 35

Karyotyping or follow‐up to birth

Routine screening

Prospective cohort

China

Schaelike 2009

31.0% ≥35

Karyotyping or follow‐up to birth

Routine screening

Prospective cohort

Germany

Scott 2004

Median 32 (15‐44), 29% ≥ 35

Invasive testing or follow‐up to birth

Routine screening

Prospective cohort

Australia

Spencer 1999a

X

Median cases 38 (19‐46), controls 36 (15‐47)

Invasive testing (high‐risk women) or follow‐up to birth

Referred for invasive testing or self‐referred for screening

Case‐control

UK

Spencer 2002a

Median cases 36 (20‐44), controls 30 (16‐41)

Not reported

Routine screening

Case‐control

UK

Torring 2010

X

Mean 35 for Down's, 31 for controls

Karyotyping or follow‐up to birth

Routine screening

Case‐control

Denmark

Tsukerman 1999

X

Not reported

Karyotyping, karyotyping at birth, follow‐up to birth not reported

Routine screening

Case‐control

Belarus

Valinen 2007

Mean 29.6, 18.6% ≥ 35

Karyotyping or follow‐up to birth

Routine screening

Retrospective cohort

Finland

Valinen 2009

Not reported

Karyotyping or follow‐up to birth

Routine screening

Case‐control

Finland

Van Lith 1992

Not reported

CVS

High‐risk referral for invasive testing

Case‐control

The Netherlands

Wald 2003a

X

Missing

Invasive testing (following second trimester screening) or follow‐up to birth

Routine screening

Case‐control

UK and Austria

Wallace 1995

Mean 32 (22‐44) for Down's cases, 28 (19‐38) for controls

Not reported

Routine screening

Case‐control

UK

Wapner 2003

X

Mean 35 (SD 4.6), 50% ≥ 35

Invasive testing, miscarriage with cytogenetic testing, follow‐up to birth

Routine screening

Prospective cohort

USA

Weinans 2005

Mean 38 (SD 2.7) for Down's cases, 37 (SD 3.0) for controls

CVS

High‐risk referral for invasive testing

Case‐control

The Netherlands

Wojdemann 2005

Mean 29, 10.8% ≥ 35

Invasive testing (in cases of increased risk) or follow‐up to birth

Routine screening

Prospective cohort

Denmark

Zaragoza 2009

X

Median 37.9 (19.1‐46.5) for Down's cases, 32.7 (16.1‐45.2) for controls

Karyotyping or follow‐up to birth

Routine screening

Case‐control

UK

*The PAPP‐A, free ßhCG and age test combination was the only test evaluated by at least 10 studies. X indicates that the test was evaluated in the study.

CVS: chorionic villus sampling; IQR: interquartile range; SD: standard deviation.

Figures and Tables -
Table 3. Summary of study characteristics
Table Tests. Data tables by test

Test

No. of studies

No. of participants

1 1T PAPP‐A, 5% FPR Show forest plot

4

2837

2 1T PAPP‐A, ≤5th percentile Show forest plot

1

22280

3 1T PAPP‐A, mixed cut‐points Show forest plot

6

25510

4 1T free ßhCG, 5% FPR Show forest plot

4

4280

5 1T total hCG, 5FPR Show forest plot

2

2482

6 1T AFP, 5% FPR Show forest plot

2

2248

10 1T AFP, mixed cut‐points Show forest plot

3

2724

11 1T Inhibin, 5FPR Show forest plot

3

2098

12 1T ADAM 12, 5FPR Show forest plot

1

579

13 1T SP1, 5% FPR Show forest plot

3

1080

17 ba_hcg_ratio, 0.25MoM Show forest plot

1

476

18 1T uE3, 5% FPR Show forest plot

1

1110

19 1T PlGF, 5FPR Show forest plot

1

699

20 1T PAPP‐A and 1T free ßhCG, 5% FPR Show forest plot

2

795

21 1T PAPP‐A and 1T free ßhCG, mixed cut‐points Show forest plot

2

795

22 1T PAPP‐A and 1T AFP, 5% FPR Show forest plot

1

96

23 1T PAPP‐A and 1T ITA, 3% FPR Show forest plot

1

344

24 1T PAPP‐A and 1T ITA, 5% FPR Show forest plot

1

344

25 1T free ßhCG and 1T Inhibin, 5% FPR Show forest plot

1

876

26 1T free ßhCG and 1T AFP, 5% FPR Show forest plot

1

1138

27 1T PAPP‐A and 1T ITA, 10% FPR Show forest plot

1

344

28 1T PAPP‐A, 1T free ßhCG and 1T ITA, 5% FPR Show forest plot

1

344

29 1T PAPP‐A, 1T free ßhCG and 1T ITA,3% FPR Show forest plot

1

344

30 1T PAPP‐A, 1T free ßhCG and 1T ITA, 10% FPR Show forest plot

1

344

31 1T total hCG, 1T free αhCG and 1T progesterone, 0.34 MoM Show forest plot

1

129

32 Age, 1T Inhibin, risk 1:100 Show forest plot

1

40

33 Age, 1T Inhibin, risk 1:250 Show forest plot

1

40

34 Age, 1T Inhibin, risk 1:400 Show forest plot

1

40

35 Age, 1T Inhibin, 5FPR Show forest plot

1

1110

36 Age, 1T Inhibin, mixed cut‐points Show forest plot

2

1150

37 Age, 1T PAPP‐A, 5FPR Show forest plot

5

3491

38 Age, 1T PAPP‐A, mixed cut‐points Show forest plot

6

13742

39 Age, 1T free ßhCG, 5FPR Show forest plot

7

5893

40 Age, 1T free ßhCG, risk 1:384 Show forest plot

1

512

41 Age, 1T free ßhCG, mixed cut‐points Show forest plot

9

16656

42 Age, 1T total hCG,risk 1:384 Show forest plot

1

512

43 Age, 1T total hCG, mixed cut‐points Show forest plot

2

1622

44 Age, 1T AFP, 5FPR Show forest plot

2

1397

45 Age, 1T AFP, risk1:384 Show forest plot

1

512

46 Age, 1T AFP,mixed cut‐points Show forest plot

3

1909

47 Age, 1T uE3, risk 1:384 Show forest plot

1

512

48 Age, 1T uE3, mixed cut‐points Show forest plot

2

799

49 Age, 1T free αhCG, risk 1:384 Show forest plot

1

512

50 Age, 1T SP1, 5FPR Show forest plot

2

804

51 Age, 1T ProMBP, risk 1:250 Show forest plot

1

181

52 Age, 1T ITA, 5FPR Show forest plot

1

278

53 Age, 1T ADAM 12, risk 1:400 Show forest plot

2

703

54 Age, 1T PAPP‐A and 1T free ßhCG, risk 1:250 Show forest plot

11

60484

55 Age, 1T PAPP‐A and 1T free ßhCG, 5FPR Show forest plot

17

49827

56 Age, 1T PAPP‐A and 1T free ßhCG, mixed cut‐points Show forest plot

31

158878

57 Age, 1T PAPP‐A and 1T free ßhCG, mixed cut‐points without 5FPR Show forest plot

20

138731

58 Age, 1T total hCG and 1T PAPP‐A, 5FPR Show forest plot

2

4327

59 Age, 1T PAPP‐A and 1T Inhibin, risk 1:100 Show forest plot

1

41

60 Age, 1T PAPP‐A and 1T Inhibin, risk 1:250 Show forest plot

1

40

61 Age, 1T PAPP‐A and 1T Inhibin, risk 1:400 Show forest plot

1

40

62 Age, 1T PAPP‐A and 1T Inhibin, 5FPR Show forest plot

1

1110

63 Age, 1T PAPP‐A and 1T Inhibin, mixed cut‐points Show forest plot

2

1150

64 Age, 1T PAPP‐A and 1T ITA, 5FPR Show forest plot

2

622

65 Age, 1T PAPP‐A and 1T AFP, 5FPR Show forest plot

2

2705

66 Age, 1T free ßhCG and 1T Inhibin, risk 1:100 Show forest plot

1

40

67 Age, 1T free ßhCG and 1T Inhibin, risk 1:250 Show forest plot

1

40

68 Age, 1T free ßhCG and 1T Inhibin, risk 1:400 Show forest plot

1

40

69 Age, 1T free ßhCG and 1T Inhibin, 5FPR Show forest plot

1

1110

70 Age, 1T free ßhCG and 1T Inhibin, mixed cut‐points Show forest plot

2

1150

71 Age, 1T free ßhCG and 1T AFP, 5FPR Show forest plot

3

2992

72 Age, 1T free ßhCG and 1T AFP, risk 1:250 Show forest plot

1

1656

73 Age, 1T free ßhCG and 1T AFP, risk 1:384 Show forest plot

1

512

74 Age, 1T free ßhCG and 1T AFP, mixed cut‐points Show forest plot

5

5160

75 Age, 1T AFP and 1T uE3, risk 1:384 Show forest plot

1

512

76 Age, 1T AFP and 1T free αhCG, risk 1:384 Show forest plot

1

512

77 Age, 1T free ßhCG and 1T total hCG, risk 1:384 Show forest plot

1

512

78 Age, 1T free ßhCG and 1T uE3, risk 1:384 Show forest plot

1

512

79 Age, 1T free ßhCG and 1T uE3, 5FPR Show forest plot

1

287

80 Age, 1T free ßhCG and 1T uE3, mixed cut‐points Show forest plot

2

799

81 Age, 1T free ßhCG and 1T SP1, 5FPR Show forest plot

1

60

82 Age, 1T free ßhCG and 1T SP1 risk 1:250 Show forest plot

1

60

83 Age, 1T AFP and 1T total hCG, 1:384 Show forest plot

1

512

84 Age, 1T free ßhCG and 1T free αhCG, risk 1:384 Show forest plot

1

512

85 Age, 1T total hCG and 1T uE3, risk 1:384 Show forest plot

1

512

86 Age, 1T total hCG and 1T Inhibin, 5FPR Show forest plot

1

1110

87 Age, 1T total hCG and 1T free αhCG, risk 1:384 Show forest plot

1

512

88 Age, 1T uE3 and 1T free αhCG, risk 1:384 Show forest plot

1

512

89 Age, 1T PAPP‐A, 1T free ßhCG and 1T AFP, 5FPR Show forest plot

2

2705

90 Age, 1T PAPP‐A, 1T free ßhCG and 1T AFP, mixed cut‐points Show forest plot

3

8188

91 Age, 1T free ßhCG, 1T AFP and 1T uE3, 5FPR Show forest plot

1

287

92 Age, 1T free ßhCG, 1T AFP and 1T uE3, risk 1:384 Show forest plot

1

512

93 Age, 1T free ßhCG, 1T AFP and 1T uE3, mixed cut‐points Show forest plot

2

799

94 Age, 1T total hCG, 1T AFP and 1T uE3, risk 1:384 Show forest plot

1

512

95 Age, 1T total hCG, 1T AFP and 1T uE3, mixed cut‐points Show forest plot

2

1505

96 Age, 1T AFP, free αhCG and 1T uE3, risk 1:384 Show forest plot

1

512

97 Age, 1T PAPP‐A, 1T free ßhCG and 1T Inhibin, 5FPR Show forest plot

1

1110

98 Age, 1T PAPP‐A, 1T total hCG and 1T Inhibin, 5FPR Show forest plot

1

1110

99 Age, 1T PAPP‐A, sp1 and 1T ProMBP, 5FPR Show forest plot

1

192

100 Age, 1T PAPP‐A, sp1 and 1T ProMBP, risk 1:250 Show forest plot

1

192

101 Age, 1T free ßhCG, 1T total hCG, 1T AFP and 1T uE3, risk 1:384 Show forest plot

1

512

102 Age, 1T total hCG, 1T AFP, 1T uE3 and 1T free αhCG, risk 1:384 Show forest plot

1

512

103 Age, 1T PAPP‐A, 1T free ßhCG, 1T AFP, 1T uE3 and 1T Inhibin, 5FPR Show forest plot

1

1110

104 Age, 1T PAPP‐A, 1T total hCG, 1T AFP, 1T uE3 and 1T Inhibin, 5FPR Show forest plot

1

1110

105 Age, 1T free ßhCG, 1T total hCG, 1T AFP, 1T uE3 and 1T free αhCG, risk 1:384 Show forest plot

1

512

106 Age, 1T hPL, risk 1:250 Show forest plot

1

183

107 Age, 1T hPL, 1T PAPP‐A, risk 1:250 Show forest plot

1

183

108 Age, 1T hPL, 1T free ßhCG, risk 1:250 Show forest plot

1

183

109 Age, 1T hPL, 1T PAPP‐A, 1T free ßhCG, risk 1:250 Show forest plot

1

183

110 Age, 1T PGH, risk 1:250 Show forest plot

1

335

111 Age, 1T PGH, 1T PAPP‐A , risk 1:250 Show forest plot

1

335

112 Age, 1T PGH, 1T free ßhCG , risk 1:250 Show forest plot

1

335

113 Age, 1T PGH, 1T PAPP‐A, 1T free ßhCG , risk 1:250 Show forest plot

1

335

114 Age, 1T GHBP, risk 1:250 Show forest plot

1

335

115 Age, 1T GHBP, 1T PAPP‐A, risk 1:250 Show forest plot

1

335

116 Age, 1T GHBP, 1T free ßhCG, risk 1:250 Show forest plot

1

335

117 Age, 1T GHBP, 1T PGH, risk 1:250 Show forest plot

1

335

118 Age, 1T GHBP, 1T PAPP‐A, 1T free ßhCG , risk 1:250 Show forest plot

1

335

119 Age, 1T GHBP, 1T PGH, 1T PAPP‐A, 1T free ßhCG , risk 1:250 Show forest plot

1

335

120 Age, 1T ADAM 12, risk 1:250 Show forest plot

1

531

121 Age, 1T ADAM 12, 1T PAPP‐A, 1T free ßhCG, risk 1:250 Show forest plot

3

1501

122 Age, PlGF, 1T PAPP‐A, 1T free ßhCG, 5FPR Show forest plot

2

1144

123 Age, 1T PAPP‐A and 1T free ßhCG, risk 1:300 Show forest plot

4

41172

124 Age, 1T PAPP‐A, 1T Hyperglycosylated hCG, 5FPR Show forest plot

1

10775

128 Age, ADAM 12, 1T PAPP‐A, 5FPR Show forest plot

1

691

129 Age, ADAM 12, 1T PAPP‐A, 1T free ßhCG, 5FPR Show forest plot

2

1222

130 Age, 1T PlGF, 5FPR Show forest plot

1

699

131 1T PlGF, 1T PAPP‐A, 1T free ßhCG, 5FPR Show forest plot

1

699

132 Age, 1T ADAM 12, 1T PAPP‐A, 1T free ßhCG, mixed cut‐points Show forest plot

3

1501

133 Age, 1T PAPP‐A, 1T free ßhCG and 1T Inhibin, risk 1:250 Show forest plot

1

40

134 Age, 1T PAPP‐A, 1T free ßhCG, and 1T Inhibin, mixed cut‐points Show forest plot

2

1150

Figures and Tables -
Table Tests. Data tables by test