Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Single and Serial Fetal Biometry to Detect Preterm and Term Small- and Large-for-Gestational-Age Neonates: A Longitudinal Cohort Study

  • Adi L. Tarca ,

    romeror@mail.nih.gov (RR); atarca@med.wayne.edu (ALT)

    Affiliations Perinatology Research Branch, NICHD/NIH/DHHS, Bethesda, MD, and Detroit, Michigan, United States of America, Department of Obstetrics and Gynecology, Wayne State University School of Medicine, Detroit, Michigan, United States of America, Department of Computer Science, Wayne State University College of Engineering, Detroit, Michigan, United States of America

  • Edgar Hernandez-Andrade,

    Affiliations Perinatology Research Branch, NICHD/NIH/DHHS, Bethesda, MD, and Detroit, Michigan, United States of America, Department of Obstetrics and Gynecology, Wayne State University School of Medicine, Detroit, Michigan, United States of America

  • Hyunyoung Ahn,

    Affiliations Perinatology Research Branch, NICHD/NIH/DHHS, Bethesda, MD, and Detroit, Michigan, United States of America, Department of Obstetrics and Gynecology, Wayne State University School of Medicine, Detroit, Michigan, United States of America

  • Maynor Garcia,

    Affiliations Perinatology Research Branch, NICHD/NIH/DHHS, Bethesda, MD, and Detroit, Michigan, United States of America, Department of Obstetrics and Gynecology, Wayne State University School of Medicine, Detroit, Michigan, United States of America

  • Zhonghui Xu,

    Affiliation Perinatology Research Branch, NICHD/NIH/DHHS, Bethesda, MD, and Detroit, Michigan, United States of America

  • Steven J. Korzeniewski,

    Affiliations Perinatology Research Branch, NICHD/NIH/DHHS, Bethesda, MD, and Detroit, Michigan, United States of America, Department of Obstetrics and Gynecology, Wayne State University School of Medicine, Detroit, Michigan, United States of America, Department of Epidemiology and Biostatistics, Michigan State University, East Lansing, Michigan, United States of America

  • Homam Saker,

    Affiliations Perinatology Research Branch, NICHD/NIH/DHHS, Bethesda, MD, and Detroit, Michigan, United States of America, Department of Obstetrics and Gynecology, Wayne State University School of Medicine, Detroit, Michigan, United States of America

  • Tinnakorn Chaiworapongsa,

    Affiliations Perinatology Research Branch, NICHD/NIH/DHHS, Bethesda, MD, and Detroit, Michigan, United States of America, Department of Obstetrics and Gynecology, Wayne State University School of Medicine, Detroit, Michigan, United States of America

  • Sonia S. Hassan,

    Affiliations Perinatology Research Branch, NICHD/NIH/DHHS, Bethesda, MD, and Detroit, Michigan, United States of America, Department of Obstetrics and Gynecology, Wayne State University School of Medicine, Detroit, Michigan, United States of America

  • Lami Yeo,

    Affiliations Perinatology Research Branch, NICHD/NIH/DHHS, Bethesda, MD, and Detroit, Michigan, United States of America, Department of Obstetrics and Gynecology, Wayne State University School of Medicine, Detroit, Michigan, United States of America

  • Roberto Romero

    romeror@mail.nih.gov (RR); atarca@med.wayne.edu (ALT)

    Affiliations Perinatology Research Branch, NICHD/NIH/DHHS, Bethesda, MD, and Detroit, Michigan, United States of America, Department of Epidemiology and Biostatistics, Michigan State University, East Lansing, Michigan, United States of America, Department of Obstetrics and Gynecology, University of Michigan, Ann Arbor, Michigan, United States of America, Center for Molecular Medicine and Genetics, Wayne State University, Detroit, Michigan, United States of America

Abstract

Objectives

To assess the value of single and serial fetal biometry for the prediction of small- (SGA) and large-for-gestational-age (LGA) neonates delivered preterm or at term.

Methods

A cohort study of 3,971 women with singleton pregnancies was conducted from the first trimester until delivery with 3,440 pregnancies (17,334 scans) meeting the following inclusion criteria: 1) delivery of a live neonate after 33 gestational weeks and 2) two or more ultrasound examinations with fetal biometry parameters obtained at ≤36 weeks. Primary outcomes were SGA (<5th centile) and LGA (>95th centile) at birth based on INTERGROWTH-21st gender-specific standards. Fetus-specific estimated fetal weight (EFW) trajectories were calculated by linear mixed-effects models using data up to a fixed gestational age (GA) cutoff (28, 32, or 36 weeks) for fetuses having two or more measurements before the GA cutoff and not already delivered. A screen test positive for single biometry was based on Z-scores of EFW at the last scan before each GA cut-off so that the false positive rate (FPR) was 10%. Similarly, a screen test positive for the longitudinal analysis was based on the projected (extrapolated) EFW at 40 weeks from all available measurements before each cutoff for each fetus.

Results

Fetal abdominal and head circumference measurements, as well as birth weights in the Detroit population, matched well to the INTERGROWTH-21st standards, yet this was not the case for biparietal diameter (BPD) and femur length (FL) (up to 9% and 10% discrepancy for mean and confidence intervals, respectively), mainly due to differences in the measurement technique. Single biometry based on EFW at the last scan at ≤32 weeks (GA IQR: 27.4–30.9 weeks) had a sensitivity of 50% and 53% (FPR = 10%) to detect preterm and term SGA and LGA neonates, respectively (AUC of 82% both). For the detection of LGA using data up to 32- and 36-week cutoffs, single biometry analysis had higher sensitivity than longitudinal analysis (52% vs 46% and 62% vs 52%, respectively; both p<0.05). Restricting the analysis to subjects with the last observation taken within two weeks from the cutoff, the sensitivity for detection of LGA, but not SGA, increased to 65% and 72% for single biometry at the 32- and 36-week cutoffs, respectively. SGA screening performance was higher for preterm (<37 weeks) than for term cases (73% vs 46% sensitivity; p<0.05) for single biometry at ≤32 weeks.

Conclusions

When growth abnormalities are defined based on birth weight, growth velocity (captured in the longitudinal analysis) does not provide additional information when compared to the last measurement for predicting SGA and LGA neonates, with both approaches detecting one-half of the neonates (FPR = 10%) from data collected at ≤32 weeks. Unlike for SGA, LGA detection can be improved if ultrasound scans are scheduled as close as possible to the gestational-age cutoff when a decision regarding the clinical management of the patient needs to be made. Screening performance for SGA is higher for neonates that will be delivered preterm.

Introduction

The detection of fetal growth disorders is an important goal in modern prenatal care [14]. Sonographic fetal biometry is the method of choice to assess fetal size and growth [58]. Yet, there has been considerable controversy about the need for serial biometry measurements during pregnancy [9,10], and at the present time, professional organizations do not recommend this approach [11]. Controversies include: which reference range or standard to use for fetal size and growth (e.g., local vs. international) [1215], the timing at which fetal biometry should be obtained [16], the need for serial measurements [17], which specific anatomic parameters to monitor (e.g., abdominal circumference, head circumference, estimated fetal weight), and the precise cutoff points to identify abnormalities. Moreover, the diagnostic endpoint is also a subject of controversy. While emphasis has been placed on identification of the small- or large-for-gestational-age infant, some have argued that the ideal end-point should be one at which not only smallness at birth but also morbidity be considered [1820].

An important issue is whether serial fetal biometry should be used in all pregnancies [21,22]. A prospective cohort study of unselected nulliparous women with a singleton, viable gestation reported that universal sonography in the third trimester almost tripled the detection of SGA neonates compared to clinically indicated sonography (from 20% to 57%) [23]. This improved sensitivity was associated with an increased rate of false positive diagnoses. The current study addresses whether or not serial fetal biometry can improve the detection of SGA and LGA neonates over that of a single fetal biometric examination.

Methods

This longitudinal study was conducted at the Center for Advanced Obstetrical Care and Research of the Perinatology Research Branch, Eunice Kennedy Shriver National Institute of Child Health and Human Development (NICHD), National Institutes of Health (NIH), and Wayne State University, at Hutzel Women’s Hospital, Detroit, Michigan, from January 2009 until December 2014. All patients provided written informed consent and were enrolled in research protocols approved by the Human Investigation Committee of Wayne State University and by the Institutional Review Board of the NICHD.

The study population consisted of 3,971 eligible women with singleton pregnancies without congenital or chromosomal anomalies, who underwent serial ultrasound examination every four weeks until 24 weeks of pregnancy and thereafter every two weeks until delivery. However, since the gestational age at first scan varied among patients and not all patients returned to the clinic on schedule, the number and gestational ages at ultrasound examinations varied among patients Exclusion criteria were: delivery prior to 33 gestational weeks, missing data about gestational age at delivery, fetal gender, and birth weight. To be included in this analysis, subjects must have undergone at least two fetal biometry examinations at ≤36 weeks with biparietal diameter (BPD), head circumference (HC), abdominal circumference (AC), and femur length (FL) measurements. The resulting dataset of 3,440 pregnancies (17,334 scans) was analyzed to predict SGA and LGA outcomes for three different time points: 1) delivery after 28 weeks, 2) delivery after 32 weeks, and 3) delivery after 36 weeks. Subjects were included if they underwent two or more fetal biometric examinations at least three weeks apart prior to delivery.

Ultrasound examinations

Gestational age was determined by the crown-rump length measurement or by the biparietal diameter during the first ultrasound scan (median gestational age 13.4 weeks; IQR 10–19 weeks). The BPD, HC, AC, and FL measurements were obtained by following the recommendations of the International Society of Ultrasound in Obstetrics and Gynecology [24]. Fetal weight was estimated according to a modified Hadlock formula: log10(EFW) = 1.4035+0.0441∙AC+0.177∙FL-0.0037∙AC∙ FL+0.0027∙BPD2. [25]

Outcome definitions

The INTERGROWTH-21st birthweight standards according to gestational age at delivery and fetal sex were used to define SGA and LGA neonates [26]. Since the INTERGROWTH-21st birth weight standards were defined for neonates delivered at 33 weeks or more, we limited the study group herein to those delivering a neonate at ≥ 33 gestational weeks. SGA was defined as a birthweight <5th centile, and LGA was defined as a birthweight >95th centile.

Two approaches described below (single biometry and serial/longitudinal biometry) were used to screen pregnancies under three scenarios that differ by: i) the gestational age cut-off (28, 32, and 36 weeks) up to which ultrasound scans were used for prediction of outcomes, provided that two or more scans were available up to the cut-off and ii) the earliest gestational age at delivery of patients included in the analysis (33+1, 33+1, and 36+1 weeks, respectively). These scenarios allow assessing how SGA and LGA prediction performance changes as more data becomes available prior to delivery. Although the population of patients changes somewhat from one scenario to another the comparison between single biometry and longitudinal analysis is performed on the data from the same patients.

Screening based on the last fetal biometric examination (single biometry)

The first approach used for screening relied on a single (last) EFW evaluation before the GA cut-off under consideration. To account for differences in the GA values at last evaluation between subjects, EFW data was transformed into Z-scores by subtracting the mean and dividing by the standard deviation computed from longitudinal EFW profiles of all fetuses included in this study as previously suggested [27]. The mean and standard deviation of EFW at each GA was derived using a linear mixed-effects model similar to previous work by Royston [28] but using a 5th degree polynomial function instead of fractional polynomials. International standards for EFW were not used for screening due to the differences demonstrated in Fig 1 for BPD and FL between the data collected in this study and INTERGROWTH-21st standards. The INTERGROWTH-21st references were developed based on biparietal diameter measurements from outer-to-outer of the parietal bones, whereas in this study, BPD is measured from outer-to-inner of the parietal bones [29]. The screen status of each patient was set as SGA positive (Z score EFW <zFPR = 10%), LGA positive (Z score EFW > zFPR = 10%) or appropriate for GA, otherwise. The cut-off zFPR = 10% represents the Z-score percentile that resulted in a false positive rate of 10% for each outcome (SGA or LGA) separately to enable direct comparison between the sensitivity of different prediction methods and cut-off points.

thumbnail
Fig 1. A comparison of this study data and INTERGROWTH 21st standards.

Longitudinal measurements of abdominal circumference (AC) (top left), head circumference (HC) (top right), biparietal diameter (BPD) (bottom left), and femur length (FL) (bottom right), as well as 90% confidence intervals, derived from both the Hutzel population used in this study (red) and the INTERGROWTH-21st study (blue).

https://doi.org/10.1371/journal.pone.0164161.g001

Screening based on serial fetal biometric examinations (longitudinal)

The procedure to predict an outcome (SGA or LGA) based on two or more longitudinal measurements collected up to a given GA cutoff involved two main steps:

1) For each subject, forecast (extrapolate) the EFW value at term (GA = 40 weeks) from longitudinal determinations of EFW up to the GA cut-off considered using linear mixed-effects models. This step is similar to that reported by Albert [30] implemented in publically available software from the Biostatistics and Bioinformatics Branch of the NICHD, yet adapted for unbalanced data (GA at sample varies among patients), and using a Rossavik [31] instead of a quadratic equation to correlate EFW to GA at examination:

The linear mixed-effects model included, in addition to the fixed effects coefficients (c, k, s), a corresponding random effect that allowed the trajectory of each subject to depart from the overall population average. The model was fit using the complete dataset to compute fixed effects coefficients, variance components, and properties of the distribution of random effects. For each subject separately, the EFW at two or more time points until the GA cut-off were used to compute subject-specific random effects. Using population-derived fixed effects and subject-specific random effects, the EFW at 40 weeks of gestation was determined (projected).

2) The resulting projections of EFW at 40 weeks GA (EFW40) were used to set the screen status of each patient as SGA positive (EFW40< cFPR = 10%) or LGA positive (EFW40> cFPR = 10%), where cFPR = 10% is the percentile on EFW40 such that the false positive rate is 10% for each outcome (SGA or LGA) separately.

Results

Comparison to the INTERGROWTH-21st standards

Demographic characteristics of the population in this study are presented in Table 1. Our patients were mainly African-American women (91.3%) with a high prevalence of BMI>30 (36%) and with a high prevalence of tobacco usage (20%). Of the 3,440 pregnancies included for analysis, 161 (4.7%) were classified as SGA (<5th percentile) and 174 (5.1%) were classified as LGA (>95th percentile) based on the INTERGROWTH-21st birth weight standards.

thumbnail
Table 1. Demographic characteristics of the studied cohort.

https://doi.org/10.1371/journal.pone.0164161.t001

The rates of SGA were higher among nulliparous, preeclamptic, and women who reported smoking during pregnancy, while being lower in obese women (all p<0.05). The rates of LGA were higher in obese women as well as those with gestational diabetes Mellitus (all p<0.05) (see Table 1).

The 17,334 longitudinal ultrasound measurements for AC, HC, BPD, and FL of the study population are displayed in Fig 1. The number of samples per fetus was 5(3–6), 5(4–6) and 4(3–6) for AGA, SGA and LGA fetuses, respectively [median (IQR)] (Wilcoxon test p = 0.01 for the difference in number of samples per pregnancies between LGA and non-LGA pregnancies). Mean AC and HC derived using linear mixed-effects models for the population in this study matched those published by the INTERGROWTH-21st project [29] with at most 1% discrepancy in the interval of 15 to 39 weeks while the 95% confidence intervals diverged by at most 4%. In the gestational age interval 15–39 weeks GA, the mean of FL was higher by up to 3% while the mean BPD was lower up to 9% in our study population that the one of the INTERGROWTH-21st study, while the 95% confidence intervals diverged by up to 10% for BPD and 7% for FL. These results suggest systematic differences in the protocols for measurements of lengths but similar protocols for circumferences between the current study and INTERGROWTH-21st standards (Fig 1).

Comparison between single biometry and longitudinal biometric evaluation for detection of small- and large-for-gestational-age fetuses

Single biometry analysis had a sensitivity of 39%, 51%, and 56% (FPR = 10%) to identify preterm and term SGA neonates delivered after 28, 32, and 36 weeks of gestation based on the last evaluation of EFW available at ≤28, ≤32, ≤36 weeks, respectively (Fig 2). The longitudinal biometry analysis based on projected EFW at 40 weeks from two or more observations before each cutoff resulted in almost identical sensitivities (see Fig 2), (McNemar’s test for correlated proportions; p>0.2 for all cutoffs). Similarly, single biometry analysis had a sensitivity of 40%, 53%, and 63% (FPR = 10%) to identify preterm and term LGA neonates delivered after 28, 32, and 36 weeks of gestation based on the last evaluation of EFW available at ≤28, ≤32, ≤36 weeks, respectively (Fig 2). The longitudinal biometry had lower sensitivity, i.e. 37%, 46%, and 51%, for the same cutoffs, respectively (McNemar’s test; p<0.05 for the 32- and 36-week cutoffs). When data from nulliparous, smoking, obese, preeclamptic, diabetic and chronic hypertensive women was excluded from the analysis, a modest decrease in SGA and LGA prediction performance was noted (AUC decreased up to 0.06, and sensitivity [at 90% specificity] decreased up to 0.15) over the 24 possible combinations of four outcomes (SGA<5th, SGA<10th, LGA>95th, LGA>90th) three gestational age cut-off scenarios (28, 32, 36) and two types of analysis methods (single biometry and longitudinal analysis) as shown in S2 Table.

thumbnail
Fig 2. SGA and LGA prediction performance for the entire study population.

Sensitivity of predicting small-for-gestational–age (SGA; <5th percentile) (left) and large-for-gestational–age neonates (LGA; >95th percentile) (right) for multiple GA cutoffs. Bars show sensitivity at a 10% false positive rate when using data up to a given GA cutoff (x-axis) to predict the outcome of infants delivered after that cutoff. The number of controls/cases based on which sensitivity is estimated is given under each cutoff. Blue bars correspond to single biometry analysis (last available sample) while red bars are used for longitudinal analysis. * denotes a significant difference in sensitivity for the given cutoff (p<0.05).

https://doi.org/10.1371/journal.pone.0164161.g002

Impact of gestational age at last sample on prediction performance of SGA and LGA

Since ultrasound scans were not scheduled at fixed gestational weeks for each woman, there was substantial variability in the gestational age at the last sample of women included in the SGA and LGA prediction analysis for each of the cutoffs considered (28, 32, and 36 weeks of gestation). Table 2 gives the summary statistics for the last GA value before each cutoff for cases and controls. For example, when the GA cutoff considered was 32 weeks, the median GA at the last point for fetuses that had two or more scans at ≤32 weeks was 30 weeks (IQR: 28–31) for both SGA cases and respective controls as well as for LGA cases and respective controls. To determine the impact of the GA at the last sample on the SGA and LGA prediction performance, we have restricted the analysis to those fetuses having the last scan no more than two weeks prior to the threshold (see Fig 3). The sensitivity of SGA prediction was 41%, 51%, and 61% for the single biometry and 44%, 49%, and 63% for the longitudinal analysis at the ≤28-, ≤32-, ≤36-week GA cutoffs, respectively (FPR = 10%). This corresponds to a net increase in sensitivity of about 5% for both single and longitudinal analyses at the 36-week cutoff only. The sensitivity of LGA prediction in this restricted analysis was 49%, 64%, and 71% for the single biometry and 46%, 57%, and 62% for the longitudinal analysis at the ≤28-, ≤32-, ≤36-week GA cutoffs, respectively (FPR = 10%). This corresponds to a net increase in sensitivity of about 10% for both single biometry and longitudinal analysis for all three cutoffs in the restricted analysis compared to the overall analysis.

thumbnail
Fig 3. SGA and LGA prediction performance for a restricted study population.

Sensitivity of predicting small-for-gestational–age (SGA; <5th percentile) (left) and large-for-gestational–age neonates (LGA; >95th percentile) (right) for multiple GA cutoffs when restricting analysis to subjects with the last measurement before each cutoff within two weeks from the cutoff.

https://doi.org/10.1371/journal.pone.0164161.g003

thumbnail
Table 2. Distribution of gestational ages at the final evaluation before each gestational age cutoff.

https://doi.org/10.1371/journal.pone.0164161.t002

Fig 4 shows the Z-score-transformed EFW value for the last measurement at ≤ 32 weeks as a function of the GA at delivery for both the overall and restricted analyses. The classification model is a cutoff (horizontal line) on the Z-score chosen such that the false positive rate is 10%.

thumbnail
Fig 4. Estimated fetal weight Z-scores as a function of gestational age at delivery and outcome.

Estimated fetal weight Z-scores for the last available sample at ≤32 weeks of gestation for all pregnancies (top) and those with the last sample between 30 and 32 weeks (bottom) as a function of gestational age at delivery. Horizontal lines denote the Z-score cutoff that leads to a false positive rate of 10% for each outcome (small- or large-for-gestational-age newborns) separately.

https://doi.org/10.1371/journal.pone.0164161.g004

Effect of gestational age at delivery on the prediction performance of SGA

As shown in Fig 4, the overlap of the EFW Z-scores (last measurement at ≤ 32 weeks) between SGA cases and controls is reduced when delivery occurs earlier. Indeed, when computed separately for preterm neonates (delivery at <37 weeks), the sensitivity for SGA screening was higher (73% at 10% FPR) than for term cases (delivery ≥37 weeks) (46% at 10% FPR) (p = 0.024). The differences in SGA screening performance between preterm and term cases were even more significant and were detected at both the 28- and 32-week cutoffs when the SGA<10th percentile was used as the outcome instead of the SGA<5th percentile. Similar improvements were observed for longitudinal analysis as well (see S1 Table).

The effect of fetal weight estimation bias

The EFW calculations used in this study for SGA and LGA screening relied on Hadlock’s formula linking birth weight to ultrasound parameters. Since the coefficients in this formula were derived in a different population, biases in fetal weight estimation are known to occur [25] with smaller weights being more underestimated than larger weights (see S1 Fig). As smaller birth weights tend to be obtained at lesser gestational age values, such biases are mitigated by estimating the mean and SD EFW as a function of gestation in the analysis. Therefore, using custom coefficients in Hadlock’s formula had a minor impact on the SGA and LGA prediction performance (see S2 Table).

Comparison with other studies contrasting single and longitudinal biometry

In order to perform a meaningful comparison between the current study and previous work [30] involving LGA screening delivered after 36 weeks, we have defined LGA as birth weight >4,000g and limited the analysis to those having the last scan after 34 weeks. In this analysis scenario, the single biometry analysis resulted in a sensitivity of 51% (AUC = 0.84), which was slightly higher than the 43% sensitivity (AUC = 0.77) obtained for the longitudinal analysis (p = 0.052). These AUC estimates were in good agreement with those obtained by Albert [30] who reported an AUC of 0.80 for single biometry at 36 weeks and 0.76 for longitudinal analysis based on four observations (including the one at 36 weeks of gestation).

Discussion

The principal results of this study are as follows:

  1. Birthweights in the cohort of 3,440 pregnancies from our study population were similar to those of the INTERGROWTH-21st fetal sex-specific birth weight standards, with 4.7% of neonates being classified as SGA (<5th centile) and 5.1% as LGA (>95th centile).
  2. Our locally derived reference for fetal size parameters matched the INTERGROWTH-21st study standards for fetal head and abdominal circumferences (at most 1% discrepancy for mean values and 4% for confidence intervals) but not those for biparietal diameter and femur length (up to 9% for the mean and 10% for confidence intervals).
  3. Single biometry (the last available assessment of EFW) as well as longitudinal analysis (from two or more serial measurements of EFW taken three or more weeks apart) at ≤32 weeks can identify one-half of neonates who will be SGA (<5th centile) (FPR = 10%). For LGA prediction at the ≤32- and ≤36-week cutoffs, single biometry is preferable (p<0.05).
  4. The sensitivity of single biometry for the detection of LGA increased from 53% to 64% and from 63% to 71% for the 32- and 36-week cutoffs, respectively, when the last scan was within two weeks from the cutoff.
  5. Preterm (delivery at <37 weeks) SGA cases are easier to detect based on a single biometry at ≤32 than term SGA cases (73% vs 46% sensitivity at 10% FPR; p<0.05).

Using universal standards for fetal size evaluation

Universal standards generated by the INTERGROWTH-21st project define optimal fetal size and birth weight based on a population of healthy, well-nourished pregnant women and their fetuses [29]. These standards were proposed for screening for growth abnormalities across all health care systems with the hope that they will improve pregnancy outcomes compared to locally produced references in use worldwide [29]. Although the birth weight distribution of our study population of 3,440 newborns was well-matched to the distribution of birth weight of newborns included in the INTERGROWTH-21st project, this was not the case for two of the four ultrasound parameters (biparietal diameter and femur length). The most likely reasons for discrepancies are differences in the measurement techniques between the two studies (for BPD) and racial differences for femur length [15,32]. Therefore, in this study, we have used INTERGROWTH-21st standards to classify neonates as SGA and LGA but relied on locally derived reference ranges for fetal biometry.

Single biometry for SGA and LGA screening

Most studies reporting prediction performance of ultrasound-based biometry rely on a single measurement taken at a fixed gestation (e.g., 36 weeks [30,33]) or a gestational-age interval (e.g., 30–34 weeks [16], 34–37 weeks [16], 35–37 weeks [34], 34–42 weeks [27], 30–32 weeks [35], and 34–36 weeks [35]). When the GA at measurement is the same for all subjects, the performance of SGA or LGA prediction can be directly assessed using a cutoff point on the ultrasound parameter value that provides the optimal accuracy or desired fixed specificity [30]. However, when the gestational age at sample varies among fetuses, the ultrasound parameter value or EFW needs to be first converted into a Z-score using an external standard [16,27,34]. This approach was also used in this work to evaluate single biometry.

Performance estimates for single biometry were reported by Souka et al. [16] to be about 50% sensitivity (10% FPR) for both SGA (<5th percentile) and LGA (>95th percentile) for a single EFW evaluation between 30–34 weeks in a Caucasian population of 3,690 women. A similar performance was obtained by Fadigas et al. [34] for SGA prediction in a population of 5,515 patients having an ultrasound scan taken between 35–37 weeks. The authors reported an 89% detection rate for SGA newborns with a 10% false positive rate, combining maternal characteristics, obstetric history, and fetal biometric measurements. Bakalis et al. [36] reported a 65% detection rate of SGA (<5th centile) neonates with delivery at ≥37 weeks when the EFW assessed at 30–34 weeks was combined with maternal factors, uterine artery PI, mean arterial pressure, and serum placental growth factor. The results described herein for single biometry at ≤32 weeks in a mostly African-American population are in very good agreement with these previous findings as we report about a 50% sensitivity for both SGA (<5th centile) and LGA (>95th centile) prediction (FPR = 10%).

A recent study [23] reported screening for fetal growth restriction with universal third-trimester ultrasonography in 3,977 nulliparous women. The authors reported the performance of a single biometry before birth as being 57% (10% FPR) (AUC = 0.87) in detecting SGA <10th. For the same outcome (SGA <10th) using single biometry at ≤36 weeks, we obtained a 55% sensitivity (AUC = 0.85) (S2 Table). The good agreement between studies is even more relevant considering the differences in the birth weight standards used to define the outcome (INTERGROWTH-21st birth weight standards vs a UK reference), as well as the differences in the distribution of GA at scan for this analysis (median|IQR = 34|32–35, see Table 2, vs 36 weeks).

Longitudinal biometry for SGA and LGA screening

The primary aim of this study was to evaluate whether the knowledge of fetal-size parameters from prior ultrasound examinations can improve prediction of SGA and LGA neonates compared to the last available fetal biometry examination prior to a given GA cut-off. Previous studies analyzing longitudinal ultrasound data involved either different outcomes and/or asked different questions regarding the benefit of considering longitudinal biometry as a part of clinical care [37,38].

For example, Chang et al [27] reported a study in which weekly ultrasound examinations were performed in 156 women with SGA fetuses based on an ultrasound examination in an attempt to identify those with a reduced ponderal index (weight/length3) at birth, and those who will develop suboptimal neonatal outcomes [39]. The information used for discrimination was the rate of change in abdominal circumference and EFW between the first (18–20 weeks) and last (36 weeks) available examinations, after converting these parameters into Z-scores. The authors reported an improved prediction performance using this longitudinal-based approach compared to a single (last available) examination.

Subsequently, Royston [28] described a methodology to construct standards for growth vs. size, suggesting that these should be conditional at early measurements in each subject as opposed to one-fits-all growth rate standards. Owen et al. [40] reported weekly growth velocity charts derived from 274 low-risk women examined monthly. The authors [41] studied the effect of scan intervals on predicting infants with anthropometric features of fetal growth restriction; they reported that velocity over 4- and 6-week intervals had a likelihood ratio positive (LR+) of about 10.0 and was better than those derived from a 2-week interval (LR+ about 5.0).

Others have reported that two ultrasound examinations increase the detection of small fetuses and those at a higher risk of perinatal complications; however, the data were treated as separate evaluations, and no growth rate information was used [42].

More recently, a framework to summarize longitudinal biometry by extrapolating individual growth trajectories at a fixed time point in pregnancy (e.g., at term) was proposed [30,43], yet it was limited to term LGA detection and to subjects with four measurements at fixed time points. Our work extends the evaluation of this projection-based approach to longitudinal biometry for preterm and term SGA and LGA screenings under routinely available longitudinal sampling, which involves varying numbers of observations and gestational ages at sampling among subjects. Albert et al. [44] reported that for LGA prediction the last available measurement of AC is more informative than the projected AC value from two or more earlier observations. Our results also suggest that for an outcome like SGA and LGA that are based on birth weight, the most informative variable is the last EFW evaluation, and that especially for LGA detection, the most recent EFW assessment prior to delivery, the higher the detection rate.

Clinical implications of single or longitudinal biometric fetal evaluations

For preterm and term SGA detection, a single (most recent) or multiple biometry provide similar performance, while for detection of LGA after 32 weeks of gestation, single biometry is preferred [2,45]. For LGA detection, scheduling EFW evaluations closer to the GA cutoff at which a decision needs to be taken increases sensitivity. Since this work was limited to assessing outcomes defined by birthweight, it does not inform on the possible use of serial biometry to detect complications during pregnancy that would help to determine optimal time for steroid administration and the time of delivery. Although several maternal factors presented in Table 1 had a significant association with fetal size and hence risk of an SGA or LGA outcome, the predictive performance of EFW determinations in predicting SGA and LGA was only moderately affected when data from nulliparous, smoking, obese, preeclamptic, diabetic and chronic hypertensive women were excluded from the analysis.

Strengths and limitations

Among the strengths of this study is the fact that the comparison between single biometry and longitudinal analysis for SGA and LGA screenings was based on outcomes defined using the INTERGROWTH-21st fetus sex-specific birth weight standards as opposed to local reference ranges for birth weight or fixed cutoffs on birth weight. Moreover, by using multiple GA cut-offs, we have assessed the differences between these methods both earlier and later in pregnancy for preterm and term delivered neonates. To these strengths we can add a large sample size of 3,440 women. The main limitation of this study is that the number and gestational age at ultrasound scans varied among patients.

Supporting Information

S1 Fig. Relation between estimated fetal weight and actual birthweight depending whether the original or customized Hadlock formula coefficients are used.

Birthweight (g) as a function of Estimated Fetal Weight (g) are shown in pregnancies where the last scan was taken within one week from the delivery. The Hadlock model with original coefficients has a bias, i.e. the smaller the baby the more underestimated is the fetal weight (left). Using a revised formula in which the coefficients are updated for the current population removed the bias (right).

https://doi.org/10.1371/journal.pone.0164161.s001

(TIF)

S1 Table. Comparison of SGA and LGA prediction performance between term and preterm delivery.

GA: gestational age. FPR: False Positive Rate.

https://doi.org/10.1371/journal.pone.0164161.s002

(XLS)

S2 Table. SGA and LGA prediction performance under additional analysis scenarios.

SGA and LGA prediction performance, sensitivity at 10% false positive rate (FPR), and area under the receiver operating characteristic curve (AUC) is shown for the original analysis as well as two other scenarios.

https://doi.org/10.1371/journal.pone.0164161.s003

(XLS)

Acknowledgments

The authors would like to thank Dr. Russell Deter for constructive comments and suggestions. The ultrasound experience and technical support of senior Registered Diagnostic Medical Sonographers (RDMS) Catherine Ducharme and Denise Haggerty are gratefully acknowledged.

Author Contributions

  1. Conceptualization: ALT EHA HA TC SSH LY RR.
  2. Data curation: ALT EHA HA MG HS TC SSH LY RR.
  3. Formal analysis: ALT EHA.
  4. Funding acquisition: ALT EHA SJK HS TC SSH LY RR.
  5. Investigation: EHA HA MG SJK HS TC SSH LY RR.
  6. Methodology: ALT EHA SJK HS TC SSH LY RR.
  7. Project administration: ALT EHA SSH RR.
  8. Resources: ALT EHA SSH LY RR.
  9. Software: ALT.
  10. Supervision: ALT EHA SSH RR.
  11. Validation: ALT EHA.
  12. Visualization: ALT EHA ZX.
  13. Writing – original draft: ALT EHA LY RR.
  14. Writing – review & editing: ALT EHA HA MG SJK HS TC SSH LY RR.

References

  1. 1. Lindqvist PG, Molin J (2005) Does antenatal identification of small-for-gestational age fetuses significantly improve their outcome? Ultrasound Obstet Gynecol 25: 258–264. pmid:15717289
  2. 2. Henriksen T (2008) The macrosomic fetus: a challenge in current obstetrics. Acta Obstet Gynecol Scand 87: 134–145. pmid:18231880
  3. 3. Degani S (2001) Fetal biometry: clinical, pathological, and technical considerations. Obstet Gynecol Surv 56: 159–167. pmid:11254152
  4. 4. Claesson R, Aberg A, Marsal K (2007) Abnormal fetal growth is associated with gestational diabetes mellitus later in life: population-based register study. Acta Obstet Gynecol Scand 86: 652–656. pmid:17520394
  5. 5. Conde-Agudelo A, Papageorghiou AT, Kennedy SH, Villar J (2013) Novel biomarkers for predicting intrauterine growth restriction: a systematic review and meta-analysis. BJOG 120: 681–694. pmid:23398929
  6. 6. Brodszki J, Marsal K (2000) Management of pregnancies with suspected intrauterine growth retardation in Sweden. Results of a questionnaire. Acta Obstet Gynecol Scand 79: 723–728. pmid:10993094
  7. 7. Marsal K (1992) Antenatal diagnosis of intrauterine growth retardation by ultrasound. Int J Technol Assess Health Care 8 Suppl 1: 160–169.
  8. 8. Chauhan SP, Magann EF (2006) Screening for fetal growth restriction. Clin Obstet Gynecol 49: 284–294. pmid:16721107
  9. 9. Deter RL, Lee W, Sangi-Haghpeykar H, Tarca AL, Yeo L, Romero R (2014) Individualized fetal growth assessment: critical evaluation of key concepts in the specification of third trimester size trajectories. J Matern Fetal Neonatal Med 27: 543–551. pmid:23962305
  10. 10. Iraola A, Gonzalez I, Eixarch E, Meler E, Illa M, Gardosi J, et al. (2008) Prediction of adverse perinatal outcome at term in small-for-gestational age fetuses: comparison of growth velocity vs. customized assessment. J Perinat Med 36: 531–535. pmid:18990039
  11. 11. Bricker L, Neilson JP, Dowswell T (2008) Routine ultrasound in late pregnancy (after 24 weeks' gestation). Cochrane Database Syst Rev: CD001451. pmid:18843617
  12. 12. Gardosi J, Francis A (2009) A customized standard to assess fetal growth in a US population. Am J Obstet Gynecol 201: 25 e21–27.
  13. 13. Sjaarda LA, Albert PS, Mumford SL, Hinkle SN, Mendola P, Laughon SK (2014) Customized large-for-gestational-age birthweight at term and the association with adverse perinatal outcomes. Am J Obstet Gynecol 210: 63 e61–63 e11.
  14. 14. Ogasawara KK (2009) Variation in fetal ultrasound biometry based on differences in fetal ethnicity. Am J Obstet Gynecol 200: 676 e671–674.
  15. 15. Buck Louis GM, Grewal J, Albert PS, Sciscione A, Wing DA, Grobman WA, et al. (2015) Racial/ethnic standards for fetal growth: the NICHD Fetal Growth Studies. Am J Obstet Gynecol 213: 449 e441–449 e441.
  16. 16. Souka AP, Papastefanou I, Pilalis A, Michalitsi V, Panagopoulos P, Kassanos D (2013) Performance of the ultrasound examination in the early and late third trimester for the prediction of birth weight deviations. Prenat Diagn 33: 915–920. pmid:23703542
  17. 17. Callec R, Lamy C, Perdriolle-Galet E, Patte C, Heude B, Morel O, et al. (2015) Impact on obstetric outcome of third-trimester screening for small-for-gestational-age fetuses. Ultrasound Obstet Gynecol 46: 216–220. pmid:25487165
  18. 18. Unterscheider J, Daly S, Geary MP, Kennelly MM, McAuliffe FM, O'Donoghue K, et al. (2013) Optimizing the definition of intrauterine growth restriction: the multicenter prospective PORTO Study. Am J Obstet Gynecol 208: 290 e291–296.
  19. 19. Laurin J, Persson PH, Polberger S (1987) Perinatal outcome in growth retarded pregnancies dated by ultrasound. Acta Obstet Gynecol Scand 66: 337–343. pmid:3321859
  20. 20. Unterscheider J, O'Donoghue K, Daly S, Geary MP, Kennelly MM, McAuliffe FM, et al. (2014) Fetal growth restriction and the risk of perinatal mortality-case studies from the multicentre PORTO study. BMC Pregnancy Childbirth 14: 63. pmid:24517273
  21. 21. Romero R, Deter R (2015) Should serial fetal biometry be used in all pregnancies? Lancet 386: 2038–2040. pmid:26360241
  22. 22. White SW, Marsh JA, Lye SJ, Briollais L, Newnham JP, Pennell CE (2015) Improving customized fetal biometry by longitudinal modelling. J Matern Fetal Neonatal Med: 1–7.
  23. 23. Sovio U, White IR, Dacey A, Pasupathy D, Smith GC (2015) Screening for fetal growth restriction with universal third trimester ultrasonography in nulliparous women in the Pregnancy Outcome Prediction (POP) study: a prospective cohort study. Lancet.
  24. 24. Salomon LJ, Alfirevic Z, Berghella V, Bilardo C, Hernandez-Andrade E, Johnsen SL, et al. (2011) Practice guidelines for performance of the routine mid-trimester fetal ultrasound scan. Ultrasound Obstet Gynecol 37: 116–126. pmid:20842655
  25. 25. Lee W, Balasubramaniam M, Deter RL, Yeo L, Hassan SS, Gotsch F, et al. (2009) New fetal weight estimation models using fractional limb volume. Ultrasound Obstet Gynecol 34: 556–565. pmid:19725080
  26. 26. Villar J, Cheikh Ismail L, Victora CG, Ohuma EO, Bertino E, Altman DG, et al. (2014) International standards for newborn weight, length, and head circumference by gestational age and sex: the Newborn Cross-Sectional Study of the INTERGROWTH-21st Project. Lancet 384: 857–868. pmid:25209487
  27. 27. Chang TC, Robson SC, Spencer JA, Gallivan S (1993) Identification of fetal growth retardation: comparison of Doppler waveform indices and serial ultrasound measurements of abdominal circumference and fetal weight. Obstet Gynecol 82: 230–236. pmid:8336870
  28. 28. Royston P (1995) Calculation of unconditional and conditional reference intervals for foetal size and growth from longitudinal measurements. Stat Med 14: 1417–1436. pmid:7481181
  29. 29. Papageorghiou AT, Ohuma EO, Altman DG, Todros T, Cheikh Ismail L, Lambert A, et al. (2014) International standards for fetal growth based on serial ultrasound measurements: the Fetal Growth Longitudinal Study of the INTERGROWTH-21st Project. Lancet 384: 869–879. pmid:25209488
  30. 30. Albert PS (2012) A linear mixed model for predicting a binary event from longitudinal data under random effects misspecification. Stat Med 31: 143–154. pmid:22081439
  31. 31. Rossavik IK, Deter RL (1984) Mathematical modeling of fetal growth: I. Basic principles. J Clin Ultrasound 12: 529–533. pmid:6439746
  32. 32. Ghi T, Cariello L, Rizzo L, Ferrazzi E, Periti E, Prefumo F, et al. (2016) Customized Fetal Growth Charts for Parents' Characteristics, Race, and Parity by Quantile Regression Analysis: A Cross-sectional Multicenter Italian Study. J Ultrasound Med 35: 83–92. pmid:26643757
  33. 33. Hadlock FP, Harrist RB, Martinez-Poyer J (1991) In utero analysis of fetal growth: a sonographic weight standard. Radiology 181: 129–133. pmid:1887021
  34. 34. Fadigas C, Saiid Y, Gonzalez R, Poon LC, Nicolaides KH (2015) Prediction of small-for-gestational-age neonates: screening by fetal biometry at 35–37 weeks. Ultrasound Obstet Gynecol 45: 559–565. pmid:25728139
  35. 35. Di Lorenzo G, Monasta L, Ceccarello M, Cecotti V, D'Ottavio G (2013) Third trimester abdominal circumference, estimated fetal weight and uterine artery doppler for the identification of newborns small and large for gestational age. Eur J Obstet Gynecol Reprod Biol 166: 133–138. pmid:23122032
  36. 36. Bakalis S, Peeva G, Gonzalez R, Poon LC, Nicolaides KH (2015) Prediction of small-for-gestational-age neonates: screening by biophysical and biochemical markers at 30–34 weeks. Ultrasound Obstet Gynecol.
  37. 37. Barker ED, McAuliffe FM, Alderdice F, Unterscheider J, Daly S, Geary MP, et al. (2013) The role of growth trajectories in classifying fetal growth restriction. Obstet Gynecol 122: 248–254. pmid:23969791
  38. 38. Blake KV, Gurrin LC, Beilin LJ, Stanley FJ, Kendall GE, Landau LI, et al. (2002) Prenatal ultrasound biometry related to subsequent blood pressure in childhood. J Epidemiol Community Health 56: 713–718. pmid:12177091
  39. 39. Chang TC, Robson SC, Spencer JA, Gallivan S (1994) Prediction of perinatal morbidity at term in small fetuses: comparison of fetal growth and Doppler ultrasound. Br J Obstet Gynaecol 101: 422–427. pmid:8018615
  40. 40. Owen P, Donnet ML, Ogston SA, Christie AD, Howie PW, Patel NB (1996) Standards for ultrasound fetal growth velocity. Br J Obstet Gynaecol 103: 60–69. pmid:8608100
  41. 41. Owen P, Maharaj S, Khan KS, Howie PW (2001) Interval between fetal measurements in predicting growth restriction. Obstet Gynecol 97: 499–504. pmid:11275017
  42. 42. McKenna D, Tharmaratnam S, Mahsud S, Bailie C, Harper A, Dornan J (2003) A randomized trial using ultrasound to identify the high-risk fetus in a low-risk population. Obstet Gynecol 101: 626–632. pmid:12681862
  43. 43. Liu D, Albert PS (2014) Combination of longitudinal biomarkers in predicting binary events. Biostatistics 15: 706–718. pmid:24831103
  44. 44. Albert PS, Liu D Combination of longitudinal biomarkers in predicting binary events: pattern mixture model and shared random effects model.
  45. 45. Boulvain M, Senat MV, Perrotin F, Winer N, Beucher G, Subtil D, et al. (2015) Induction of labour versus expectant management for large-for-date fetuses: a randomised controlled trial. Lancet.