Skip to main content

Main menu

  • Home
  • About
    • Clinical Chemistry
    • Editorial Board
    • Most Read
    • Most Cited
    • Alerts
    • CE Credits
  • Articles
    • Current Issue
    • Early Release
    • Future Table of Contents
    • Archive
    • Browse by Subject
  • Info for
    • Authors
    • Reviewers
    • Subscribers
    • Advertisers
    • Permissions & Reprints
  • Resources
    • AACC Learning Lab
    • Clinical Chemistry Trainee Council
    • Clinical Case Studies
    • Clinical Chemistry Guide to Scientific Writing
    • Clinical Chemistry Guide to Manuscript Review
    • Journal Club
    • Podcasts
    • Q&A
    • Translated Content
  • Abstracts
  • Submit
  • Contact
  • Other Publications
    • The Journal of Applied Laboratory Medicine

User menu

  • Subscribe
  • My alerts
  • Log in
  • Log out

Search

  • Advanced search
Clinical Chemistry
  • Other Publications
    • The Journal of Applied Laboratory Medicine
  • Subscribe
  • My alerts
  • Log in
  • Log out
Clinical Chemistry

Advanced Search

  • Home
  • About
    • Clinical Chemistry
    • Editorial Board
    • Most Read
    • Most Cited
    • Alerts
    • CE Credits
  • Articles
    • Current Issue
    • Early Release
    • Future Table of Contents
    • Archive
    • Browse by Subject
  • Info for
    • Authors
    • Reviewers
    • Subscribers
    • Advertisers
    • Permissions & Reprints
  • Resources
    • AACC Learning Lab
    • Clinical Chemistry Trainee Council
    • Clinical Case Studies
    • Clinical Chemistry Guide to Scientific Writing
    • Clinical Chemistry Guide to Manuscript Review
    • Journal Club
    • Podcasts
    • Q&A
    • Translated Content
  • Abstracts
  • Submit
  • Contact
Research ArticleArticle

Detection of Fetal Subchromosomal Abnormalities by Sequencing Circulating Cell-Free DNA from Maternal Plasma

Chen Zhao, John Tynan, Mathias Ehrich, Gregory Hannum, Ron McCullough, Juan-Sebastian Saldivar, Paul Oeth, Dirk van den Boom, Cosmin Deciu
DOI: 10.1373/clinchem.2014.233312 Published March 2015
Chen Zhao
Sequenom Laboratories,
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
John Tynan
Sequenom Laboratories,
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Mathias Ehrich
Sequenom Inc., San Diego, CA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Gregory Hannum
Sequenom Laboratories,
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Ron McCullough
Sequenom Laboratories,
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Juan-Sebastian Saldivar
Sequenom Laboratories,
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Paul Oeth
Sequenom Laboratories,
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Dirk van den Boom
Sequenom Inc., San Diego, CA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Cosmin Deciu
Sequenom Laboratories,
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Article
  • Figures & Data
  • Info & Metrics
  • PDF
Loading

Abstract

BACKGROUND: The development of sequencing-based noninvasive prenatal testing (NIPT) has been largely focused on whole-chromosome aneuploidies (chromosomes 13, 18, 21, X, and Y). Collectively, they account for only 30% of all live births with a chromosome abnormality. Various structural chromosome changes, such as microdeletion/microduplication (MD) syndromes are more common but more challenging to detect. Recently, several publications have shown results on noninvasive detection of MDs by deep sequencing. These approaches demonstrated the proof of concept but are not economically feasible for large-scale clinical applications.

METHODS: We present a novel approach that uses low-coverage whole genome sequencing (approximately 0.2×) to detect MDs genome wide without requiring prior knowledge of the event's location. We developed a normalization method to reduce sequencing noise. We then applied a statistical method to search for consistently increased or decreased regions. A decision tree was used to differentiate whole-chromosome events from MDs.

RESULTS: We demonstrated via a simulation study that the sensitivity difference between our method and the theoretical limit was <5% for MDs ≥9 Mb. We tested the performance in a blinded study in which the MDs ranged from 3 to 40 Mb. In this study, our algorithm correctly identified 17 of 18 cases with MDs and 156 of 157 unaffected cases.

CONCLUSIONS: The limit of detection for any given MD syndrome is constrained by 4 factors: fetal fraction, MD size, coverage, and biological and technical variability of the event region. Our algorithm takes these factors into account and achieved 94.4% sensitivity and 99.4% specificity.

The discovery of circulating cell-free (ccf)3 fetal DNA in maternal plasma (1, 2) has greatly enhanced research in noninvasive prenatal testing (NIPT). In recent years, several studies have demonstrated the performance of fetal aneuploidy detection using massively parallel sequencing (MPS) (3–8). Although detection of whole-chromosome aneuploidy such as trisomy 13, 18, and 21 is becoming routine practice (8), subchromosomal abnormality detection remains challenging. A study published by Jensen et al. (9) demonstrated detection of a 3-Mb deletion on chr22q11.2 that causes the DiGeorge syndrome. The results showed a statistically significant reduction of standardized read count (z < −3) in the DiGeorge region for the affected samples. However, to achieve statistical significance, relatively high coverage was required with approximately 200 million reads per sample. A study published by Srinivasan et al. (10) showed that deep sequencing allowed detection of several small fetal copy number variations (CNVs). These researchers partitioned the genome into 1-Mb bins and tested statistical significance for individual bins to identify a CNV. A similar publication by Yu et al. (11) used locally weighted scatterplot smoothing (LOESS) for GC correction (median counts 144 million per sample) and they required 3 consecutive bins having |z| > 3 to identify a CNV. A major drawback of individual bin methods is that they tend to produce many false positives (FPs) and false negatives (FNs) (12) and the rules requiring that multiple bins be significant are study specific. Recent work by Straver et al. demonstrated improvements over the individual bin method by applying a within-sample sliding window approach (12). Their method detected aberrations down to 20 Mb at relatively shallow sequencing coverage (0.15–1.66x). However, an attempt to further increase detection resolution resulted in an increased number of FPs. Another recent study by Chen et al. proposed a binary segmentation and dynamic threshold strategy to detect CNVs >10 Mb using low-coverage sequencing data (13). Out of the 1311 tested cases, 3 affected samples with known G-banding karyotype results were correctly identified. Although the method showed promising results on large CNVs, it is unclear how it will perform on CNVs smaller than 10 Mb. Most recently, Rampasek et al. presented a probabilistic method for detecting fetal CNVs from maternal plasma using a unified Hidden Markov model (14). With use of deep sequencing and a trio analysis, the authors of this study estimated that CNVs bigger than 400 kb could be detected at 90% sensitivity if the fetal fraction is sufficiently high (13%).

In this report, we present a novel approach which uses shallow whole-genome sequencing (approximately 0.2× coverage) data to detect genome-wide microdeletions or microduplications (MDs) (also referred to in this report as “events”) without requiring a priori knowledge of an event's location. Unlike previously published approaches, our focus is on improving the normalization method to remove as much sequencing “noise” as possible upfront. Following that, a rigorous statistical method is applied to detect consistently increased or decreased regions in the normalized data without any individual bin-level z-score rules. We show that our method achieves the theoretical limit of detection for large MDs by “spiking in” synthetic MDs in real sequencing data. Finally, we test the performance of our algorithm in a blinded study and demonstrate its performance.

The scripts and data files used in this paper are available at: https://sourceforge.net/projects/mddetection.

Methods

DATA NORMALIZATION

To remove sequencing bias, we developed a 2-step normalization approach. Briefly, reads were aligned to hg19, allowing for only perfect matches within the seed sequence using Bowtie 2 (15). The genome was then partitioned into 50-kbp nonoverlapping bins and the raw count for each bin was determined. After binning, regions with high variability or low mappability were excluded according to a previously established method (8). To normalize the 50-kbp raw bin count, GC bias was removed by employing a LOESS-based correction for a sample-wise correction, similar to the one described in Alkan et al. (16). Then, principal component analysis (PCA) was used to remove higher-order artifacts for a population-based correction (17, 18). To train the PCA normalization step, we used a data matrix comprising LOESS normalized bin counts from 1000 female pregnancies and transformed the data matrix into the principal component space. Then for a new sample, we built a linear regression model between its LOESS normalized bin count and the top principal components from the training set. The residuals of this model serve as final normalized values for this given sample. Intuitively, the top principal components represent noise commonly seen in euploid samples, and therefore removing them can effectively improve normalization. The combination of the last 2 steps replaces the older, region-based normalization we previously published (8).

CIRCULAR BINARY SEGMENTATION

Our goal is to develop an algorithm for detection of MDs regardless of their genomic coordinates. As such, the proposed algorithm needs to search for such events in a nontargeted fashion. To do so, we apply the circular binary segmentation (CBS) method which has been extensively used by the array comparative genomic hybridization (CGH) community for CNV detection (19, 20). CBS works by iteratively partitioning a chromosome into equal copy number regions using the likelihood ratio statistic, and it can pinpoint the change point precisely. Many studies have demonstrated its superior performance over other methods (21, 22). Although powerful, CBS tends to overly partition the genome when the signal-to-noise ratio is low. To compensate for oversegmentation we further applied a segment-merging algorithm similar to the one described by Willenbrock and Fridlyand (22).

QUANTIFYING THE STATISTICAL SIGNIFICANCE OF DETECTED EVENTS

After CBS, each chromosome was partitioned into regions of equal copy number. To quantify the statistical significance of the detected event, we developed 2 approaches. First, for each chromosome, the segment with the largest area was treated as a potential MD, and its z-score, zCBS, was calculated by comparing the summed bin count with respect to a reference set of samples in the same region (8). The whole-chromosome z score, zCHR, was also calculated in the same way. Second, we calculated the log odds ratio (LOR) to quantify the likelihood of the event being true at the measured fetal fraction, f (see the LOR Section 1 in the Data Supplement that accompanies the online version of this article at http://www.clinchem.org/content/vol61/issue4).

DECISION TREE FOR GENOME-WIDE ANEUPLOIDY OR MD DETECTION

For each chromosome, 4 statistics can be inferred:

  1. The z score and LOR for the largest segment partitioned by the CBS process: zCBS and LORCBS.

  2. The whole-chromosome z score and LOR: zCHR and LORCHR.

Note that the largest segment is defined to be the segment with the largest area (height × length) out of all the segments on a given chromosome. For example, in Fig. 1B, there are 2 segments, and the second segment has the largest area. The latter represents the most significant finding on the chromosome of interest. On the basis of these 4 statistics, we propose the following decision tree method for aneuploidy or MD detection:

  1. A chromosome is classified to be trisomy or monosomy if:

    • |zCHR| ≥ C, LORCHR > 0, and |zCHR| ≥ a|zCBS|;

  2. A chromosome is classified to have microdeletion/microduplication if:

    • It is not a trisomy or monosomy, and |zCBS| ≥ C, LORCBS > 0,

where C is a predefined z-score cutoff that controls the tradeoff between sensitivity and specificity. In this report, we set C to be 3 for chromosome 21 and 3.95 for all the other chromosomes. The threshold was selected on the basis of our previous study on trisomy detection, in which more than 99.9% sensitivity and specificity were achieved for all 3 major aneuploidies (8). The comparison |zCHR| ≥ a|zCBS| allows one to distinguish a whole chromosome event from a subchromosomal event; in practice, we find that a between 0.6 and 0.8 works well. The value was determined by assessing its impact on the detection of in silico–created MDs at different sizes and fetal fractions. See the online Supplemental Material for detailed descriptions of the selection of a. Because there might be multiple MD events on the same chromosome, the decision tree process moves to the next-largest segment on the chromosome until there are no further significant segments.

Fig. 1.
  • Download figure
  • Open in new tab
Fig. 1. Bin count profiles (50 kb) for a sample with microduplication before and after normalization.

Blue color is the diploid region and maroon color is the microduplication region. The karyogram is also shown for each trace, where the coloring scheme follows the UCSC genome browser coloring convention. Due to bin filtering, very few bins locate at the centromere regions (red triangles). The x-axis is the coordinate of each bin in Mb. (A), Scaled raw bin count. (B), Normalized bin count. After normalization, the expected bin count for the diploid region is 1.

GENOMIC AND PLASMA DNA SAMPLE PREPARATION

We conducted a blinded clinical evaluation study to test the performance of the proposed MD detection algorithm. The study comprised 2 parts: an analytical validation using genomic DNA (gDNA) mixtures and a clinical validation using plasma samples.

Forty-five gDNA samples (130 μL of 30 ng/μL) were processed for library preparation, and sequencing libraries were created as previously described by Jensen et al. (8). gDNA mixture models were created by normalizing gDNA and nonpregnant female plasma (NPP) DNA libraries to 1.6 nmol/L. A starting mix of 20% gDNA library was created in a background of NPP library and subsequently used to create 17.5%, 15%, 12.5%, 10, 7.5%, and 5% mixtures by serial dilution. These were then multiplexed, clustered, and sequenced on the HiSeq2000 for 36 + 7 cycles in a 12-plex format (see online Supplemental Material for detailed protocols).

A total of 183 clinical plasma samples were collected using Investigational Review Board–approved clinical protocols. Samples with abnormal G-band karyotype results collected through these clinical protocols were selected for sequencing to determine detection rates for MDs. Euploid samples were chosen on the basis of normal karyotype reports using either fluorescence in situ hybridization (FISH) or G-banding methods. Library preparation and sequencing were conducted similarly to that of gDNA samples.

Because the karyotype outcome was based on FISH or G-banding, we anticipated that some of the “euploid” samples contained MD events. In case of putative FP results at 12-plex sequencing, we further sequenced the library at uniplex to confirm its validity. If the results were confirmed by uniplex sequencing, we counted the results as true positives (TPs) instead of FPs. Previous results have described the limitations of traditional karyotyping and also compared it with sequencing-based results (10).

Results

NORMALIZATION

Sequencing bias can cause nonuniform coverage in data, which complicates downstream CNV analysis (23, 24). Fig. 1 shows the normalization result on a 12-plex sequencing sample. Without normalization, many regions would aberrantly appear as depletions or duplications (Fig. 1A). After normalization, the microduplication event became more evident (Fig. 1B).

MD LIMIT OF DETECTION

For subchromosomal abnormality detection, it is of fundamental importance to understand the limit of detection (LoD) for MDs. Four factors affect the LoD: fetal fraction (f), size of the event, coverage, and the biological and technical variability of the event region. The first 3 factors are easy to understand: it is easier to detect an event with higher fetal fraction, larger size, and higher sequencing coverage. The fourth factor is related to the fact that certain regions are more variable than others due to various factors (such as GC bias, repetitive elements, and mapping ability) and are harder to detect. Fig. 2 illustrates the impact of the 4 factors. Through a simulation study, we observed that at 0.2× coverage, the sensitivity difference between the proposed decision tree method and the theoretical limit was <5% for MDs ≥9 Mb (see online Supplemental Fig. 1).

Fig. 2.
  • Download figure
  • Open in new tab
Fig. 2. Four factors affect the detectability of microdeletions.

The plots follow the same convention as Fig. 1. The orange line represents the median value of the respective regions. (A), Low fetal fraction (6.5%) vs high fetal fraction (21%). (B), Small-size MD (7.5 Mb) vs large-size MD (31.75 Mb). (C), Low coverage (15 M reads) vs high coverage (180 M reads). (D), Bin count SD differs from region to region.

ANALYTICAL VALIDATION USING GENOMIC DNA MIXTURES

A blinded gDNA study was conducted to test the performance of the proposed decision tree method on several selected syndromes with high clinical relevance. Due to the rarity of these syndromes, we created a gDNA mixture model system for analytical validation. Forty-five gDNA samples from individuals with DiGeorge, Cri-du-chat, Prader-Willi, Angelman, or 1p36 deletion syndromes were obtained from Coriell Cell Repositories or from CombiMatrix Diagnostics. The source, disease, and karyotype results of the gDNAs are provided in online Supplemental Table 1. In total, 17 were positive for Prader-Willi or Angelman syndrome, 14 were positive for Cri-du-chat, 13 were positive for DiGeorge, and 1 was positive for 1p36. All 45 gDNA samples were sequenced once from a 20%–7.5% mixture, and they were sequenced twice at a 5% mixture. A total of 360 gDNA mixtures were sequenced.

Fetal fraction plays a crucial role in the detectability of any given MD. We measured the observed fetal fraction by chromosome Y on male samples to confirm the planned titration levels. Online Supplemental Fig. 2 shows that the measured values agreed well with the planned titrations, with slight overdilution. The median measured titration concentrations were 18.3%, 16.0%, 14.0%, 11.6%, 9.3%, 7.1%, and 4.7% respectively.

Before the gDNA unblinding, we predicted the sensitivity to detect each of the syndromes using the LoD framework. Briefly, the genomic location and size definition of each syndrome is acquired from the DECIPHER database (25). A syndrome-specific microdeletion or microduplication is created by sampling the observed titration values shown in online Supplemental Fig. 2. After unblinding, we found that the observed sensitivity for detection of each of these 3 syndromes fell within the 95% CI, with the exception of the 4.7% gDNA fractions (Fig. 3). Such high concordance suggests that the LoD framework is predictive of the actual performance. We note that the discrepancy at 4.7% can be mainly attributed to the size difference between the database size and the tested cases, and that such differences are most obvious at low fetal fractions. For example, the Cri-du-chat syndrome has a size of 12.52 Mb according to the DECIPHER database (25), and our testing cases ranged from 8 to 20 Mb, which made the detection easier at low fetal fractions.

Fig. 3.
  • Download figure
  • Open in new tab
Fig. 3. Predicted sensitivity (red cycles), 95% CI (red bars), and observed sensitivity (black crosses) for Cri-du-chat, Prader-Willi/Angelman, and DiGeorge at the observed titration levels.

The plot is not shown for 1p36 due to limited sample size (n = 1), which is detectable until 9.3% titration.

For the 1p36 sample, we were able to detect at 9.3% titration and above. An example is shown in online Supplemental Fig. 3. We note that this 1p36 sample has a size of only approximately 3 Mb, which is much smaller than its database definition of 12.83 Mb (25). In real life applications, we expect the overall sensitivity to be higher.

Within this set of gDNA mixtures each gDNA had a combination of karyotype and microarray results confirming the presence of an MD event for 1 of the 5 syndromes queried in this study. We used this information to determine specificity for the nonaffected syndrome regions in each of the gDNA samples. There was no false-positive result by the decision tree algorithm for any of the 360 gDNA mixtures. An overall specificity of 100% (95% CI, 98.7%–100%) was achieved in the gDNA study. The high specificity value is paramount for good positive predictive values, given the low prevalence of these conditions.

CLINICAL VALIDATION USING PLASMA SAMPLES

To further test the performance of the proposed decision tree method in a genome-wide fashion, we conducted a blinded study using maternal plasma DNAs with matched karyotype. For this study, 183 samples were analyzed using 12-plex sequencing. Fetal fractions were measured by the same method used in our current clinical practice (26, 27).

On the completion of sequencing, 1 (1) affected sample and 4 (4) euploid samples failed QC and were subsequently excluded from analysis. Hence, we had a total of 178 plasma samples with a median fetal fraction of 9.2% (range, 2.8%–24.9%). We did not impose any QC cutoff for fetal fraction. According to the karyotype table, the expected MD sizes ranged from 3 to 40 Mb (see online Supplemental Table 2).

The algorithm detected 13 of the 16 affected samples. Of the 3 supposed FNs, one was expected to show trisomy 8, one was expected to show a deletion of chromosome 4q34, and the third one was expected to show unbalanced translocation between chromosome 12 and chromosome 19. Subsequent detailed review of the first karyotype reports indicated the trisomy 8 sample was a low-level mosaic sample (approximately 8% cells affected). Because this low level of mosaicism would not have met the sample inclusion criteria, we excluded this sample from the remaining analysis. Review of the second karyotype report indicated that the mother also carried the same chromosome 4q34 deletion as the fetus. Our algorithm did successfully detect the event and correctly called it as a maternal event. Because we focused only on de novo fetal events, this sample was also excluded from the analysis. Review of the third sample did not reveal any complex karyotype information, and the negative algorithm outcome was probably a result of a low fetal fraction of 4.8%. We note that this FN sample was still nondetectable at uniplex sequencing.

For the remaining 162 presumed euploid samples, 5 MD events were detected. Additionally, in 2 of the 16 affected samples, we detected 2 events, whereas the karyotype report indicated only 1 event. Because the study collection protocol provided only the karyotyping result of amniocytes for these samples, no pure fetal tissue was available for sequencing or array confirmation. Therefore, to confirm the validity of our findings, we resequenced at higher coverage (uniplex) all the putative FP samples. Of these, 1 sample led to a different outcome based on the higher coverage sequencing and was deemed an FP result. The remaining 4 samples produced the same outcome when sequenced in uniplex. The MD events detected in these 4 samples included MDs of 2.85, 3.25, 22.15, 23.55, 25, 32.45, and 42.3 Mb. Fig. 4 illustrates the comparisons between 12-plex and uniplex results for (A) a confirmed TP sample by uniplex, (B) a confirmed FP by uniplex, and (C) the maternal deletion detected by both 12-plex and uniplex.

Fig. 4.
  • Download figure
  • Open in new tab
Fig. 4. Comparison between 12-plex (left panel) and uniplex (right panel) results.

The plots follow the same convention as Fig. 1. The uniplex results show significantly improved signal to noise ratio compared to the 12-plex results. (A), A TP event detected at 12-plex and confirmed by uniplex sequencing. (B), A FP event detected at 12-plex but not by uniplex sequencing. (C), The maternal deletion detected by both the 12-plex and uniplex sequencing.

Table 1 summarizes the performance of our algorithm on 176 plasma samples (excluding the mosaic trisomy 8 and maternal chr4q34 deletion). Our algorithm attained a 94.4% sensitivity (95% CI, 70.6%–99.7%), and a 99.4% specificity (95% CI, 95.96%–99.97%) for the plasma samples. The CI was calculated on the basis of the proportion test with continuity correction (28). We want to emphasize again that 4 out of 5 putative FP samples were confirmed by uniplex sequencing and subsequently counted as TP samples. Because we did not have pure tissue samples for further invasive arrayCGH confirmation, the interpretation of specificity was limited to the signal detection perspective. We note that other researchers have also observed discrepancies between sequencing and the traditional karyotyping methods (10).

View this table:
  • View inline
  • View popup
Table 1.

Contingency table for the plasma study.a

Discussion

The methodology described in this report for noninvasive detection of fetal subchromosomal abnormalities relies on a simple yet fundamental assumption: namely, that any plasma sample pertaining to a mixture of maternal DNA and fetal DNA in which the latter component has a genomic abnormality will lead to sequencing results similar to those of euploid samples over the entire genome except for the region where the fetus is affected. Moreover, for the affected region, the sequencing results will show a deviation in sequenced reads that is directly proportional to the fetal fraction but described by the same variability as observed in a euploid population. We have tested this hypothesis over a set of 16 plasma samples from women carrying affected fetuses and have proven it to be consistent with these assumptions. This result has warranted the exploration of the performance of this methodology over a specific set of genomic abnormalities that were experimentally created to mimic the fetal and maternal ccf DNA mixture as present in maternal plasma.

Compared to our previous clinical validation efforts (4, 6–8), in this study we have performed an analytical, genome-wide validation of this bioinformatics algorithm. None of the abnormalities investigated were statistically powered individually in a classical sense. Given the low prevalence of many of these abnormalities and given the fact this is a genome-wide assay, it would have been impossible to collect a sufficiently large number of affected sample for each possible abnormality. This kind of difficulty is not unlike the one encountered in the development of genome-wide chromosomal array CGH invasive prenatal diagnosis (29). The performance of the chromosomal microarray analysis has been evaluated through larger cohort studies, such as reported by Wapner et al. in 2012 (30). Based on the findings of this particular study, the American College of Obstetricians and Gynecologists and the Society for Maternal-Fetal medicine have recently issued the recommendation that in cases where in which one or more major fetal structural abnormalities have been identified by ultrasonography and the prospective mother is undergoing invasive prenatal diagnosis, chromosomal microarray analysis can replace the need for fetal karyotyping (31).

A complexity associated with the reporting of genome-wide abnormalities is the increased prevalence of maternal-only and maternal-and-fetal abnormalities. The analyte queried by noninvasive testing is always a mixture of maternal and fetal DNA and although the maternal copy number for chromosomes 21, 13, and 18 is highly likely to be known in advance, this is not necessarily the case for other genomic abnormalities. A recently published study (32) demonstrated how a significant fraction of the NIPT FP results for chromosome X are due to mosaic maternal abnormalities. The solution proposed in the aforementioned report is to process the maternal-only component (buffy coat) for many if not all the samples undergoing NIPT. Currently our algorithms detect and report any genomic abnormality (of sufficient signal-to-noise ratio), regardless of whether its origin is fetal, maternal, or a combination. An issue common to both the genome-wide array CGH invasive testing as well as sequencing-based NIPT is that both have the potential for findings of unknown significance. A conservative approach is to limit the findings to a subset of categories of high clinical relevance. This was indeed the strategy behind the selection of the 6 conditions we have investigated in our study by means of sheared gDNA + ccf DNA mixtures. This is only a temporary and intentional limitation: going forward, whole-genome sequencing–based noninvasive testing will have the benefit of the improved knowledge of the human genome as derived from the findings of microarray analysis.

Acknowledgments

The authors thank Dongyan Liu and Zhanyang Zhu for performing the sequencing data processing; Amin R. Mazloom and Sung K. Kim for providing critical discussions during the project; Sarah L. Kinnings for assistance with the visualization of the genomic data; Jason Chibuk for interpreting the plasma sample karyotype information; and Rick Hockett for providing several of the genomic DNA samples used in the analytical validation. We thank Graham McLennan for his assistance in the management of the clinical samples. Finally, we thank Margo Maeder for all her project management support.

Footnotes

  • ↵3 Nonstandard abbreviations:

    ccf,
    circulating cell-free;
    NIPT,
    noninvasive prenatal testing;
    MPS,
    massively parallel sequencing;
    CNV,
    copy number variation;
    LOESS,
    locally weighted scatterplot smoothing;
    FP,
    false positive;
    FN,
    false negative;
    MDs,
    microdeletions or microduplications;
    PCA,
    principal component analysis;
    CBS,
    circular binary segmentation;
    CGH,
    comparative genomic hybridization;
    LOR,
    log odds ratio;
    NPP,
    nonpregnant female plasma;
    gDNA,
    genomic DNA;
    FISH,
    fluorescence in situ hybridization;
    TP,
    true positive;
    LoD,
    limit of detection.

  • Author Contributions: All authors confirmed they have contributed to the intellectual content of this paper and have met the following 3 requirements: (a) significant contributions to the conception and design, acquisition of data, or analysis and interpretation of data; (b) drafting or revising the article for intellectual content; and (c) final approval of the published article.

  • Authors' Disclosures or Potential Conflicts of Interest: Upon manuscript submission, all authors completed the author disclosure form. Disclosures and/or potential conflicts of interest:

  • Employment or Leadership: C. Zhao, Sequenom Laboratories; J. Tynan, Sequenom; M. Ehrich, Sequenom; G. Hannum, Sequenom; R. McCullough, Sequenom; J.-S. Saldivar, Sequenom Laboratories; P. Oeth, Sequenom Center for Molecular Medicine; D. van den Boom, Sequenom; C. Deciu, Sequenom Laboratories and Illumina.

  • Consultant or Advisory Role: C. Deciu, Sequenom Laboratories.

  • Stock Ownership: C. Zhao, Sequenom Laboratories; J. Tynan, Sequenom; M. Ehrich, Sequenom; G. Hannum, Sequenom; R. McCullough, Sequenom; J.-S. Saldivar, Sequenom; P. Oeth, Sequenom; D. van den Boom, Sequenom; C. Deciu, Sequenom Laboratories and Illumina.

  • Honoraria: None declared.

  • Research Funding: None declared.

  • Expert Testimony: C. Deciu, Sequenom.

  • Patents: C. Zhao, patent number SEQ-6068-PC; C. Deciu, patent numbers US 8,688,388, 20130338933, 20130325360, 20130310260, 20130309666, 20130304392, 20130288244, 20130261983, 20130150253, 20130130923, 20130103320, and 20130085681.

  • Role of Sponsor: No sponsor was declared.

  • Received for publication September 25, 2014.
  • Accepted for publication January 30, 2015.
  • © 2015 American Association for Clinical Chemistry

References

  1. 1.↵
    1. Lo YM,
    2. Corbetta N,
    3. Chamberlain PF,
    4. Rai V,
    5. Sargent IL,
    6. Redman CW,
    7. Wainscoat JS
    . Presence of fetal DNA in maternal plasma and serum. Lancet 1997;350:485–7.
    OpenUrlCrossRefPubMed Order article via Infotrieve
  2. 2.↵
    1. Lo YM,
    2. Tein MS,
    3. Lau TK,
    4. Haines CJ,
    5. Leung TN,
    6. Poon PM,
    7. et al
    . Quantitative analysis of fetal DNA in maternal plasma and serum: Implications for noninvasive prenatal diagnosis. Am J Hum Genet 1998;62:768–75.
    OpenUrlCrossRefPubMed Order article via Infotrieve
  3. 3.↵
    1. Chiu RW,
    2. Chan KC,
    3. Gao Y,
    4. Lau VY,
    5. Zheng W,
    6. Leung TY,
    7. et al
    . Noninvasive prenatal diagnosis of fetal chromosomal aneuploidy by massively parallel genomic sequencing of DNA in maternal plasma. Proc Natl Acad Sci U S A 2008;105:20458–63.
    OpenUrlAbstract/FREE Full Text
  4. 4.↵
    1. Ehrich M,
    2. Deciu C,
    3. Zwiefelhofer T,
    4. Tynan JA,
    5. Cagasan L,
    6. Tim R,
    7. et al
    . Noninvasive detection of fetal trisomy 21 by sequencing of DNA in maternal blood: A study in a clinical setting. Am J Obstet Gynecol 2011;204:205 e1–11.
    OpenUrl
  5. 5.
    1. Fan HC,
    2. Blumenfeld YJ,
    3. Chitkara U,
    4. Hudgins L,
    5. Quake SR
    . Noninvasive diagnosis of fetal aneuploidy by shotgun sequencing DNA from maternal blood. Proc Natl Acad Sci U S A 2008;105:16266–71.
    OpenUrlAbstract/FREE Full Text
  6. 6.↵
    1. Palomaki GE,
    2. Deciu C,
    3. Kloza EM,
    4. Lambert-Messerlian GM,
    5. Haddow JE,
    6. Neveux LM,
    7. et al
    . DNA sequencing of maternal plasma reliably identifies trisomy 18 and trisomy 13 as well as down syndrome: An international collaborative study. Genet Med 2012;14:296–305.
    OpenUrlCrossRefPubMed Order article via Infotrieve
  7. 7.
    1. Palomaki GE,
    2. Kloza EM,
    3. Lambert-Messerlian GM,
    4. Haddow JE,
    5. Neveux LM,
    6. Ehrich M,
    7. et al
    . DNA sequencing of maternal plasma to detect down syndrome: An international clinical validation study. Genet Med 2011;13:913–20.
    OpenUrlCrossRefPubMed Order article via Infotrieve
  8. 8.↵
    1. Jensen TJ,
    2. Zwiefelhofer T,
    3. Tim RC,
    4. Dzakula Z,
    5. Kim SK,
    6. Mazloom AR,
    7. et al
    . High-throughput massively parallel sequencing for fetal aneuploidy detection from maternal plasma. PLoS One 2013;8:e57381.
    OpenUrlCrossRefPubMed Order article via Infotrieve
  9. 9.↵
    1. Jensen TJ,
    2. Dzakula Z,
    3. Deciu C,
    4. van den Boom D,
    5. Ehrich M
    . Detection of microdeletion 22q11.2 in a fetus by next-generation sequencing of maternal plasma. Clin Chem 2012;58:1148–51.
    OpenUrlAbstract/FREE Full Text
  10. 10.↵
    1. Srinivasan A,
    2. Bianchi DW,
    3. Huang H,
    4. Sehnert AJ,
    5. Rava RP
    . Noninvasive detection of fetal subchromosome abnormalities via deep sequencing of maternal plasma. Am J Hum Genet 2013;92:167–76.
    OpenUrlCrossRefPubMed Order article via Infotrieve
  11. 11.↵
    1. Yu SC,
    2. Jiang P,
    3. Choy KW,
    4. Chan KC,
    5. Won HS,
    6. Leung WC,
    7. et al
    . Noninvasive prenatal molecular karyotyping from maternal plasma. PLoS One 2013;8:e60968.
    OpenUrlCrossRefPubMed Order article via Infotrieve
  12. 12.↵
    1. Straver R,
    2. Sistermans EA,
    3. Holstege H,
    4. Visser A,
    5. Oudejans CB,
    6. Reinders MJ
    . Wisecondor: detection of fetal aberrations from shallow sequencing maternal plasma based on a within-sample comparison scheme. Nucleic Acids Res 2014;42:e31.
    OpenUrlAbstract/FREE Full Text
  13. 13.↵
    1. Chen S,
    2. Lau TK,
    3. Zhang C,
    4. Xu C,
    5. Xu Z,
    6. Hu P,
    7. et al
    . A method for noninvasive detection of fetal large deletions/duplications by low coverage massively parallel sequencing. Prenatal Diagnosis 2013;33:584–90.
    OpenUrlCrossRefPubMed Order article via Infotrieve
  14. 14.↵
    1. Rampášek L,
    2. Arbabi A,
    3. Brudno M
    . Probabilistic method for detecting copy number variation in a fetal genome using maternal plasma sequencing. Bioinformatics 2014;30:i212–8.
    OpenUrlAbstract/FREE Full Text
  15. 15.↵
    1. Langmead B,
    2. Salzberg SL
    . Fast gapped-read alignment with bowtie 2. Nat Methods 2012;9:357–9.
    OpenUrlCrossRefPubMed Order article via Infotrieve
  16. 16.↵
    1. Alkan C,
    2. Kidd JM,
    3. Marques-Bonet T,
    4. Aksay G,
    5. Antonacci F,
    6. Hormozdiari F,
    7. et al
    . Personalized copy number and segmental duplication maps using next-generation sequencing. Nat Genet 2009;41:1061–7.
    OpenUrlCrossRefPubMed Order article via Infotrieve
  17. 17.↵
    1. Price AL,
    2. Patterson NJ,
    3. Plenge RM,
    4. Weinblatt ME,
    5. Shadick NA,
    6. Reich D
    . Principal components analysis corrects for stratification in genome-wide association studies. Nat Genet 2006;38:904–9.
    OpenUrlCrossRefPubMed Order article via Infotrieve
  18. 18.↵
    1. Leek JT,
    2. Storey JD
    . Capturing heterogeneity in gene expression studies by surrogate variable analysis. PLoS Genet 2007;3:1724–35.
    OpenUrlCrossRefPubMed Order article via Infotrieve
  19. 19.↵
    1. Olshen AB,
    2. Venkatraman ES,
    3. Lucito R,
    4. Wigler M
    . Circular binary segmentation for the analysis of array-based DNA copy number data. Biostatistics 2004;5:557–72.
    OpenUrlAbstract
  20. 20.↵
    1. Venkatraman ES,
    2. Olshen AB
    . A faster circular binary segmentation algorithm for the analysis of array cgh data. Bioinformatics 2007;23:657–63.
    OpenUrlAbstract/FREE Full Text
  21. 21.↵
    1. Lai WR,
    2. Johnson MD,
    3. Kucherlapati R,
    4. Park PJ
    . Comparative analysis of algorithms for identifying amplifications and deletions in array cgh data. Bioinformatics 2005;21:3763–70.
    OpenUrlAbstract/FREE Full Text
  22. 22.↵
    1. Willenbrock H,
    2. Fridlyand J
    . A comparison study: Applying segmentation to array CGH data for downstream analyses. Bioinformatics 2005;21:4084–91.
    OpenUrlAbstract/FREE Full Text
  23. 23.↵
    1. Dohm JC,
    2. Lottaz C,
    3. Borodina T,
    4. Himmelbauer H
    . Substantial biases in ultra-short read data sets from high-throughput DNA sequencing. Nucleic Acids Res 2008;36:e105.
    OpenUrlAbstract/FREE Full Text
  24. 24.↵
    1. Benjamini Y,
    2. Speed TP
    . Summarizing and correcting the GC content bias in high-throughput sequencing. Nucleic Acids Res 2012;40:e72.
    OpenUrlAbstract/FREE Full Text
  25. 25.↵
    1. Firth HV,
    2. Richards SM,
    3. Bevan AP,
    4. Clayton S,
    5. Corpas M,
    6. Rajan D,
    7. et al
    . Decipher: Database of chromosomal imbalance and phenotype in humans using ensembl resources. Am J Hum Genet 2009;84:524–33.
    OpenUrlCrossRefPubMed Order article via Infotrieve
  26. 26.↵
    1. Geis J,
    2. Kinnings S,
    3. Almarsri E,
    4. Wang H,
    5. Guan X,
    6. McCullough R,
    7. et al
    . Fetal fraction distributions in noninvasive prenatal testing (NIPT) for multifetal gestations. Presented at: 2014 ACMG Annual Clinical Genetics Meeting; 2014 Mar 25–29; Nashville, TN.
  27. 27.↵
    1. Kinnings S,
    2. Almarsri E,
    3. Bombard A,
    4. Deciu C,
    5. Geis J,
    6. Guan X,
    7. et al
    . Factors affecting levels of circulating cell-free fetal DNA in maternal plasma and their implications for noninvasive prenatal testing. ACMG Annu Clinical Genetics Meeting, Vol. Nashville, Tennessee., 2014.
  28. 28.↵
    1. Newcombe RG
    . Interval estimation for the difference between independent proportions: Comparison of eleven methods. Statistics in Medicine 1998;17:873–90.
    OpenUrlCrossRefPubMed Order article via Infotrieve
  29. 29.↵
    1. Baldwin EL,
    2. Lee JY,
    3. Blake DM,
    4. Bunke BP,
    5. Alexander CR,
    6. Kogan AL,
    7. et al
    . Enhanced detection of clinically relevant genomic imbalances using a targeted plus whole genome oligonucleotide microarray. Genet Med 2008;10:415–29.
    OpenUrlCrossRefPubMed Order article via Infotrieve
  30. 30.↵
    1. Wapner RJ,
    2. Martin CL,
    3. Levy B,
    4. Ballif BC,
    5. Eng CM,
    6. Zachary JM,
    7. et al
    . Chromosomal microarray versus karyotyping for prenatal diagnosis. N Engl J Med 2012;367:2175–84.
    OpenUrlCrossRefPubMed Order article via Infotrieve
  31. 31.↵
    American College of Obstetricians and Gynecologists. The use of chromosomal microarray analysis in prenatal diagnosis. Committee opinion no. 581. Obstet Gynecol 2013;122:1374–7.
    OpenUrlCrossRefPubMed Order article via Infotrieve
  32. 32.↵
    1. Wang Y,
    2. Chen Y,
    3. Tian F,
    4. Zhang J,
    5. Song Z,
    6. Wu Y,
    7. et al
    . Maternal mosaicism is a significant contributor to discordant sex chromosomal aneuploidies associated with noninvasive prenatal testing. Clin Chem 2014;60:251–9.
    OpenUrlAbstract/FREE Full Text
View Abstract
PreviousNext
Back to top

In this issue

Clinical Chemistry: 61 (4)
Vol. 61, Issue 4
April 2015
  • Table of Contents
  • About the Cover
  • Index by author
  • Table of Contents (PDF)
  • Cover (PDF)
  • Advertising (PDF)
  • Ed Board (PDF)
  • Front Matter (PDF)
  • Audio summary of this issue
Print
Share
Detection of Fetal Subchromosomal Abnormalities by Sequencing Circulating Cell-Free DNA from Maternal Plasma
Chen Zhao, John Tynan, Mathias Ehrich, Gregory Hannum, Ron McCullough, Juan-Sebastian Saldivar, Paul Oeth, Dirk van den Boom, Cosmin Deciu
Clinical Chemistry Apr 2015, 61 (4) 608-616; DOI: 10.1373/clinchem.2014.233312
del.icio.us logo Digg logo Reddit logo Twitter logo CiteULike logo Facebook logo Google logo Mendeley logo
Article Alerts
Sign In to Email Alerts with your Email Address
Citation Tools
Detection of Fetal Subchromosomal Abnormalities by Sequencing Circulating Cell-Free DNA from Maternal Plasma
Chen Zhao, John Tynan, Mathias Ehrich, Gregory Hannum, Ron McCullough, Juan-Sebastian Saldivar, Paul Oeth, Dirk van den Boom, Cosmin Deciu
Clinical Chemistry Apr 2015, 61 (4) 608-616; DOI: 10.1373/clinchem.2014.233312

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero

  • Tweet Widget
  • Facebook Like
  • Google Plus One

Jump to section

  • Article
    • Abstract
    • Methods
    • Results
    • Discussion
    • Acknowledgments
    • Footnotes
    • References
  • Figures & Data
  • Info & Metrics
  • PDF

Related Articles

Cited By...

More in this TOC Section

  • Evaluation of the Non-HDL Cholesterol to Apolipoprotein B Ratio as a Screening Test for Dysbetalipoproteinemia
  • Low-Grade Inflammation in the Association between Mild-to-Moderate Hypertriglyceridemia and Risk of Acute Pancreatitis: A Study of More Than 115000 Individuals from the General Population
  • Concurrent Single-Cell RNA and Targeted DNA Sequencing on an Automated Platform for Comeasurement of Genomic and Transcriptomic Signatures
Show more Articles

Similar Articles

Subjects

  • SUBJECT AREAS
    • Molecular Diagnostics and Genetics

Options

  • Home
  • About
  • Articles
  • Information for Authors
  • Resources
  • Abstracts
  • Submit
  • Contact
  • RSS

Other Publications

  • The Journal of Applied Laboratory Medicine
Footer logo

© 2019 American Association for Clinical Chemistry

Powered by HighWire