Description

The cancer genome is abnormal genome, and the ability to monitor its sequence had undergone a technological revolution. Yet prognosis and diagnosis remain an expert-based decision, with only limited abilities to provide machine-based decisions. We introduce a heterogeneity-based method for

The cancer genome is abnormal genome, and the ability to monitor its sequence had undergone a technological revolution. Yet prognosis and diagnosis remain an expert-based decision, with only limited abilities to provide machine-based decisions. We introduce a heterogeneity-based method for stratifying and visualizing whole-genome sequencing (WGS) reads. This method uses the heterogeneity within WGS reads to markedly reduce the dimensionality of next-generation sequencing data; it is available through the tool HiBS (Heterogeneity-Based Subclassification) that allows cancer sample classification. We validated HiBS using >200 WGS samples from nine different cancer types from The Cancer Genome Atlas (TCGA). With HiBS, we show progress with two WGS related issues: (i) differentiation between normal (NB) and tumor (TP) samples based solely on the information structure of their WGS data, and (ii) identification of specific regions of chromosomal amplification/deletion and their association with tumor stage. By comparing results to those obtained through available WGS analyses tools, we demonstrate some of the novelties obtained by the approach implemented in HiBS and also show nearly perfect normal/tumor classification, used to identify known and unknown chromosomal aberrations. Finally, the HiBS index has been associated with breast cancer tumor stage.

Reuse Permissions
  • Downloads
    PDF (3.7 MB)

    Details

    Title
    • Quantification of Read Species Behavior Within Whole Genome Sequencing of Cancer Genomes for the Stratification and Visualization of Genomic Variation
    Contributors
    Date Created
    2016-01-24
    Resource Type
  • Text
  • Collections this item is in
    Identifier
    • Digital object identifier: 10.1093/nar/gkw031
    • Identifier Type
      International standard serial number
      Identifier Value
      1362-4962
    • Identifier Type
      International standard serial number
      Identifier Value
      0305-1048
    Note
    • The final version of this article, as published in Nucleic Acids Research, can be viewed online at: https://academic.oup.com/nar/article-lookup/doi/10.1093/nar/gkw031

    Citation and reuse

    Cite this item

    This is a suggested citation. Consult the appropriate style guide for specific citation guidelines.

    Hibsh, D., Buetow, K. H., Yaari, G., & Efroni, S. (2016). Quantification of read species behavior within whole genome sequencing of cancer genomes for the stratification and visualization of genomic variation. Nucleic Acids Research, 44(9). doi:10.1093/nar/gkw031

    Machine-readable links