CTLPScanner: A Web Server for Chromothripsis-like Pattern Detection
 ScanSearchDownloadTutorialAboutContact  
 

CTLPScanner Interface

  • Data input
  • Parameters & Thresholds
  • Output

    Public Data Exploration

  • Data processing pipeline
  • Input & Output

    Case Study

  • Chromothripsis in the original publication
  • Chromothripsis and BFBs

  •  

    CTLPScanner Interface

    1. Data input

    The detection of chromothripsis event is based on DNA copy number segmentation data, which can be generated by array Comparative Genomic Hybridisation (aCGH) or Single Nucleotide Polymorphism (SNP) array platforms. The minimum required data contents include the sample name, chromosome number, start and stop positions of each segment, and normalized signal intensity. The input file should be tab-delimited with an optional first line identifying the columns. Table 1 indicates an example of the input data format and content.

    The uploaded data file should be plain text format with tab separator. The supported file types include: .txt, .csv, .tab, .zip. The minimum required data fields for CTLPScanner:
    1. sample: The name of the data file. This name will be displayed in the figure of the result page.
    2. chro: The chromosome identifier (e.g. chr1, chr6).
    3. start: The starting position of the segment.
    4. stop: The ending position of the segment.
    5. mean: The normalized fluorescence intensity of the segment. It is usually log2 transformed.

    The segment data can be “copy and paste” into the text box directly or uploaded as a flat file (Figure 1). A compressed file (.zip) can be submitted for batch processing.

    Figure 1. The data input interface.

     

    2. Parameters & thresholds

    CTLPScanner offers a set of parameters for accurate detection of chromothripsis. The web server provides optimized default values for all parameters, which may also be adjusted for customized screening (Figure 2).

    Figure 2. The parameters and thresholds for chromothripsis detection.

    (1) Genome assembly: The human reference genome assembly. The available versions range from NCBI34/hg16 to NCBI38/hg38.

    (2) Copy number status change times: The oscillation of copy number status change is one of the most striking features of chromothripsis. The default value is 20, which was used by the original study that reported chromothripsis phenomenon.

    (3) Log10 of likelihood ratio: The clustering of chromosomal breakpoints is a key feature of chromothripsis. This parameter is used to evaluate the statistical significance of breakpoints clustering.

    (4) Minimum segment size (Kb): This parameter is used to filter out the small segments that may be generated by background noise or hyper-segmentation. The default value is 10Kb, which means that segments smaller than 10 Kb will be discarded.

    (5) Signal distance between adjacent segments: Due to local correlation effects between probes or the existence of background noise, the segmentation profiles occasionally exhibit subtle striation patterns. This pattern is constituted with a large number of small segments, which is unlikely to be a biological phenomenon. To reduce artificial copy number status change, the distance of signal intensity between adjacent segments was introduced. If the distance of two adjacent segments differed by less than this threshold, the copy number status change was not considered.

     

    The customized visualization parameters are provided by CTLPScanner (Figure 3).

    Figure 3. The visualization options.

    (1) The thresholds for calling genomic gains and losses: These values are used to “call” genomic gains and losses. The default values are 0.15 and -0.15, which are commonly used across multiple array platforms.

    (2) Color for genomic gains or losses: The colors used to indicate copy number gains and losses.

    (3) Show detected chromothripsis-like region only: If the check box is checked, only the pulverized chromosome will be displayed.

     

    3. Output

    Parameters & Thresholds: The parameters and thresholds defined by users to perform chromothripsis detection. Users can reuse these parameters for chromothripsis detection or re-analyze the same sample with changed thresholds (Figure 4).

    Figure 4. The parameters and thresholds used are shown on the result page.

     

    Chromothripsis Region: In identified chromothripsis region, the two main features of chromothripsis (CNA status change times & Likelihood ratio) are shown (Figure 5). The result file can be downloaded.

    Figure 5. The detected chromothripsis region.

     

    Chromothripsis & Copy Number Aberrations Plots:

    If a chromothripsis event is detected, a general view of copy number aberration profile of the sample will be shown (Figure 6). In the plot for each chromosome, the chromothripsis region will be highlighted with a light orange rectangle (Figure 7).

    Figure 6. The general view of copy number aberration profile .

     

    Figure 7. An example of result visualization.

    (1) For each chromosome, the “called” genomic gains and losses are shown according to user-defined colors. The X and Y axes represent genomic positions and log ratio of segments, respectively. The orange rectangle represents identified chromothripsis regions.
    (2) Users can zoom in specific genomic regions by inputting the start and stop locations.
    (3) The histogram of copy number segment switches. It facilitates the identification of chromothripsis event arising in conjunction with telomere crises. The patterns and distribution of copy number state switches indicate the emergence and temporal order of the major genomic rearrangement events.

     

    Cancer Genes: COSMIC (Catalogue Of Somatic Mutations In Cancer) cancer genes that are involved in the detected chromothripsis regions (Figure 8). These genes are affected by genomic gain or loss, which are represented by user-defined colors. The "segment size" indicates the size of copy number aberrations that the gene involved in. In general, the relatively small CNAs, which known as "focus changes", are in favour of identifying "driver" cancer genes.

    Figure 8. The cancer genes that are involved in the detected chromothripsis regions.

     

    Scatter Plot: CTLPScanner creates a scatter plot to compare the two decision parameters, i.e. copy number status change times and likelihood ratio (Figure 9). Each point in the plot represents a chromosome, and the candidates falling in the upper right area are inferred chromothripsis cases. The closer the point is to the top-right corner, the more accurately the algorithm can distinguish chromothripsis from non-chromothripsis cases.

    Figure 9. The scatter plot of detected chromothripsis events.

     

     

    top

    Public Data Exploration

    1. Data processing pipeline

    CTLPScanner allows users to explore potential chromothripsis events from public data sources with self-defined constraints. It integrates more than 50,000 pre-processed genomic arrays, and all the data were converted into segmentation data format. The pipeline of raw array data processing is shown in Figure 10.

    The Cancer Genome Atlas (TCGA) copy number aberration data (SNP array) are also provided through CTLPScanner. There are 22,419 samples representing 33 cancer types. Data of “level 3” were collected and processed. The TCGA sample ID was extracted from the original name of each data file.

    Figure 10. The pipeline for GEO array data processing.

     

    2. Input & Output

    CTLPScanner allows users to search either by a single array or by a data series (Figure 11). The B allele frequency plot is provided by CTLPScanner to facilitate the identification of chromothripsis in context of hyperploidy (Figure 12).

    Figure 11. The data input interface for public data exploration module.

     

    Figure 12. The B allele frequency plot.


    (1) For search by array, users can specify the array accession ID to initiate the screening process.
    (2) For search by series, users can select tumor type in the selection box, which contains over 40 different cancer types as classified by tumor loci. The content of the batch accession list in the next selection box is dynamically determined by user’s selection of cancer type.
    (3) The entire list of array and series accession IDs can be downloaded.
    (4) To facilitate the identification of chromothripsis in hyperploidy samples, the B allele frequency (BAF) plots for SNP arrays are provided.

     

     

    top

    Case Study

    1. Chromothripsis in the original publication

    The small cell lung cancer cell line SCLC-21H was described in the “original” publication (Stephens PJ, et al. Cell,144(1):27-40, 2011) that coined the term “chromothripsis”. The cell line was analyzed by NimbleGen 85k oligo array (GSM784216). CTLPScanner reliably detected the chromothripsis event on chromosome 8 (Figure 13). This is consistent with the result from the “original” publication of chromothripsis.

    Figure 13. Chromothripsis on chromosome 8 of SCLC-21H.

     

    2. Chromothripsis and BFBs

    Recent studies revealed a potential link between chromothripsis and telomere dysfunction (Li Y, et al. Nature, 508:98-102, 2014; Mardin BR, et al. Mol. Syst. Biol. 11, 828, 2015). The patterns and distribution of copy number state switches indicate the emergence and temporal order of the major genomic rearrangement events. CTLPScanner provides a histogram of copy number segment switches to facilitate the identification of chromothripsis event arising in conjunction with telomere crises.

    The pattern of chromosome 18 of GSM681811 showed a stepwise increase in copy number toward the chromosome end, followed by a sharp drop. It indicates that chromothripsis may occur subsequent to initiating breakage fusion bridge cycles (BFBs) (Figure 14).

    Figure 14. Chromothripsis on chromosome 18 of GSM681811.

     

    The pattern of chromosome 1 of GSM681798 showed copy number oscillates among three or more states. This pattern indicates that chormothripsis may be the initiating event (Figure 15).

    Figure 15. Chromothripsis on chromosome 1 of GSM681798.

    top