Open Access Research article

Structural analysis of the genome of breast cancer cell line ZR-75-30 identifies twelve expressed fusion genes

Ina Schulte1, Elizabeth M Batty13, Jessica CM Pole14, Katherine A Blood15, Steven Mo16, Susanna L Cooke27, Charlotte Ng28, Kevin L Howe29, Suet-Feung Chin2, James D Brenton2, Carlos Caldas2, Karen D Howarth1* and Paul AW Edwards1*

Author Affiliations

1 Hutchison/MRC Research Centre and Department of Pathology, University of Cambridge, Cambridge, UK

2 Cancer Research UK Cambridge Research Institute and Department of Oncology, University of Cambridge, Li Ka-Shing Centre, Cambridge, UK

3 Current addresses: Department of Statistics, University of Oxford, 1 South Parks Road, Oxford, OX1 3TG, UK

4 Current addresses: BlueGnome Ltd, CPC4, Capital Park, Fulbourn, Cambridge, CB21 5XE, UK

5 Current addresses: Department of Medical Genetics, University of British Columbia, Vancouver, BC, V6H 2N1, Canada

6 Current addresses: Institute of Biomedical Engineering, Department of Engineering Science, University of Oxford, Oxford, OX3 7DQ, UK

7 Current addresses: Cancer Genome Project, Wellcome Trust Sanger Institute, Hinxton, Cambridgeshire, CB10 1SA, UK

8 Current addresses: Breakthrough Breast Cancer Research Centre, Institute of Cancer Research, 237 Fulham Road, London, SW3 6JB, UK

9 Current addresses: European Bioinformatics Institute, Hinxton, Cambridgeshire, CB10 1SD, UK

For all author emails, please log on.

BMC Genomics 2012, 13:719  doi:10.1186/1471-2164-13-719

Published: 22 December 2012



It has recently emerged that common epithelial cancers such as breast cancers have fusion genes like those in leukaemias. In a representative breast cancer cell line, ZR-75-30, we searched for fusion genes, by analysing genome rearrangements.


We first analysed rearrangements of the ZR-75-30 genome, to around 10kb resolution, by molecular cytogenetic approaches, combining array painting and array CGH. We then compared this map with genomic junctions determined by paired-end sequencing. Most of the breakpoints found by array painting and array CGH were identified in the paired end sequencing—55% of the unamplified breakpoints and 97% of the amplified breakpoints (as these are represented by more sequence reads). From this analysis we identified 9 expressed fusion genes: APPBP2-PHF20L1, BCAS3-HOXB9, COL14A1-SKAP1, TAOK1-PCGF2, TIAM1-NRIP1, TIMM23-ARHGAP32, TRPS1-LASP1, USP32-CCDC49 and ZMYM4-OPRD1. We also determined the genomic junctions of a further three expressed fusion genes that had been described by others, BCAS3-ERBB2, DDX5-DEPDC6/DEPTOR and PLEC1-ENPP2. Of this total of 12 expressed fusion genes, 9 were in the coamplification. Due to the sensitivity of the technologies used, we estimate these 12 fusion genes to be around two-thirds of the true total. Many of the fusions seem likely to be driver mutations. For example, PHF20L1, BCAS3, TAOK1, PCGF2, and TRPS1 are fused in other breast cancers. HOXB9 and PHF20L1 are members of gene families that are fused in other neoplasms. Several of the other genes are relevant to cancer—in addition to ERBB2, SKAP1 is an adaptor for Src, DEPTOR regulates the mTOR pathway and NRIP1 is an estrogen-receptor coregulator.


This is the first structural analysis of a breast cancer genome that combines classical molecular cytogenetic approaches with sequencing. Paired-end sequencing was able to detect almost all breakpoints, where there was adequate read depth. It supports the view that gene breakage and gene fusion are important classes of mutation in breast cancer, with a typical breast cancer expressing many fusion genes.

Breast cancer; Chromosome aberrations; Genomics; Fusion genes