Open Access Research article

Distinct chromatin features characterize different classes of repeat sequences in Drosophila melanogaster

Kristina Krassovsky12 and Steven Henikoff13*

Author Affiliations

1 Basic Sciences Division, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, Seattle, Washington 98109, USA

2 Molecular and Cellular Biology Program, University of Washington, Seattle, Washington 98195, USA

3 Howard Hughes Medical institute, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA

For all author emails, please log on.

BMC Genomics 2014, 15:105  doi:10.1186/1471-2164-15-105

Published: 6 February 2014



Repeat sequences are abundant in eukaryotic genomes but many are excluded from genome assemblies. In Drosophila melanogaster classical studies of repeat content suggested variability between individuals, but they lacked the precision of modern high throughput sequencing technologies. Genome-wide profiling of chromatin features such as histone tail modifications and DNA-binding proteins relies on alignment to the reference genome and hence excludes highly repetitive sequences.


By analyzing repeat libraries, sequence complexity and k-mer counts we determined the abundances of different D. melanogaster repeat classes in flies in two public datasets, DGRP and modENCODE. We found that larval DNA was depleted of all repeat classes relative to adult and embryonic DNA, as expected from the known depletion of repeat-rich pericentromeric regions during polytenization of larval tissues. By applying a method that is independent of alignment to the genome assembly, we found that satellite repeats associate with distinct H3 tail modifications, such as H3K9me2 and H3K9me3 for short repeats and H3K9me1 for 359 bp repeats. Short AT-rich repeats however are depleted of nucleosomes and hence all histone modifications and associated chromatin proteins.


The total repeat content and association of repeat sequences with chromatin modifications can be determined despite repeats being excluded from genome assemblies, revealing unexpected distinctions in chromatin features based on sequence composition.

DNA satellites; Next-generation sequencing; ChIP-seq; Histone modification