FB2024_02 , released April 23, 2024
Reference Report
Open Close
Reference
Citation
Arbel, H., Basu, S., Fisher, W.W., Hammonds, A.S., Wan, K.H., Park, S., Weiszmann, R., Booth, B.W., Keranen, S.V., Henriquez, C., Shams Solari, O., Bickel, P.J., Biggin, M.D., Celniker, S.E., Brown, J.B. (2019). Exploiting regulatory heterogeneity to systematically identify enhancers with high accuracy.  Proc. Natl. Acad. Sci. U.S.A. 116(3): 900--908.
FlyBase ID
FBrf0241230
Publication Type
Research paper
Abstract
Identifying functional enhancer elements in metazoan systems is a major challenge. Large-scale validation of enhancers predicted by ENCODE reveal false-positive rates of at least 70%. We used the pregrastrula-patterning network of Drosophila melanogaster to demonstrate that loss in accuracy in held-out data results from heterogeneity of functional signatures in enhancer elements. We show that at least two classes of enhancers are active during early Drosophila embryogenesis and that by focusing on a single, relatively homogeneous class of elements, greater than 98% prediction accuracy can be achieved in a balanced, completely held-out test set. The class of well-predicted elements is composed predominantly of enhancers driving multistage segmentation patterns, which we designate segmentation driving enhancers (SDE). Prediction is driven by the DNA occupancy of early developmental transcription factors, with almost no additional power derived from histone modifications. We further show that improved accuracy is not a property of a particular prediction method: after conditioning on the SDE set, naïve Bayes and logistic regression perform as well as more sophisticated tools. Applying this method to a genome-wide scan, we predict 1,640 SDEs that cover 1.6% of the genome. An analysis of 32 SDEs using whole-mount embryonic imaging of stably integrated reporter constructs chosen throughout our prediction rank-list showed >90% drove expression patterns. We achieved 86.7% precision on a genome-wide scan, with an estimated recall of at least 98%, indicating high accuracy and completeness in annotating this class of functional elements.
PubMed ID
PubMed Central ID
PMC6338827 (PMC) (EuropePMC)
Related Publication(s)
Personal communication to FlyBase

CEP-GAL4 constructs and insertions.
Celniker and Fisher, 2017.12.19, CEP-GAL4 constructs and insertions. [FBrf0237683]

Information on newly characterized CRMs: location.
Celniker, 2017.10.9, Information on newly characterized CRMs: location. [FBrf0236871]

Associated Information
Comments
Associated Files
Other Information
Secondary IDs
    Language of Publication
    English
    Additional Languages of Abstract
    Parent Publication
    Publication Type
    Journal
    Abbreviation
    Proc. Natl. Acad. Sci. U.S.A.
    Title
    Proceedings of the National Academy of Sciences of the United States of America
    Publication Year
    1915-
    ISBN/ISSN
    0027-8424
    Data From Reference
    Alleles (33)
    Genes (19)
    Sequence Features (8)
    Natural transposons (1)
    Insertions (34)
    Experimental Tools (1)
    Transgenic Constructs (33)