Document Type

Article

Publication Date

3-4-2024

Identifier

DOI: 10.1093/bioinformatics/btae136; PMCID: PMC10963074

Abstract

MOTIVATION: Human epigenomic data has been generated by large consortia for thousands of cell types to be used as a reference map of normal and disease chromatin states. Since epigenetic data contains potentially identifiable information, similarly to genetic data, most raw files generated by these consortia are stored in controlled-access databases. It is important to protect identifiable information, but this should not hinder secure sharing of these valuable datasets.

RESULTS: Guided by the Framework for responsible sharing of genomic and health-related data from the Global Alliance for Genomics and Health (GA4GH), we have developed an approach and a tool to facilitate the exploration of epigenomics datasets' aggregate results, while filtering out identifiable information. Specifically, the EpiVar Browser allows a user to navigate an epigenetic dataset from a cohort of individuals and enables direct exploration of genotype-chromatin phenotype relationships. Because individual genotypes and epigenetic signal tracks are not directly accessible, and rather aggregated in the portal output, no identifiable data is released, yet the interface allows for dynamic genotype-epigenome interrogation. This approach has the potential to accelerate analyses that would otherwise require a lengthy multi-step approval process and provides a generalizable strategy to facilitate responsible access to sensitive epigenomics data.

AVAILABILITY AND IMPLEMENTATION: Online portal: https://computationalgenomics.ca/tools/epivar; EpiVar Browser source code: https://github.com/c3g/epivar-browser; bw-merge-window tool source code: https://github.com/c3g/bw-merge-window.

Journal Title

Bioinformatics (Oxford, England)

Volume

40

Issue

3

MeSH Keywords

Humans; Epigenomics; Software; Genome; Genomics; Chromatin

Keywords

Epigenomics; Software; Genome; Genomics; Chromatin

Comments

Grants and funding

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

Publisher's Link: https://doi.org/10.1093/bioinformatics/btae136

Share

COinS