DNA methylation patterns associate with genetic and gene expression variation in HapMap cell lines

icon

13

pages

icon

English

icon

Documents

2011

Écrit par

Publié par

Lire un extrait
Lire un extrait

Obtenez un accès à la bibliothèque pour le consulter en ligne En savoir plus

Découvre YouScribe et accède à tout notre catalogue !

Je m'inscris

Découvre YouScribe et accède à tout notre catalogue !

Je m'inscris
icon

13

pages

icon

English

icon

Documents

2011

Lire un extrait
Lire un extrait

Obtenez un accès à la bibliothèque pour le consulter en ligne En savoir plus

DNA methylation is an essential epigenetic mechanism involved in gene regulation and disease, but little is known about the mechanisms underlying inter-individual variation in methylation profiles. Here we measured methylation levels at 22,290 CpG dinucleotides in lymphoblastoid cell lines from 77 HapMap Yoruba individuals, for which genome-wide gene expression and genotype data were also available. Results Association analyses of methylation levels with more than three million common single nucleotide polymorphisms (SNPs) identified 180 CpG-sites in 173 genes that were associated with nearby SNPs (putatively in cis , usually within 5 kb) at a false discovery rate of 10%. The most intriguing trans signal was obtained for SNP rs10876043 in the disco-interacting protein 2 homolog B gene ( DIP2B , previously postulated to play a role in DNA methylation), that had a genome-wide significant association with the first principal component of patterns of methylation; however, we found only modest signal of trans -acting associations overall. As expected, we found significant negative correlations between promoter methylation and gene expression levels measured by RNA-sequencing across genes. Finally, there was a significant overlap of SNPs that were associated with both methylation and gene expression levels. Conclusions Our results demonstrate a strong genetic component to inter-individual variation in DNA methylation profiles. Furthermore, there was an enrichment of SNPs that affect both methylation and gene expression, providing evidence for shared mechanisms in a fraction of genes.
Voir icon arrow

Publié par

Publié le

01 janvier 2011

Langue

English

Poids de l'ouvrage

1 Mo

Bell et al. Genome Biology 2011, 12:R10
http://genomebiology.com/2011/12/1/R10
RESEARCH Open Access
DNA methylation patterns associate with genetic
and gene expression variation in HapMap cell lines
1,3* 1 1 1,2 1 1Jordana T Bell , Athma A Pai , Joseph K Pickrell , Daniel J Gaffney , Roger Pique-Regi , Jacob F Degner ,
1* 1,2*Yoav Gilad , Jonathan K Pritchard
Abstract
Background: DNA methylation is an essential epigenetic mechanism involved in gene regulation and disease, but
little is known about the mechanisms underlying inter-individual variation in methylation profiles. Here we
measured methylation levels at 22,290 CpG dinucleotides in lymphoblastoid cell lines from 77 HapMap Yoruba
individuals, for which genome-wide gene expression and genotype data were also available.
Results: Association analyses of methylation levels with more than three million common single nucleotide
polymorphisms (SNPs) identified 180 CpG-sites in 173 genes that were associated with nearby SNPs (putatively in
cis, usually within 5 kb) at a false discovery rate of 10%. The most intriguing trans signal was obtained for SNP
rs10876043 in the disco-interacting protein 2 homolog B gene (DIP2B, previously postulated to play a role in DNA
methylation), that had a genome-wide significant association with the first principal component of patterns of
methylation; however, we found only modest signal of trans-acting associations overall. As expected, we found
significant negative correlations between promoter methylation and gene expression levels measured by RNA-
sequencing across genes. Finally, there was a significant overlap of SNPs that were associated with both
methylation and gene expression levels.
Conclusions: Our results demonstrate a strong genetic component to inter-individual variation in DNA
methylation profiles. Furthermore, there was an enrichment of SNPs that affect both methylation and gene
expression, providing evidence for shared mechanisms in a fraction of genes.
Background MTHFR [8] associate with global DNA hypo-methyla-
DNA methylation plays an important regulatory role in tion in human blood. These changes occur at a genome-
eukaryotic genomes. Alterations in methylation can wide level and are distinct from genetic variants that
affect transcription and phenotypic variation [1], but the impact DNA methylation variability in targeted genomic
source of variation in DNA methylation itself remains regions, for example, genetic polymorphisms associated
poorly understood. Substantial evidence of inter- with differential methylation in the H19/IGF2 locus [9].
individual variation in DNA methylation exists with age Recent evidence suggests a dependence of DNA
[2,3], tissue [4,5], and species [6]. In mammals, DNA methylation on local sequence content [10-12]. A strong
methylation is mediated by DNA methyltransferases genetic effect is supported by studies of methylation pat-
(DNMTs) that are responsible for de novo methylation terns in families [13] and in twins [14], but stochastic
and environmental factors are also likely to play anand maintenance of methylation patterns during replica-
tion. Genes involved in the synthesis of methylation and important role [2,14]. Recent work indicates that genetic
in DNA demethylation can also affect methylation varia- variation may have a substantial impact on local methy-
tion. For example, mutations in DNMT3L [7] and lation patterns [5,15-18], but neither the extent to which
methylation is affected by genetic variation, nor the
mechanisms are yet clear. Furthermore, the degree to
* Correspondence: jordana@well.ox.ac.uk; gilad@uchicago.edu;
which variation in DNA methylation underlies variationpritch@uchicago.edu
1Department of Human Genetics, The University of Chicago, 920 E. 58th St, in gene expression across individuals remains unknown.
Chicago, IL 60637, USA
Full list of author information is available at the end of the article
© 2011 Bell et al; licensee BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons
Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in
any medium, provided the original work is properly cited.Bell et al. Genome Biology 2011, 12:R10 Page 2 of 13
http://genomebiology.com/2011/12/1/R10
DNA methylation has long been considered a key reg- knownsequencevariation,leavinguswithadatasetof
ulator of gene expression. The genetic basis of gene 22,290 CpG-sites in the promoter regions of 13,236
expression has been investigated across tissues [19] and genes (see Methods). Following hybridization, methyla-
populations [20]. Both lines of evidence suggest genetic tion levels were estimated as the ratio of intensity signal
variants associated with gene expression variation are obtained from the methylated allele over the sum of
located predominantly near transcription start sites. methylated and unmethylated allele intensity signals.
However, not much is known about the precise mechan- Methylation levels were quantile-normalized [28] across
isms by which genetic variants modify gene-expression. two replicates. We tested for correlations with potential
confounding variables that could affect methylation levelsCombining genetic, epigenetic, and gene expression data
can inform the underlying relationship between these in LCLs [29], such as LCL cell growth rate, copy numbers
processes, but such studies are rare on a genome-wide of Epstein-Barr virus, and other measures of biological
scale. Two recent studies have examined the link variation (see Additional file 1) that were available for 60
between DNA methylation and expression in human of the individuals in our study [30]; these did not signifi-
brain samples [5,18]. Both studies identified substantial cantly explain variation in the methylation levels in our
numbers of quantitative trait loci underlying each type sample (Figure S1 in Additional file 1). However, we
of phenotype, but few examples of individual loci driving observed an influence of HapMap Phase (samples from
variation in both methylation and expression. Phase 1/2 vs 3) on the distribution of the first principal
To better understand the role of genetic variation in component loadings in the autosomal data, suggesting
controlling DNA methylation variation, and its resulting that the first methylation principal component may in
effectsongeneexpressionvariation, we studied DNA part capture technical variation potentially related to
promoter methylation across the genome in 77 human LCL culture. In the downstream association mapping
lymphoblastoid cell lines (LCLs) from the HapMap col- analyses, we applied a correction using principal compo-
lection. These cell lines represent a unique resource as nent analysis regressing the first three principal compo-
they have been densely genotyped by the HapMap Pro- nents to account for unmeasured confounders and
ject [21], and are now being genome-sequenced by the increase power to detect quantitative trait loci.
1,000 Genomes Project. In addition, these cell lines have
been studied by numerous groups studying variation in Global patterns of methylation
gene expression using microarrays [20,22] and RNA Distinct of were observed for CpG-
sequencing [23,24], as well as smaller studies of varia- sites located on the autosomes, X-chromosome, and in
tion in chromatin accessibility and PolII binding [25,26]. the vicinity of imprinted genes (Figure 1a). The majority
Finally, one of the HapMap cell lines is now being (71.4%) of autosomal CpG-sites were primarily
intensely studied by the ENCODE Project [27]. This unmethylated (observed fraction of methylation <0.3),
convergence of diverse types of genome-wide data from 15.6% were hemi-methylated (fraction of methylation
the same cell lines should ultimately enable a clearer was between 0.3 and 0.7), and 13% were methylated. As
understanding of the mechanisms by which genetic var- expected, these patterns were consistent with previously
iation impacts gene regulation. observed lower levels of methylation near promoters
relative to genome-wide levels [4,31]. We did not find
Results evidence for sex-specific autosomal methylation pat-
Characteristics of DNA promoter methylation patterns terns, consistent with a previous report [4]. In contrast,
To study inter-individual variation in profiles CpG-sites on the X-chromosome exhibited highly signif-
we measured methylation levels across the genome in 77 icant sex-specific differences (Figure S2) with hemi-
lymphoblastoid cell lines (LCLs) derived from unrelated methylated patterns in females that were consistent with
individuals from the HapMap Yoruba (YRI) collection. X-chromosome inactivation. A similar hemi-methylation
For these samples we also had publicly available geno- peak was observed for CpG-sites located near the tran-
types [21], as well as estimates of gene expression levels scription start sites (TSSs) of known autosomal
from RNA-sequencing in 69 of the 77 samples [24]. imprinted genes in the entire sample.
Methylation profiling was performed in duplicate using We observed a previously reported [4] drop in methyla-
the Illumina HumanMethylation27 DNA Analysis Bead- tion levels for CpG-sites located within 1 kb of TSSs
Chip assay, which is based on genotyping of bisulfite- (Figure 1b). Promoter methylation levels have been
converted genomic DNA at individual CpG-sites to reported to vary with respect to CpG isla

Voir icon more
Alternate Text