Last updated: 2018-06-21
workflowr checks: (Click a bullet for more information) ✔ R Markdown file: up-to-date
Great! Since the R Markdown file has been committed to the Git repository, you know the exact version of the code that produced these results.
✔ Repository version: f77307a
wflow_publish
or wflow_git_commit
). workflowr only checks the R Markdown file, but you know if there are other scripts or data files that it depends on. Below is the status of the Git repository when the results were generated:
Ignored files:
Ignored: output/bmaonlybetasd5lfsr.txt.gz
Ignored: output/bmaonlybetasd5posterior.means.txt.gz
Ignored: output/independentsim.rds
Ignored: output/independentsimesd05.rds
Ignored: output/indsim05sdlfsr.txt.gz
Ignored: output/indsim05sdposterior.means.txt.gz
Ignored: output/noashsharedwithzerobmaalllfsr.txt.gz
Ignored: output/noashsharedwithzerobmaallposterior.means.txt.gz
Ignored: output/sharedashcutoffomega2jun15lfsr.txt.gz
Ignored: output/sharedashcutoffomega2jun15posterior.means.txt.gz
Ignored: output/simdata.rds
Ignored: output/univariate.ash.lfsr.txt.gz
Ignored: output/univariate.ash.pm.txt.gz
Ignored: output/univariate.ash.pmindesd.txt.gz
Ignored: output/univariate.ashind.lfsresd.txt.gz
Unstaged changes:
Modified: analysis/_site.yml
Note that any generated files, e.g. HTML, png, CSS, etc., are not included in this status report because it is ok for generated content to have uncommitted changes.
To apply multivariate adaptive shrinkage (mash) to data from the GTEx study, we created an R data set (serialized R object) containing matrices of SNP-gene association statistics. These association statistics include effect estimates, Z scores and corresponding standard errors.
See here for the scripts used to generate these statistics from the SNP-gene data that were provided by the GTEx Project.
These are the recommended steps for retrieving the GTEx SNP-gene association statistics:
Download or clone the git repository.
The association statistics are found in file MatrixEQTLSumStats.Portable.Z.rds
.
analysis
directory of the gtexresults
repository, e.g.,R setwd("gtexresults/analysis")
R dat <- readRDS("../data/MatrixEQTLSumStats.Portable.Z.rds")
R names(dat)
This file contains SNP-gene association statistics for 16,069 genes and 44 human tissues. These 16,069 genes were selected because they all showed some indication of being expressed in all 44 tissues. Therefore, the association statistics are stored as matrices each with 16,069 rows and 44 columns.
As input to mash, we use a matrix of expression quantitative trait loci (eQTL) effect estimate, and corresponding standard errors. (We also provide Z scores.) See the manuscript for details on how these association statistics were obtained.
These association statistics were subdivided into three subsets:
This reproducible R Markdown analysis was created with workflowr 1.0.1.9000