jeffspence.github.io
18/n
18/n
17/n
17/n
16/n
16/n
Burden tests aggregate signal across variants. Long genes have more variants, and so tend to get prioritized higher.
15/n
Burden tests aggregate signal across variants. Long genes have more variants, and so tend to get prioritized higher.
15/n
14/n
14/n
Homozygous LoFs in NPR2 cause severe short stature, but don’t affect intelligence, facial features, etc… seems like NPR2 is height specific!
13/n
Homozygous LoFs in NPR2 cause severe short stature, but don’t affect intelligence, facial features, etc… seems like NPR2 is height specific!
13/n
Burden tests prioritize trait-SPECIFIC GENES.
GWAS prioritize genes near trait-SPECIFIC VARIANTS.
Looking at height gives a couple of really nice examples: NPR2 and HHIP.
12/n
Burden tests prioritize trait-SPECIFIC GENES.
GWAS prioritize genes near trait-SPECIFIC VARIANTS.
Looking at height gives a couple of really nice examples: NPR2 and HHIP.
12/n
1. Coding variants in specifically-expressed genes are more highly ranked
AND
2. Non-coding variants in tissue-specific ATAC peaks are more highly ranked.
11/n
1. Coding variants in specifically-expressed genes are more highly ranked
AND
2. Non-coding variants in tissue-specific ATAC peaks are more highly ranked.
11/n
GWAS prioritize genes near trait-specific VARIANTS, whereas burden tests prioritize trait specific GENES.
Variants can be specific because they act on trait-specific genes, or because they act on pleiotropic genes in a context-specific way.
10/n
GWAS prioritize genes near trait-specific VARIANTS, whereas burden tests prioritize trait specific GENES.
Variants can be specific because they act on trait-specific genes, or because they act on pleiotropic genes in a context-specific way.
10/n
This is surprising! Burden tests DO NOT rank genes by IMPORTANCE!
These predictions play out in the UKB.
8/n
This is surprising! Burden tests DO NOT rank genes by IMPORTANCE!
These predictions play out in the UKB.
8/n
7/n
7/n
1. Most burden hits are near a GWAS hit (they converge!)
BUT
2. The ranking of hits is surprisingly discordant. E.g., the second most significant burden hit for height is ranked 243rd in GWAS!
4/n
1. Most burden hits are near a GWAS hit (they converge!)
BUT
2. The ranking of hits is surprisingly discordant. E.g., the second most significant burden hit for height is ranked 243rd in GWAS!
4/n
In line with this conceptual similarity, previous work (link.springer.com/article/10.1... , www.biorxiv.org/content/10.1...) suggested these tests “converge” on the same genes.
2/n
In line with this conceptual similarity, previous work (link.springer.com/article/10.1... , www.biorxiv.org/content/10.1...) suggested these tests “converge” on the same genes.
2/n
When trying to predict which guide would have the largest effect on the expression of a particular gene, our results were more mixed.
When trying to predict which guide would have the largest effect on the expression of a particular gene, our results were more mixed.
This lets us directly test the “causal understanding” of our deep learning models!
This lets us directly test the “causal understanding” of our deep learning models!
We were also struck by how discordant the rankings are, and arguably when you have this many significant loci, some kind of ranking is necessary. 5/6
We were also struck by how discordant the rankings are, and arguably when you have this many significant loci, some kind of ranking is necessary. 5/6
Burden tests aggregate signal across variants. Long genes have more variants, and so tend to get prioritized higher. 13/n
Burden tests aggregate signal across variants. Long genes have more variants, and so tend to get prioritized higher. 13/n
Height GWAS and burden tests give a couple of really nice examples: NPR2 and HHIP. 10/n
Height GWAS and burden tests give a couple of really nice examples: NPR2 and HHIP. 10/n
1) Coding variants in specifically-expressed genes are more highly ranked
2) Non-coding variants in tissue-specific ATAC peaks are more highly ranked
9/n
1) Coding variants in specifically-expressed genes are more highly ranked
2) Non-coding variants in tissue-specific ATAC peaks are more highly ranked
9/n
GWAS prioritize genes near trait-specific VARIANTS. This is profoundly different from prioritizing trait-specific GENES. Variants can be specific because they act on trait-specific genes, or because they act on pleiotropic genes in a context-specific way. 8/n
GWAS prioritize genes near trait-specific VARIANTS. This is profoundly different from prioritizing trait-specific GENES. Variants can be specific because they act on trait-specific genes, or because they act on pleiotropic genes in a context-specific way. 8/n