Josh Weinstock
@joshweinstock.bsky.social
200 followers 280 following 17 posts
Assistant Professor in the Department of Human Genetics at Emory University. Statistical genetics and genomics + genetic epidemiology of somatic mosaicism. weinstocklab.org
Posts Media Videos Starter Packs
Pinned
joshweinstock.bsky.social
What happens if you search the genomes of ~490K adults for variants that associate with age at blood draw? We actually did this crazy idea, using the UKB WGS (from peripheral blood).

Turns out this 'discovers' classic clonal hematopoiesis drivers and more 🧵👇

www.medrxiv.org/content/10.1...

#ASHG
Genome-wide characterization of clonal hematopoiesis reveals extensive non-coding putative driver mutations
As humans age, we acquire somatic mutations in our blood, leading to clonal hematopoiesis (CH). Despite the prevalence of clonal hematopoiesis (CH) in aged individuals, recent searches for selective s...
www.medrxiv.org
joshweinstock.bsky.social
Shout to extraordinary collaborators and co-authors on this, Karen Conneely, Cameron Russell, Mitchell Machiela, Marios Arvanitis, Janghee Woo.

For more info, I'm presenting this work on Thursday at #ASHG at 1:45pm, Room 210C
joshweinstock.bsky.social
To make results easier to parse, we also release a summary statistics "portal" to view our results, including lots of QC metrics: somatic.emory.edu

We also release lots of code for this, which was all done on the DNA Nexus RAP in a cost effective way.
Explore clonal hematopoiesis associations with proteins and disease in UK Biobank.
somatic.emory.edu
joshweinstock.bsky.social
The IGH and IGL mutations in particular had really large effect sizes, so we then did a GWAS of a combined IGH + IGL phenotype, which resulted in a single hit: GRAMD1B. Remarkably, this is well characterized risk locus for CLL, suggesting that we are converging on CLL relevant biology.
joshweinstock.bsky.social
We then quantified the variance explained on a liability scale of CH to 30 common aging-related diseases. CH explains far more 'liability' scale variance for hematologic malignancies than other classes (as one would expect), though chronic kidney disease appears high here as well.
joshweinstock.bsky.social
New hits are generally indeed "weaker" than most classic CH drivers, suggesting that these are just underneath the "tip of the iceberg". We also replicate the telomere attrition mechanism reported here www.nature.com/articles/s41..., finding that the phenomenon is broader than previously observed
joshweinstock.bsky.social
The strongest hits are indeed strongly enriched for canonical CH genes, but we also find non-coding mutations at FGF1, UGT2B7, DGKB, the TERT promoter, chr17 centromere, and immunoglobulin loci:
joshweinstock.bsky.social
What happens if you search the genomes of ~490K adults for variants that associate with age at blood draw? We actually did this crazy idea, using the UKB WGS (from peripheral blood).

Turns out this 'discovers' classic clonal hematopoiesis drivers and more 🧵👇

www.medrxiv.org/content/10.1...

#ASHG
Genome-wide characterization of clonal hematopoiesis reveals extensive non-coding putative driver mutations
As humans age, we acquire somatic mutations in our blood, leading to clonal hematopoiesis (CH). Despite the prevalence of clonal hematopoiesis (CH) in aged individuals, recent searches for selective s...
www.medrxiv.org
Reposted by Josh Weinstock
bennystrobes.bsky.social
Exciting updates!!
(1) I just opened my lab at Boston Children’s Hospital (Harvard-affiliated)
(2) I’m hiring a postdoc focused on integrating GWAS and functional genomic data. Reach out if you’re interested or connect at ASHG next week!
(3) Learn more at stroberlab.com
Strober Lab
The Strober lab is a computational group at Boston Children's Hospital (a Harvard Medical School affiliated hospital) focused on developing statistical and machine learning tools applied to human gene...
stroberlab.com
Reposted by Josh Weinstock
jengreitz.bsky.social
Excited for a major milestone in our efforts to map enhancers and interpret variants in the human genome:

The E2G Portal! e2g.stanford.edu

This collates our predictions of enhancer-gene regulatory interactions across >1,600 cell types and tissues.

Uses cases 👇

1/
Reposted by Josh Weinstock
sashagusevposts.bsky.social
I wrote about gene-gene interactions (epistasis) and the implications for heritability, trait definitions, natural selection, and therapeutic interventions. Biology is clearly full of causal interactions, so why don't we see them in the data? A 🧵:
Beneath the surface of the sum
When genetic interactions matter and when they don't
open.substack.com
joshweinstock.bsky.social
You can find PRSFNN code here: github.com/weinstockj/PRS . It takes in GWAS summary statistics + LD reference panel + annotations, and we compute the posterior using variational inference to make it fast.

A pleasure to build this with @aprilkim.bsky.social and @alexisbattle.bsky.social
joshweinstock.bsky.social
We observed similar non-linear effects with AlphaMissense predictions, where low impact coding variants were prioritized, but highly pathogenic variants were not prioritized (presumably because these are so rare in real GWAS).
joshweinstock.bsky.social
More compellingly, it also learns non-linear effects wrt to chromatin accessibility - variants in cCREs present in 10-50 cell types were more highly prioritized than variants in cCRE that are present in numerous (> 50) cell types, suggesting a preference for cell/tissue specificity.
joshweinstock.bsky.social
In our annotation curation, we included lots of scATAC, cCREs from ENCODE, conservation from Zoonomia, pathogenicity from AlphaMissense, among others. Generally - PRSFNN "learns" that low-frequency SNPs in accessible chromatin are likely to have larger effect sizes (maybe not that surprising).
joshweinstock.bsky.social
We connected the SNP annotations to the parameters of the prior distribution on the weights in a novel way with a neural network, so we're calling it Polygenic Risk Scores with Functional Neural Network (PRSFNN). We were excited to see that PRSFNN does well in benchmarks (at least in our hands).
joshweinstock.bsky.social
Sharing some of our lecture slides on statistical genetics! 🧬

Co-taught with Mike Epstein, Dave Cutler, Karen Conneely, Jingjing Yang, Jian Hu.

Mendelian randomization: weinstocklab.org/lecture_slid...

Biobank scale GWAS methods: weinstocklab.org/lecture_slid...

Hope they're helpful!
Reposted by Josh Weinstock
hakha.bsky.social
We have multiple postdoc positions available in my group at NYU. Join us if you're interested in complex trait genetics and biology. More information about the lab on our website: mostafavilab.org
Home | Mostafavi Lab
mostafavilab.org
Reposted by Josh Weinstock
biorxiv-genomic.bsky.social
Specificity, length, and luck: How genes are prioritized by rare and common variant association studies https://www.biorxiv.org/content/10.1101/2024.12.12.628073v1
joshweinstock.bsky.social
I'm developing a pipeline to call CHIP mutations in UK Biobank using the DNA Nexus RAP that is fast/cheap/reproducible. Initial results are promising; calls looks reasonable and cost to do this across all of UKB is likely < 500$.

Feel free to DM if of interest.
joshweinstock.bsky.social
Great talk from Zeyun Lu on integrating cis-eQTLs with perturb-seq to increase discovery in Mendelian randomization #ASHG2023 !