Konrad
konradjk.bsky.social
Konrad
@konradjk.bsky.social
Genomicist, computational biologist. Assistant professor @ MGH, HMS. Associate member @ Broad Institute

https://klab.is
gs://ukb-diverse-pops-public/misc/pairwise/pairwise_correlations_regressed.txt.bgz - it’s coded in the way that our pan-UKB phenotypes were so not sure if it’s super easy to use but that’s pairwise r_p for ~14k phenos
October 8, 2025 at 6:01 PM
Special thanks to all co-authors that got this here including @masakanai.bsky.social @rahulg603.bsky.social @dalygene.bsky.social @egatkinson.bsky.social and of course @genetisaur.bsky.social for driving this through 5 years of work (after the first GWASes were done!)
September 18, 2025 at 5:32 PM
Tons of lessons learned around carefully controlling population stratification, using heritability as a QC metric, and probably most importantly, quantifying novelty in a mega-phenotype analysis. Some really cool analyses to find interesting biology e.g. allelic series and ancestry-enriched variants
September 18, 2025 at 5:29 PM
Starter pack of people who create starter packs?
November 8, 2024 at 12:06 AM
You mean “ReNally???”?
November 7, 2024 at 6:23 PM
Heh, it was on our list but somehow never made it into the pre-submission checklist. Will do!
September 21, 2024 at 5:00 PM
Interesting question. We do have a “gnomAD-new” analysis in there but haven’t broken down by ancestry - i fear a lot is going to be driven by “not yet observed” (which is the same across all ancestries)
September 20, 2024 at 4:45 PM
It gets a bit more complicated though - these scores have a mix of impacts of variant-to-gene, as well as prioritizing which genes, when disrupted, lead to phenotypes. Perhaps a new method that combines both these insights optimally will outperform them all!
September 20, 2024 at 3:51 PM
We found that population-focused methods do best for identifying highly impactful variants (de novo’s in individuals with developmental disorders for instance), while the deep learning methods are better at prioritizing inherited variation in biobanks
September 20, 2024 at 3:49 PM
Paella is good. With LOEUF I assumed the culmination would be an omelette but CHARR is better in a paella, so maybe it’s a multi-course meal
December 7, 2023 at 6:10 PM
Extended data figure 2b has exclusive exon-only. I think we internally made some with intermediate overlaps and it was an intermediate result as you’d expect
December 7, 2023 at 6:09 PM
And thanks to Ryan Dhindsa and Slavé Petrovski for the excellent writeup and context around our work. Excited for the times ahead! www.nature.com/articles/d41...
An expanded genomic database for identifying disease-related variants
An expanded version of a human-genome database called gnomAD, containing 76,156 whole-genome sequences, has enabled investigation of how variants in non-protein-coding regions of the genome affect hea...
www.nature.com
December 6, 2023 at 5:14 PM
This is all thanks to an amazing production team, browser team, and steering committee @gnomad-project.bsky.social, the 76,156 individuals that provided their genomes, and support from Broad Genomics and Hail
December 6, 2023 at 5:12 PM
Interestingly, these scores also provide additional insight into genes regulated by these regions, even those underpowered by previous constraint metrics:
December 6, 2023 at 5:11 PM
Gnocchi extends our constraint metrics to the non-coding genome, highlighting for instance, disease-associated non-coding CNVs
December 6, 2023 at 5:11 PM
We built a new metric we called gnocchi (genomic non-coding constraint of haploinsufficient variation), building on methods that find depletions of variation (natural selection), which we show can prioritize functional variation
December 6, 2023 at 5:10 PM
Thanks to Wenhan Lu for driving this effort, Hail (hail.is) for building the scalable infrastructure that enabled this, and @gnomAD-project.bsky.social for the data and support
Hail | Index
hail.is
November 28, 2023 at 7:36 PM
CHARR operates only on homozygous alternate sites and scales very well (“cost per 1M samples” might be my new favorite metric):
November 28, 2023 at 7:35 PM