Shai Carmi
@shaicarmi.bsky.social
830 followers 210 following 320 posts
Associate professor at the Hebrew University of Jerusalem. Statistical, population, and medical genetics; preimplantation genetic testing. Views my own. http://scarmilab.org
Posts Media Videos Starter Packs
Pinned
shaicarmi.bsky.social
Given several new followers here, I'm posting the link to our Slack group "genetic genealogy science".

We post and discuss manuscripts on phasing/imputation, IBD, recombination, demographic inference, ancient DNA, ancestry/admixture, etc.
All are welcome.

join.slack.com/t/geneticgen...
Slack
join.slack.com
Reposted by Shai Carmi
biorxiv-genetic.bsky.social
FLARE2: local ancestry inference with poorly-matched reference panels https://www.biorxiv.org/content/10.1101/2025.10.13.681993v1
shaicarmi.bsky.social
Hebrew University graduate wins Economics Nobel prize.
Not bad for one day :)

www.nobelprize.org/prizes/econo...
www.nobelprize.org/events/nobel...
shaicarmi.bsky.social
I know Trump has inflicted great pain on his own country and on science everywhere. But we will be forever thankful. No other world leader cared.

In parallel, no rest until Netanyahu is behind bars for his crimes against humanity and for destroying the fabric of the Israeli society.
shaicarmi.bsky.social
Celebrating the release of the hostages in my neighborhood and all over Israel.

After two years, we can breathe again.

Let's hope this day follows with peace and prosperity to the whole region.
shaicarmi.bsky.social
Big day ahead, can't sleep :)
* Hostages return from Gaza after two years
* War ends
* Trump in Jerusalem
shaicarmi.bsky.social
What will be achieved first: in-vitro gametogenesis, or peace in the Middle East?
Reposted by Shai Carmi
sebatlab.bsky.social
Please repost: we are recruiting a postdoc for a project in my lab on genes and environment in autism
sebatlab.bsky.social
We are proud to be one of the 13 projects funded by the NIH Autism Data Science Initiative. Our project: "Elucidating the Interplay of Genes and Environment in Autism Using Genomic and Exposure Data from Large Populations" 🧵https://www.nytimes.com/2025/09/26/health/autism-research-trump-kennedy.html
Despite False Claims, Trump Funnels Millions Into Credible Autism Research
www.nytimes.com
shaicarmi.bsky.social
It's not my expertise, but my understanding is that this is also very difficult.
shaicarmi.bsky.social
where each chr has probability ~1/2 to end up in the egg. The probability of having exactly one copy of each of chrs 1,2,...,22,X is less than one in a million. Thus, the embryo will almost inevitably be aneuploid (and will either not be born or will be affected with disorders).
2/2
shaicarmi.bsky.social
There is a lot of hype around this new paper. It's a great advance, but it will never result in the birth of a healthy baby. The reason is that in this method, chrs that end up in the egg are a random sample of the 46 somatic chrs,
1/2

www.bbc.com/news/article...
Human skin DNA fertilised to make embryo for first time
US scientists testing the technique say it could help people overcome infertility and potentially allow same-sex couples to have a genetically related child.
www.bbc.com
shaicarmi.bsky.social
Thanks. With mtDNA the coverage is very high anyway, isn't it?
And just to understand, did you go back to the original GVCF for each individual and each variant with ./. ?
shaicarmi.bsky.social
Hi hive mind - would appreciate feedback on the following.

I have ~3k whole-genomes, each called individually with GATK. I have 3k GVCFs.

I don't have enough resources for joint calling.

Should I use bcftools to merge into a multi-sample VCF? Or joint call in batches (say, 100) and then merge?
Reposted by Shai Carmi
rjhfmstr.bsky.social
🚨 New preprint out!
We reconstructed parental haplotypes in >440k individuals (UK & Estonian biobanks) to estimate assortative mating directly in the parental generation.
This reveals intensified assortment in recent generations.
www.biorxiv.org/content/10.1...
Reposted by Shai Carmi
biorxiv-evobio.bsky.social
Comparing Neanderthal introgression maps reveals core agreement but substantial heterogeneity https://www.biorxiv.org/content/10.1101/2025.09.23.678138v1
Reposted by Shai Carmi
To analyze millions of genetic variants, we need tools that are fast and memory efficient. Some of the most efficient tools have problems with numerical instability on dense sets of snps. Bercovich et al. figured out how to overcome a common source of instability: www.biorxiv.org/content/10.1...
LD Matrix Approximations for Scalable Analysis of High-dimensional Genetic Data
Linkage disequilibrium (LD) matrices are an essential part of many statistical genetics methods. However, their high dimensionality makes their computation and storage impractical for large genomic da...
www.biorxiv.org
Reposted by Shai Carmi
shaicarmi.bsky.social
Some days it feels as if Netanyahu and Trump are competing who can destroy their country faster or while inflicting the most pain
Reposted by Shai Carmi
robel-alemu.bsky.social
Excited to share our preprint on PGI portability across ancestries—now on bioRxiv. With co-authors @aysuo.bsky.social, @paturley.bsky.social, @alextisyoung.bsky.social, and @Dan_J_Benjamin. Preprint: doi.org/10.1101/2025... Thread below for details.
shaicarmi.bsky.social
Oh I see now. But wasn't it a supervised ADMIXTURE run? In such a case the ancestral groups are defined by the training data regardless of sample size.
shaicarmi.bsky.social
Not sure I agree - in the plot you can see that EAS ancestry is inferred for even fewer UAE people.

Anyhow I wouldn't overinterpret given that the best solution is to simply use a more suitable reference panel.
shaicarmi.bsky.social
Yeah I was also wondering about that. I guess it's tagging some Asian ancestry. But why not use HGDP, with plenty of local populations.
shaicarmi.bsky.social
Interesting work. "we analyzed viral load of 31 DNA viruses in human blood and saliva using whole-genome sequencing from UK Biobank (n=490k), SPARK (n=13k), and All of Us (n=415k)... genetic variation at dozens of loci associated with load of seven viruses"

www.biorxiv.org/content/10.1...
Genes and environment profoundly affect the human virome
Many viruses have adapted to persist in infected humans for life. Variable host control of their abundance (or load) can lead to clearance or disease. Here, we analyzed the viral load of 31 DNA viruse...
www.biorxiv.org
shaicarmi.bsky.social
It was perhaps a bit sloppy phrasing. The more precise point is that when comparing siblings, the environment is no longer a confounder for the relation between genotype and phenotype.