Frank Harrell
@f2harrell.bsky.social
7.4K followers 130 following 900 posts
Professor of Biostatistics Vanderbilt University School of Medicine Expert Biostatistics Advisor FDA Center for Drug Evaluation and Research https://hbiostat.org https://fharrell.com
Posts Media Videos Starter Packs
f2harrell.bsky.social
Yes the main idea is to write about things others are interested in or will benefit from, which motivates them to take part after the first draft is posted online.
f2harrell.bsky.social
If you have non-stratified-on covariates the 0.5 threshold is for the covariate setting you request to be predicted.
f2harrell.bsky.social
Getting rid of academic journals in favor of capturing ongoing comments and suggestions from experts would make the world a better place.
matti.vuorre.com
Against Publishing: universonline.nl/nieuws/2025/...

Preprints are read, shared, and cited, yet still dismissed as incomplete until blessed by a publisher. I argue that the true measure of scholarship lies in open exchange, not in the industry’s gatekeeping of what counts as published.
f2harrell.bsky.social
To estimate the mean the highest time value must be uncensored. To estimate the median for a covariate value X=x the survival curve for X=x must drop below 0.5. This is for semiparametric/nonparametric estimators like Cox/K-M.
f2harrell.bsky.social
Yes for (only) the very special case of a linear model, if pre is linearly related to post and pre is put in both the left and right side of the model, this will rescue the error made in analyzing change. Why rely on a special case? Use a principled method that works for all models.
f2harrell.bsky.social
You're welcome. A major manifestation of the problem is the inability to interpret results when the rigid assumptions are not met, and having the same amount of change mean completely different things to different persons.
f2harrell.bsky.social
Briefly the two measures must already be perfectly transformed, there be no floor or ceiling effects, and difference must be unrelated to baseline. Details at hbiostat.org/bbr/change #Statistics #StatsSky
14  Transformations, Measuring Change, and Regression to the Mean – Biostatistics for Biomedical Research
hbiostat.org
Reposted by Frank Harrell
statsepi.bsky.social
I read and write, I explore and I question, I design and script and analyse, I interpret and communicate. I do this to train my mind in the hopes of one day generating new knowledge. New knowledge that might even be useful, and that no algorithm can yet be trained on.
hormiga.bsky.social
Y'all. I just got ChatGPT to do everything in R for this manuscript. I mean EVERYTHING. And it's all legit and reproducible. I'm shook.

How are we mentoring our trainees in statistics now? Who needs to learn coding in R line by line, and who doesn't?

scienceforeveryone.science/statistics-i...
Statistics in the era of AI
How do we mentor, teach, and do stats when AI can do so much of the work?
scienceforeveryone.science
Reposted by Frank Harrell
f2harrell.bsky.social
At @vanderbilt.edu I used to teach a biostat course for interdisciplinary biological sciences PhD students but then (1) a senior leader told all the students to drop the course and take an online one and (2) no one took the online course anyway.
f2harrell.bsky.social
Glad there are exceptions. Want to describe what you program teaches for confounder specification?
f2harrell.bsky.social
Agreed but also see major deficits in statistical training of epidemiologists. Just one example: their courses say that stepwise regression is OK.
f2harrell.bsky.social
And one of the biggest problems in epi is the large number of researchers who won't use causal methods (and won't even do sensitivity analysis to unmeasured confounders!) but who use causal conclusions in their papers.
f2harrell.bsky.social
Yes! Coding is thinking. My new rule of thumb: Use a LLM only to write code to do things that either are low priority or that I don’t have time to do and wouldn’t have done without LLM (while assuming the generated code is only 80% right).
f2harrell.bsky.social
Unfortunately true. Many researchers see statisticians as speedbumps. To me, worst offenders have been animal researchers and epidemiologists. An animal researcher once told me “There is a reason I don’t come to the free daily biostat clinics: I know you will tell me that 6 dogs isn’t enough.”
f2harrell.bsky.social
That's largely true. The biggest problem is the amount of medical practice that has not been researched.
f2harrell.bsky.social
Anyone who gets to work with Aki is very lucky …
f2harrell.bsky.social
The extremely problematic use of change scores is so poorly understood by researchers that it’s almost sickening. Most don’t even understand what is needed for the subtraction operator to work. hbiostat.org/bbr/change
f2harrell.bsky.social
An excellent argument. One related way I try to get course managers’ attention is to quote the cost of SPSS licenses to students once they leave the comfort of the university site license. The cost is obsene, for a product that does far less than R, & LLM can even help students learn to program R
Reposted by Frank Harrell
Reposted by Frank Harrell
andrew.heiss.phd
If you've ever wanted to learn how to make beautiful websites with #QuartoPub and #rstats , check out this workshop I'm giving in a couple weeks! It'll be a blast (and we're covering Quarto's brand new _brand dot yaml system!)
stathorizons.bsky.social
Learn to create and publish a professional, data-focused website in “Create an Online Presence with Quarto Websites” on October 16-17, with @andrew.heiss.phd‬! Discover how to use #Quarto to build a variety of websites like personal portfolios, research compendiums, and interactive dashboards.
Quarto Websites | Online Seminar | Code Horizons
This online course taught by Andrew Heiss, Ph.D., teaches you how to use Quarto to build a variety of data-focused websites.
codehorizons.com
Reposted by Frank Harrell
pwgtennant.bsky.social
"Uncooperative statistician": the term used (typically by a senior clinician) to describe a well-trained and knowledgeable statistician who refuses to conduct flawed or fraudulent research.
f2harrell.bsky.social
In addition an equally important requirement is that the study is actually designed, i.e., that the investigators cared enough about the question to get funding for prospective data collection, QC, and unbiased outcome evaluation.