pgdataviz.bsky.social
@pgdataviz.bsky.social
Statistical Programmer (Epidemiologist by training) using #Rstats and #Python mostly.
#DataScience #Dataviz

https://pgdatavizandstats.netlify.app/
One has to ask, did they even bother with a power review?

Still the paper got traction on the BBC and other news websites and, in the end, that's all that matters
You remember that Nature Aging paper about how multilingualism protects against accelerated aging? Well…
December 17, 2025 at 7:52 PM
Family/Christmas/Birthday/Work means this may be the last attempt at #TidyTuesday this year. First full year doing it and enjoyed. Not sure if I got better though!

Code Here: pgdatavizandstats.netlify.app/data_visuali...

#rstats | #tidyverse | #dataviz
December 9, 2025 at 6:50 PM
The adrenaline rush I am getting right now. This must be what Base jumping is like
November 28, 2025 at 12:45 PM
Working with datasets that definitely require the use of data.table. Genuinely very impressive package #rstats
November 25, 2025 at 6:32 PM
Here's a good one. Rendering my quarto html in #positron causes loads of errors implying that quarto isn't installed correctly (I've uninstalled and reinstalled multiple times) but if render the same html in #RStudio, it renders with no drama
November 24, 2025 at 4:56 PM
Tried doing a World cloud of the words in the entire Sherlock Holmes in python with a nasty head cold - can't recommend the head cold part.

Code: pgdatavizandstats.netlify.app/datavis

#PydyTuesday | #TidyTuesday | #Python
November 18, 2025 at 5:50 AM
This week's submission for #PydyTuesday. Missed ggplot slot when trying to make this plot

Code here: pgdatavizandstats.netlify.app/data_visuali...

#rstats | #python | #dataviz | #TidyTuesday
November 11, 2025 at 5:20 PM
My submission for #PydyTuesday. A quick jaunt back to Python and some Violin plots

Code here: pgdatavizandstats.netlify.app/data_visuali...

#rstats | #python | #dataviz | #TidyTuesday
November 6, 2025 at 11:01 AM
Anyone here better with the ggsankey package than I am? - I am trying to label the flow with the number of people transitioning through each stage. I have found no examples on the internet of labeling the flow (plenty labeling the nodes).

I have below what I need to try

#rstats #dataviz #ggplot2
November 5, 2025 at 10:58 AM
Looked at the percentage of Oxbridge educated people who have won various British Literary Prizes for #TidyTuesday. I used ggbrick and, as the bricks of look like books, tried to make it look like they were in bookshelves with meh results.

Code here: tinyurl.com/bddsuuc3

#rstats | #dataviz
October 30, 2025 at 9:09 AM
If anybody has any good tips/resources for a statistical programmer who has a technical interview as a data scientist, let me know. I've not done one before

(Background is epidemiology and clinical trials but this would be in health data science looking at massive datasets)

#rstats #datascience
October 23, 2025 at 8:06 PM
As a Welshman, taking the opportunity to ignore the English and focus only on Wales for #TidyTuesday. Looking at the average weather for Wales in 2024 and now I've made myself homesick.

Code here: pgdatavizandstats.netlify.app/data_visuali...

#rstats | #dataviz
October 23, 2025 at 5:56 AM
It's Monday which mean my chaotic self is submitting last weeks #TidyTuesday. I chose to look at water access as most people had looked at food in much nicer ways than me.

Code here: pgdatavizandstats.netlify.app/data_visuali...

#rstats | #dataviz
October 20, 2025 at 5:36 PM
Holiday and job-hunting sucking my soul mean I am a week behind and needed to do a(nother) rush job for #TidyTuesday. I have been playing with Reactable though

Code here: pgdatavizandstats.netlify.app/data_visuali...

#rstats | #dataviz
October 14, 2025 at 10:59 AM
I do like #positron but RStudio allows me to create .tex files from new.

Anyone know how I could get this on #positron, or is not a thing yet?
September 29, 2025 at 2:00 PM
Tried the datapasta package out, which is a form of minor miracle to get the country codes for the players.

Managed to crack a map of grandmasters per country before flying home to the UK for a break

Code here:

pgdatavizandstats.netlify.app/data_visuali...

#dataviz | #rstats | #tidytuesday
September 23, 2025 at 8:51 AM
A LLM trained on UK Biobank data to predict future disease presented uncritically by the media?
September 17, 2025 at 4:22 PM
September 16, 2025 at 10:26 AM
Missed last week's #TidyTuesday / #pydyTuesday due to work/toddler commitments but I'm ahead for this one. Such is life

Tried making a dashboard in python Whoever coded #streamlit is a genius

Dashboard here

tidytuespassportdashboard.streamlit.app

Code here

pgdatavizandstats.netlify.app/datavis
September 8, 2025 at 6:25 PM
For #TidyTuesday / #PydyTuesday I have struggled with the port of patchwork over to python. If anyone has any ideas on how to fix the titles of the graphs, I'm all ears.

Also Plotnine is a minor miracle.

Code Here:
pgdatavizandstats.netlify.app/data_visuali...

#dataviz | #python
August 27, 2025 at 6:58 PM
I logged on, opened RStudio, and accidentally pressed CTRL+SHIFT+P, and what sort of magic is this?
August 25, 2025 at 7:40 AM
(Reposted because I can't spell) Back to the warm blanket that is ggplot2 for #TidyTuesday.

A Sankey diagram of how the classifications of Scottish Munros have changed over time

#rstats | #dataviz | #ggplot2

Code here:

pgdatavizandstats.netlify.app/data_visuali...
August 19, 2025 at 3:49 PM
In a vain attempt to keep learning python I have tried imputing some CDISC data using #Quarto and so on - the report is rough around the edges but at least it worked.

#rstats | #python | #imputation

Code Here: pgdatavizandstats.netlify.app/biostatistic...
August 19, 2025 at 3:48 PM
Another #TidyTuesday / #PyDytuesday another week of thinking that making graphs in Python just isn't as nice as it is in R (Skill issue, I know)

Code here:
pgdatavizandstats.netlify.app/data_visuali...

#dataviz | #python
August 13, 2025 at 4:20 PM
August 7, 2025 at 8:39 AM