Eric Leung
erictleung.bsky.social
860 followers 470 following 70 posts
marketing data scientist, generalist, math and library enthusiast, data scientist of the third kind, loves good stationery and pens, low tech enthusiast, open source tinkerer, opinions = mine #rstats
Pinned
hello world!

lil about me, I'm a former computational biologist, working as a data scientist in media and entertainment marketing, and I like getting better at my tools and share those learnings.

I also love good stationery and fountain pens.
politics aside, it's pretty amazing to find high quality data visualizations for free on the Wikipedia page for the recent NYC mayoral election. and using a clever ternary plot to represent the breakdown of votes for three candidates in one color

en.wikipedia.org/wiki/2025_Ne...
yes yes! thanks for sharing your Libby fact. also, jetpens is indeed the best. I thought about a side project of scraping all their fountain pens and visualizing all the dimensions of pens (sizes, color, etc). never did, but I still think of it from time to time 😅
maybe I'll add it to the Christmas wish list 😂
a fellow stationery and pens nerd! I got a retractable fountain pen for my birthday and it is just the best. I've seen this pen advertised to me, but I keep on telling myself I have enough pens 🥲 so tempting!
i guess they could be useful to be consistent with specifying different quotes, instead of remembering which quotes need to be escaped etc.

either way, it's a "the more you know" kind of thing built into R
recently learned the sQuote() and dQuote() R functions

> cat("distinguish plain", sQuote("single"), "and", dQuote("double"), "quotes")
distinguish plain ‘single’ and “double” quotes
> cat("distinguish between 'single' and \"double\" quotes")
distinguish between 'single' and "double" quotes

#RStats
for fans of mcelreath's statistical rethinking book, it now has a wikipedia page en.wikipedia.org/wiki/Statist...
Statistical Rethinking - Wikipedia
thought I understood f-strings enough, until I took this lil quiz. humbled me real quick fstrings.wtf
Reposted by Eric Leung
If you ask me for advice, the answer I give you will almost always be to reach out and talk to people. Not to read blog posts or watch youtube videos, though those are wonderful resources, but to meet with & talk to actual humans. It's not easy, and it takes time and effort, but it's the way #databs
been dragging my feet on organizing my digital folders after finally finding this framework for organizing your digital life.

in theory, i really like it. but with any digital cleaning, easier said than done.

anyone else have other systems they use to organize your life?

johnnydecimal.com
A system to organise your life
Johnny.Decimal is a system to organise your life. Find things, quickly, with more confidence, and less stress. It's free to use and the concepts are the same at home or work.
i've been writing sql code for a while now, but yesterday i learned the acronym CTAS, which stands for "create table as select"
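
for anyone else new to the acronym, here's a minimal sketch of CTAS, assuming Python's built-in sqlite3 module (SQLite supports CREATE TABLE ... AS SELECT). the posts table, its columns, and its rows are all made up for illustration, and other databases may differ in syntax details.

```python
import sqlite3

# In-memory database so the example leaves nothing behind.
conn = sqlite3.connect(":memory:")

# Hypothetical source table, invented for this sketch.
conn.execute("CREATE TABLE posts (id INTEGER, topic TEXT, likes INTEGER)")
conn.executemany(
    "INSERT INTO posts VALUES (?, ?, ?)",
    [(1, "rstats", 12), (2, "pens", 30), (3, "rstats", 5)],
)

# CTAS: create a new table and fill it from a SELECT in one statement.
conn.execute(
    """
    CREATE TABLE rstats_posts AS
    SELECT id, likes
    FROM posts
    WHERE topic = 'rstats'
    """
)

rows = conn.execute("SELECT id, likes FROM rstats_posts ORDER BY id").fetchall()
print(rows)
```

the new table takes its column names from the SELECT list, so there's no separate CREATE TABLE plus INSERT step to keep in sync.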
been trying to get into using vscode a bit more, and had some of the keyboard shortcuts interfere with each other.

took me a bit to figure it out, but in case anyone else is having issues with it, go to the File > Preferences > Keyboard Shortcuts menu

code.visualstudio.com/docs/configu...
Keyboard shortcuts for Visual Studio Code
Here you will find the complete list of keyboard shortcuts for Visual Studio Code and how to change them.
Reposted by Eric Leung
The #dataBS (Data Behind the Scenes) Conference Call for speakers is out! We're gonna do this!!

All online, single track, free to attend. Come talk about your messy experiences doing data stuff. At work, personal projects, whatever. A space to commiserate about nerdy things!

bit.ly/dataBSconf-cfs
Data Behind the Scenes Conf - Call for Speakers
What This Conference Is About "Data, Behind the Scenes" is a (free) online-only, single track conference centered on the real stories of data work from the folks in the trenches. We’re not here for th...
Reposted by Eric Leung
So freaking excited to have Cat Hicks on the Hangout tomorrow. She's keynoting at posit conf, too ✨ Go register and I'll see you there!! #databs

Event:
Posit Data Science Hangout with Cat Hicks
Thursday June 26th 2025 at 12PM Eastern/9am Pacific
Register for the Hangout event series at pos.it/dsh
so true. I've been doing the over documentation for a bit, but still working on over communicating
better to over communicate than assume when working across teams
i spent a whole day trying to get 5 lines of code to run properly. never have i had such a sigh of relief that it finally worked
following up on this, my problem is with using recipes and workflows, and how parity between Spark and R is still being worked on (and/or whether some features are even possible) stackoverflow.com/a/68324650/2... but good to know that modeling is fine with parsnip tho!
Tidymodels + Spark
I'm trying to develop a simple logistic regression model using Tidymodels with the Spark engine. My code works fine when I specify set_engine = "glm", but fails when I attempt to set the