Crystal Lewis
cghlewis.bsky.social
Research Data Management Consultant | cghlewis.com

Co-organizer @r-ladies-stl.bsky.social
Co-organizer POWER Data Management Hub | https://osf.io/ap3tk/

Author of DMLSER: https://datamgmtinedresearch.com/
RDM Weekly: https://rdmweekly.substack.com/
Pinned
Re-introduction for new followers!
Hello! 👋
I am currently a freelance research data management consultant. I also co-organize R-Ladies St. Louis. I mostly post about data management and #rstats data wrangling tips. I also recently wrote this book.
datamgmtinedresearch.com
Welcome | Data Management in Large-Scale Education Research
This is the in-progress version of Data Management in Large-Scale Education Research.
Reposted by Crystal Lewis
new #dplyr filtering function who dis 😍 meet filter_out()!

👍 GO PUT A THUMBS UP ON DAVIS'S TIDYUP THINGY ON GITHUB TO SHARE MY ENTHUSIASMMMM!! #rstats #databs

And, you know, provide your thoughts on this newly-proposed function if you have them 😌
November 7, 2025 at 11:16 PM
Reposted by Crystal Lewis
💬 What are your top tips for setting up successful (research) projects, #AcademicSky #AcademicChatter?
📖 Any recommendations for books/ courses/ websites/ etc. on successful project management in academia?
ALT: a cartoon of a dog sitting in front of a laptop with a speech bubble saying it's fine
media.tenor.com
November 7, 2025 at 12:30 PM
I was writing some code using #rstats dplyr to check some hand calculated values and then I was like, what am I doing? I'm not lame. I'm using genzplyr! 💅
November 7, 2025 at 8:15 PM
Stop being a lame-o and replace dplyr with the more slappin genzplyr.
November 7, 2025 at 2:06 AM
Many research teams still collect paper forms and when it comes to entering data from those forms, there are a series of decisions to be made (and documented) to ensure that data is entered accurately and in a secure and standardized way.

More information: datamgmtinedresearch.com/capture#capt...
November 6, 2025 at 1:54 PM
It's that time again.

"We noticed your personal access token (classic) "August 2025" with gist, repo, user, and workflow scopes will expire in 6 days."
ALT: a cartoon of a man wearing glasses and a watch
media.tenor.com
November 5, 2025 at 9:44 PM
Reposted by Crystal Lewis
🪧
November 5, 2025 at 6:31 PM
About 100 new subscribers per month. For some that might not seem like much, but for this little free passion-project newsletter of mine, I think that's pretty awesome. 🌟
rdmweekly.substack.com
November 4, 2025 at 11:53 PM
Reposted by Crystal Lewis
TOMORROW, I'm hosting what I'm calling Data Science Lab. This debut session will be our fave Positron settings for #rstats & #python 😍

I called it DS Lounge at first, but that doesn't feel right. You'll see "Lounge" when you register at pos.it/dslab, but future sessions will be "Lab" 🧪😎 #databs
November 4, 2025 at 7:06 PM
Reposted by Crystal Lewis
The John D. and Catherine T. MacArthur Foundation has generously awarded us funding to secure our own storage. This critical processing space will be instrumental in ensuring that large datasets can be temporarily stored, curated, and described.

Thank you, MacArthur Foundation, for your support!
Data Rescue Project receives support from the John D. and Catherine T. MacArthur Foundation to support data rescue efforts
FOR IMMEDIATE RELEASE Since launching in February 2025, the Data Rescue Project has grown substantially. At this point, the DRP has enabled the rescue of more than 1,000 datasets from US Federal…
www.datarescueproject.org
November 4, 2025 at 5:20 PM
Reposted by Crystal Lewis
Issue 20 of RDM Weekly is out! 📬

- Automate File Management in R With the {fs} Package @jadeynryan.bsky.social
- How to Start Your Own Code Club @sortee.bsky.social
- Guide to Social Science Data Preparation and Archiving @icpsr.bsky.social
and more!
rdmweekly.substack.com/p/rdm-weekly...
RDM Weekly - Issue 020
A weekly roundup of Research Data Management resources.
rdmweekly.substack.com
November 4, 2025 at 2:10 PM
Reposted by Crystal Lewis
PSA to just FINISH writing the damn things before tech infrastructure and corporate landscape changes on you*.

* this is about more than one project, unfortunately.
November 3, 2025 at 9:11 PM
Reposted by Crystal Lewis
Now I'm also looking for a research software engineer to implement a pile of research results in the R packages loo, posterior, bayesplot, projpred, priorsense, and brms and/or the Python packages ArviZ, Bambi, and Kulprit. Apply by email with no specific deadline (see contact info at users.aalto.fi/~ave/).
I'm now also looking for a postdoc with strong Bayesian background and interest in developing Bayesian cross-validation theory, methods and software. Apply by email with no specific deadline (see contact information at users.aalto.fi/~ave/).

Others, please share
I'm looking for a doctoral student with Bayesian background to work on Bayesian workflow and cross-validation (see my publication list users.aalto.fi/~ave/publica... for my recent work) at Aalto University.

Apply through the ELLIS PhD program (dl October 31) ellis.eu/news/ellis-p...
November 3, 2025 at 11:13 AM
Reposted by Crystal Lewis
The useR! 2025 conference virtual talks are available on YouTube 👇🏼👇🏼👇🏼

www.youtube.com/playlist?lis...

#rstats
useR! 2025 Virtual Talks - YouTube
A playlist containing all the virtual talks from useR! 2025. Please contact us if you see an issue with a video.
www.youtube.com
November 3, 2025 at 1:52 AM
Reposted by Crystal Lewis
Issue 19 of RDM Weekly is out! 🎃

- Handbook for Reproduction and Replication Studies @forrt.bsky.social
- A Crowdsourced Effort to Develop a Lab Manual Template @improvingpsych.org
- So, What’s the Deal with rlang, Anyway? @veerle.hypebright.nl
and more!

rdmweekly.substack.com/p/rdm-weekly...
October 28, 2025 at 1:26 PM
Reposted by Crystal Lewis
Best day of the year: getting author copies of my new book, “The Data Management Workbook.” It comes out on December 2 from @pelagic.bsky.social and is already available for pre-order.
October 31, 2025 at 4:38 PM
Reposted by Crystal Lewis
I remembered seeing this and it came in so clutch cleaning up data that had a number of different data versions in it #rstats
dplyr::rows_update() and dplyr::rows_patch() continue to be super helpful functions when you want to fill missing data with data collected from another source. #rstats

More examples: cghlewis.github.io/data-wrangli...
October 30, 2025 at 9:27 PM
Reposted by Crystal Lewis
I'm deeply grateful that @lyndamk.bsky.social wrote a lovely blurb for my book, "The Data Management Workbook," which comes out in just over a month.

I really admire Lynda's work, her books, and her recent efforts with the @datarescueproject.org.
October 30, 2025 at 6:38 PM
Reposted by Crystal Lewis
The recording is up if you weren't able to catch this session live and want a walk-through of how to use {fs} to clean up your messy folders!

Thanks again @rladies-bot.bsky.social for the invitation to get back into the #RStats community after my maternity hiatus! 💜
Thank you so much to @jadeynryan.bsky.social for the fantastic workshop last night. The recording for Efficient File Management in R with {fs} is now available!

youtu.be/X4i-yOBtn1s
R-Ladies STL: Efficient File Management in R with {fs} with Jadey Ryan
YouTube video by R Ladies STL
youtu.be
October 30, 2025 at 6:20 PM
Reposted by Crystal Lewis
I'm super excited to give a workshop during next week's R/Pharma conference! Join me if you'd like to discuss the ✨beautiful✨ outputs you can create with #Quarto and #RStats.

📅 Tuesday, November 4 at 9 am CT

🔗 Free to register, too! events.zoom.us/ev/Ai-geyS63...
October 30, 2025 at 3:43 PM
Reposted by Crystal Lewis
{fs} is one of those extremely dope R packages that I always forget about. the OS specific file path management functions are 👌
Thank you so much to @jadeynryan.bsky.social for the fantastic workshop last night. The recording for Efficient File Management in R with {fs} is now available!

youtu.be/X4i-yOBtn1s
R-Ladies STL: Efficient File Management in R with {fs} with Jadey Ryan
YouTube video by R Ladies STL
youtu.be
October 30, 2025 at 3:35 PM
Right now! @jadeynryan.bsky.social is doing such an awesome job walking us through using the {fs} package to work with files, with bonus Halloween content! @r-ladies-stl.bsky.social
October 30, 2025 at 12:03 AM
Reposted by Crystal Lewis
When people learn with ChatGPT instead of following their own searches, they end up knowing less, caring less, and producing worse advice, even when the facts are the same.

Friction is an essential ingredient for learning! Convenience makes us shallow.

academic.oup.com/pnasnexus/ar...
Experimental evidence of the effects of large language models versus web search on depth of learning
Abstract. The effects of using large language models (LLMs) versus traditional web search on depth of learning are explored. A theory is proposed that when
academic.oup.com
October 28, 2025 at 3:14 PM