Kevin Markham
dataschool.io
Kevin Markham
@dataschool.io
Teaching Data Science 🤖 with Python 🐍 since 2014

Courses: https://courses.dataschool.io
Newsletter: https://tuesday.tips
Videos: https://www.youtube.com/@dataschool

Living in beautiful Asheville, North Carolina 🌲
VIDEO: How to use top AI models on a budget

Want to chat with the best AI models from OpenAI, Claude, and Google without paying $20/month?

I'll show you how to use API keys with TypingMind to access top models for a fraction of the cost!

Find out how:
www.youtube.com/watch?v=wvvT...
How to use top AI models on a budget
YouTube video by Data School
www.youtube.com
November 5, 2025 at 1:20 PM
Reposted by Kevin Markham
Vibe coding is irresponsibly building software through dice rolls, not caring what code is produced

What about when engineers at the top of their game use AI tools responsibly to accelerate their work?

I propose "vibe engineering"!

simonwillison.net/2025/Oct/7/v...
Vibe engineering
I feel like vibe coding is pretty well established now as covering the fast, loose and irresponsible way of building software with AI—entirely prompt-driven, and with no attention paid to …
simonwillison.net
October 7, 2025 at 2:35 PM
Reposted by Kevin Markham
i wrote about atproto and why it matters
Open Social — overreacted
The protocol is the API.
overreacted.io
September 26, 2025 at 3:33 PM
What's the difference between "AI" and "Machine Learning"?

My one-sentence summary:

> AI is a field that creates intelligent systems, and Machine Learning is the dominant approach for achieving that intelligence by learning patterns from data.

Full explanation: www.dataschool.io/ai-vs-machin...
What is "AI" versus "Machine Learning"? 🤔
AI is a field that creates intelligent systems, and Machine Learning is the dominant approach for achieving that intelligence by learning patterns from data.
www.dataschool.io
September 22, 2025 at 4:29 PM
Download the first 3 chapters of my NEW book:

👉 dataschool.kit.com/mlbook 👈

It's a practical guide to help you transform from Machine Learning novice to skilled Machine Learning practitioner.

Coming out in paperback 📖 and ebook 👨‍💻 soon!
I thoroughly enjoyed the first three chapters, Kevin. Easy to read and refreshingly practical.

I fully intend to buy the book.

Thank you
September 19, 2025 at 5:38 PM
OpenAI's analysis of "how people use ChatGPT"

Source: cdn.openai.com/pdf/a253471f...
September 16, 2025 at 4:35 PM
What's your one-sentence explanation of the difference between AI and Machine Learning?
September 15, 2025 at 7:46 PM
Dream unlocked: I'm publishing my first book! 🎉🎉🎉

It's called "Master Machine Learning with scikit-learn: A Practical Guide to Building Better Models with Python"

Download the first 3 chapters right now:
👉 dataschool.kit.com/mlbook 👈

Thanks for your support 🙏
September 11, 2025 at 5:53 PM
Reposted by Kevin Markham
A website that lets you select a country or state and move it around a Mercator projection map to yield better size comparisons.

thetruesize.com, created by James Talmage and Damon Maneice
August 30, 2025 at 11:34 PM
@simonwillison.net Pelican riding a bike! From the Trader Joe's August 2025 Fearless Flyer: www.traderjoes.com/home/ff

(I can confirm that this illustration is on the front page of the printed version as well.)
August 21, 2025 at 8:49 AM
@briancasel.com You like Claude Code and you like dashboards, so how about a Claude Code Analytics Dashboard?

github.com/chiphuyen/sn...
GitHub - chiphuyen/sniffly: Claude Code dashboard with usage stats, error analysis, and sharable feature
Claude Code dashboard with usage stats, error analysis, and sharable feature - chiphuyen/sniffly
github.com
July 23, 2025 at 8:24 PM
Mega Terminal Cheat Sheet by @b0rk.jvns.ca

Printable version here: wizardzines.com/terminal-che...
June 10, 2025 at 6:52 PM
My thoughts on the current state of AI progress and the most important developments in 2025:

www.dataschool.io/ai-progress-...
AI progress in 2025 📈
Thoughts on the current state of AI progress and the most important developments in 2025
www.dataschool.io
May 28, 2025 at 2:17 PM
Claude 4 API is now available!
May 22, 2025 at 5:50 PM
🙋‍♀️ Curious about Retrieval Augmented Generation (RAG)?

👨‍🏫 Allow me to explain! Here's the video outline:

0:00 RAG definition
0:14 Problem that RAG solves
0:49 RAG part 1: Indexing
1:43 RAG part 2: Retrieval and generation
2:38 How RAG prevents hallucinations
May 21, 2025 at 3:24 PM
🚀 Just launched my new course: Build #AI agents with #Python

🏷 Save $50 through Friday: courses.dataschool.io/build-ai-age...

🙏 Please share with a friend... thank you!
Course: Build AI agents with Python
Develop the skills to create AI apps that can think and act independently 🤖
courses.dataschool.io
May 20, 2025 at 2:09 PM
Reposted by Kevin Markham
Published some notes on Microsoft's phi4-reasoning model, an 11GB download (via Ollama) which may well overthink things... it produced 56 sentences of reasoning output in response to my prompt of "hi" simonwillison.net/2025/May/6/p...
May 6, 2025 at 6:47 PM
Claude finally integrated web search into its results...

But with LangChain & LangGraph, you can build a chatbot that integrates web search into ANY model you like!

You'll learn how to do that (and much more) in my new AI course...

Sign up for EARLY ACCESS:
👉 dataschool.kit.com/agents 👈
March 27, 2025 at 11:59 AM
Reposted by Kevin Markham
🗒️ Do you need to prepare a ML model, and you are working with text and strings?
Skrub provides four encoders to convert strings into numerical features. 🤗 models included!

What's the best? Check out our blog post to find out 👀

skrub-data.org/skrub-materi...
What’s the best way to encode categorical features? A use case with Skrub encoders – Skrub learning materials
skrub-data.org
March 26, 2025 at 8:50 AM
Just finished recording my new AI course 😅

Sign up for early access: dataschool.kit.com/agents
March 24, 2025 at 3:43 PM
Reposted by Kevin Markham
Many good advices/best practices for missing value imputation in the paper!

I now have a much deeper appreciation for Data School's course and regard it as the best scikit-learn course.

Master Machine Learning with scikit-learn: courses.dataschool.io/master-machi...
March 18, 2025 at 3:55 PM
In honor of March Madness 🏀, I've got a new blog post:

www.dataschool.io/pandas-strea...

Learn how to identify & analyze scoring streaks using pandas operations:

- shift()
- cumsum()
- boolean math
- groupby()
How to calculate "scoring streaks" with pandas 🏀
Learn how to identify & analyze consecutive events in your data using advanced DataFrame methods!
www.dataschool.io
March 17, 2025 at 1:53 PM
"To Type or Not to Type?" is an excellent introduction (and reference guide) to type hints in Python:

blog.jonathanchun.com/2025/02/16/t...
March 13, 2025 at 6:59 PM
"Some people today are discouraging others from learning programming on the grounds AI will automate it. This advice will be seen as some of the worst career advice ever given."

-- Andrew Ng, legendary AI researcher

Source: www.deeplearning.ai/the-batch/is...
DeepSeek-R1 Uncensored, QwQ-32B Puts Reasoning in Smaller Model, and more...
The Batch AI News and Insights: Some people today are discouraging others from learning programming on the grounds AI will automate it.
www.deeplearning.ai
March 13, 2025 at 6:05 PM