Author — The Precipice: Existential Risk and the Future of Humanity.
tobyord.com
This shift to inference-scaling has big implications for AI business, governance, and risk:
www.tobyord.com/writing/infe...
13/
This shift to inference-scaling has big implications for AI business, governance, and risk:
www.tobyord.com/writing/infe...
13/
Overall the inference scaling produced 82%, 63%, and 92% of the total performance gains on the different benchmarks.
12/
Overall the inference scaling produced 82%, 63%, and 92% of the total performance gains on the different benchmarks.
12/
The same is true for the other benchmarks I examined. Here are the raw scatterplots:
11/
The same is true for the other benchmarks I examined. Here are the raw scatterplots:
11/
• the RL boost taking the base model to the trend line
• the inference-scaling boost taking it to the top of the trend
10/
• the RL boost taking the base model to the trend line
• the inference-scaling boost taking it to the top of the trend
10/
9/
9/
8/
8/
7/
7/
6/
6/
5/
5/
For the largest AI companies, most costs come from deploying models to customers. If you need to 10x or 100x those costs, that is very expensive. And unlike training, it can't be made up in volume.
4/
For the largest AI companies, most costs come from deploying models to customers. If you need to 10x or 100x those costs, that is very expensive. And unlike training, it can't be made up in volume.
4/
This is the bull case for RL scaling — it started off small compared to internet-scale pre-training, so can be scaled 10x or 100x before doubling overall training compute.
3/
This is the bull case for RL scaling — it started off small compared to internet-scale pre-training, so can be scaled 10x or 100x before doubling overall training compute.
3/
This has two parts:
1. Scaling RL training
2. Scaling inference compute at deployment
2/
This has two parts:
1. Scaling RL training
2. Scaling inference compute at deployment
2/