Follow our Llama 3.3 provider page to stay up to date as more hosting providers launch endpoints: artificialanalysis.ai/models/llama...
Follow our Llama 3.3 provider page to stay up to date as more hosting providers launch endpoints: artificialanalysis.ai/models/llama...
It now leads Llama 3.1 405B in MATH and almost matches 405B in each of MMLU, GPQA Diamond and HumanEval.
It now leads Llama 3.1 405B in MATH and almost matches 405B in each of MMLU, GPQA Diamond and HumanEval.
➤ Smaller increase in MMLU (84% to 86%)
➤ Llama 3.3 70B now leads Llama 3.1 405B in Math-500, and scores nearly equal to 405B in MMLU, GPQA Diamond and HumanEval
➤ Smaller increase in MMLU (84% to 86%)
➤ Llama 3.3 70B now leads Llama 3.1 405B in Math-500, and scores nearly equal to 405B in MMLU, GPQA Diamond and HumanEval