Compute-Optimal LLMs

This tool helps you find the optimal LLM given your compute budget. Alternatively, you can also use it to find the optimal compute budget for a planned model. The tool is highly based on the work of Kaplan et. al (2020), Hoffman et. al (2022), and most recently with new results on varying modalities, Aghajanyan et. al (2023). I do not make any claims about the accuracy of these predictions, but merely provide a tool to try to easily calculate what these predictions actually mean for your model.

Compute-Optimal LLMs

Choose a modality (data type)

What do you want to calculate?

Calculate the optimal model size given a training budget.