Together AI Benchmark Results

Together AI is a cloud inference platform that hosts a broad range of open-weight model families. Benchscope records public evaluation runs for one model family hosted on Together AI, covering four benchmarks: MUSR, MATH, MMLU, and IFEVAL.

About Together AI Endpoints

Benchmark scores from Together AI endpoints reflect Together AI's specific deployment of each model. The same model hosted by another provider can score differently because of quantization choices, serving configuration, and infrastructure differences. For the cleanest cross-provider comparisons, use canonical-prompt runs, so that the prompt is not a further source of variation.
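As a rough illustration of the comparison described above, the sketch below keeps only canonical-prompt runs for one model and reports, per benchmark, how far each other provider's score sits from a chosen baseline provider. The `Run` record, its field names, the `provider_gaps` helper, and every score in `example_runs` are hypothetical placeholders invented for this example; they are not Benchscope's data model or real results.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Run:
    # Hypothetical record shape; not Benchscope's actual schema.
    provider: str
    model: str
    benchmark: str
    score: float
    canonical_prompt: bool


def provider_gaps(runs, model, baseline):
    """Return {benchmark: {provider: score - baseline_score}} for `model`.

    Only canonical-prompt runs are considered. Benchmarks where the
    baseline provider has no canonical-prompt run, or where no other
    provider has one, are omitted.
    """
    scores = {}
    for run in runs:
        if run.model == model and run.canonical_prompt:
            scores.setdefault(run.benchmark, {})[run.provider] = run.score

    gaps = {}
    for benchmark, by_provider in scores.items():
        if baseline not in by_provider:
            continue
        base = by_provider[baseline]
        diff = {
            provider: round(score - base, 4)
            for provider, score in by_provider.items()
            if provider != baseline
        }
        if diff:
            gaps[benchmark] = diff
    return gaps


# Placeholder values for illustration only; these are not measured scores.
example_runs = [
    Run("together", "llama-3.3-70b", "MMLU", 0.80, True),
    Run("other-provider", "llama-3.3-70b", "MMLU", 0.82, True),
    Run("together", "llama-3.3-70b", "MATH", 0.48, True),
    Run("other-provider", "llama-3.3-70b", "MATH", 0.50, False),
]

example_gaps = provider_gaps(example_runs, "llama-3.3-70b", "together")
```

In this placeholder data the MATH benchmark drops out of the result, because the other provider's only MATH run did not use the canonical prompt and so is not treated as comparable.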

Hosted Model Families

Model families with public evaluation runs on Together AI: Llama 3.3 70B.

Recent Together AI Runs
