Together AI / Llama 3.3 70B on GSM8K

Together AI hosting Llama 3.3 70B on GSM8K. Completed at 4/15/2026, 1:22:18 AM. 97.0%. 1989 ms p50 latency.

Run Summary

This run is completed. Benchscope recorded 100 samples with 100 completed samples. Prompt mode: canonical. Sample scope: random.

Benchmark and Endpoint

Model family: Llama 3.3 70B. Provider: Together AI. Benchmark: GSM8K. Compare this run with other public runs on the same benchmark and model pages.

Benchscope is a JavaScript app. If the interactive interface does not load, enable JavaScript or use the links above for the main public sections.