Installation
Getting started with Terminal-Bench.
Terminal-Bench provides a CLI for running the benchmark, creating custom tasks, and running other popular benchmarks we've adapted to our framework.
Install dependencies
Terminal-Bench requires git
and Docker
to be installed.
Install the CLI
You can then run the CLI using terminal-bench
or tb
for convenience.
Run tb --help
to see the available commands and options.