LLMs can vibe-code and win coding contests, but can they handle real-world software issues like dependency hell, legacy toolchains or weird compile errors?
We gave 19 state-of-the-art LLMs the unmodified source code of open-source projects such as curl (an HTTP client) and jq (a command-line JSON processor), and tested them on 15 real-world tasks.
The goal is simple: build a working binary from source, but getting there is hard. The toughest challenges include cross-compiling to Windows or ARM64 and resurrecting decade-old code on modern systems. Agents sometimes need 88 commands and 29 minutes to produce a working binary.

| # | Model | pass@1 / pass@2 |
|---|---|---|
| 1 | | 83% / 93% |
| 2 | | 83% / 87% |
| 3 | | 70% / 87% |
| 4 | | 80% / 80% |
| 5 | | 80% / 80% |
| 6 | | 57% / 80% |
| 7 | | 67% / 73% |
| 8 | | 57% / 73% |
| 9 | | 60% / 67% |
| 10 | | 57% / 67% |
| 11 | | 50% / 67% |
| 12 | | 43% / 67% |
| 13 | | 57% / 60% |
| 14 | | 50% / 60% |
| 15 | | 47% / 60% |
| 16 | | 47% / 60% |
| 17 | | 47% / 53% |
| 18 | | 40% / 53% |
| 19 | | 37% / 47% |

pass@1: success within a single attempt; pass@2: success within 2 attempts.

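
The two metrics can be computed from per-attempt outcomes as in the sketch below. The exact aggregation is an assumption: pass@1 is taken as the share of all attempts that succeed, and pass@2 as the share of tasks with at least one success in two attempts. The function name `pass_rates` and the `example` data layout are hypothetical; the outcomes are copied from the first 30 rows of the per-attempt listing, and under this reading they give 57% / 80%, which matches one row of the leaderboard.

```python
from collections import defaultdict

def pass_rates(attempts):
    """Compute (pass@1, pass@2) for one model.

    attempts: iterable of (task_name, succeeded) pairs, two per task.
    Assumption: pass@1 averages over every attempt; pass@2 counts a task
    as solved if any of its first two attempts succeeded.
    """
    by_task = defaultdict(list)
    for task, ok in attempts:
        by_task[task].append(bool(ok))
    total_attempts = sum(len(v) for v in by_task.values())
    pass_at_1 = sum(sum(v) for v in by_task.values()) / total_attempts
    pass_at_2 = sum(any(v[:2]) for v in by_task.values()) / len(by_task)
    return pass_at_1, pass_at_2

# Outcomes (attempt 1, attempt 2) for the first model block in the listing.
example = {
    "coreutils": (False, True),
    "coreutils-old-version": (False, True),
    "coreutils-old-version-alpine": (True, True),
    "coreutils-static": (False, False),
    "coreutils-static-alpine": (False, True),
    "cowsay": (True, True),
    "curl": (True, True),
    "curl-ssl": (True, True),
    "curl-ssl-arm64-static": (False, False),
    "curl-ssl-arm64-static2": (False, False),
    "jq": (False, True),
    "jq-static": (False, True),
    "jq-static-musl": (False, True),
    "jq-windows": (False, True),
    "jq-windows2": (True, True),
}
attempts = [(task, ok) for task, pair in example.items() for ok in pair]
p1, p2 = pass_rates(attempts)
print(f"{p1:.0%} / {p2:.0%}")
```
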
Each task follows a shared structure: we hand an AI the source of an open-source project, a clear build objective, and an interactive Linux terminal. The agent must discover the build system (e.g., Autotools/Make/CMake or custom scripts), decide whether to patch the sources, resolve missing headers and libraries, choose compiler/linker flags (dynamic vs static, glibc vs musl), and verify that the produced binary works.
Difficulty ranges from quick, modern builds to reviving 2003-era code and producing fully static binaries. Tasks include projects like cowsay, jq, and GNU coreutils (including static and legacy variants); see the per-task pages for details.
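
The final verification step (is the binary built for the right architecture, and is it static?) can be approximated by reading the ELF headers directly. This is a heuristic sketch, not the benchmark's actual checker, which is not shown here. The names `inspect_elf` and `inspect_file` are hypothetical, only 64-bit little-endian ELF is handled, and "static" is taken to mean "no `PT_INTERP` program header", which also accepts static-PIE binaries.

```python
import struct

# Constants from the ELF specification.
MACHINES = {62: "x86-64", 183: "aarch64"}
PT_INTERP = 3  # program header that names the dynamic loader

def inspect_elf(data: bytes) -> dict:
    """Report target machine and whether the image requests a dynamic loader."""
    if data[:4] != b"\x7fELF":
        raise ValueError("not an ELF file")
    if data[4] != 2 or data[5] != 1:
        raise ValueError("this sketch only handles 64-bit little-endian ELF")
    # ELF64 file header: e_ident, e_type, e_machine, e_version, e_entry,
    # e_phoff, e_shoff, e_flags, e_ehsize, e_phentsize, e_phnum, ...
    fields = struct.unpack_from("<16sHHIQQQIHHHHHH", data, 0)
    e_machine, e_phoff = fields[2], fields[5]
    e_phentsize, e_phnum = fields[9], fields[10]
    # p_type is the first 32-bit word of every program header entry.
    has_interp = any(
        struct.unpack_from("<I", data, e_phoff + i * e_phentsize)[0] == PT_INTERP
        for i in range(e_phnum)
    )
    return {
        "machine": MACHINES.get(e_machine, str(e_machine)),
        "static": not has_interp,
    }

def inspect_file(path: str) -> dict:
    """Convenience wrapper that reads a binary from disk."""
    with open(path, "rb") as fh:
        return inspect_elf(fh.read())
```

Calling `inspect_file` on a freshly built binary and comparing the result with the task's target (for example `aarch64` and `static`) catches two of the failure modes that appear in the results, a dynamically linked output and an output built for the host architecture instead of the target.
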

| Task | pass@1 / pass@2 |
|---|---|
| curl-ssl: Build curl 8.16.0 with SSL support (TLS v1.3), brotli, zlib and zstd; autotools setup and library detection. | 92% / 100% |
| coreutils: Build coreutils 9.7; large project with extensive feature detection. | 87% / 100% |
| cowsay: Build cowsay 3.8.4; small legacy build with quirky packaging. | 92% / 95% |
| jq: Build jq 1.8.1; autotools and dependency detection can be tricky. | 92% / 95% |
| coreutils-static-alpine: Produce fully static coreutils 9.7 with a working sha1sum; many binaries, strict static linking. | 82% / 95% |
| curl: Build curl 8.16.0; autotools setup and library detection. | 79% / 95% |
| coreutils-static: Produce fully static coreutils 9.7; many binaries, strict static linking. | 76% / 95% |
| jq-windows2: Compile the jq package for amd64 Windows and install it to /home/peter/result/jq.exe. Make sure it runs correctly via the installed wine; static linking flags, dependency closure, and toolchain differences. | 66% / 84% |
| jq-static: Produce a fully static jq 1.8.1; careful with linker flags and deps. | 61% / 79% |
| coreutils-old-version: Build coreutils 5.0; legacy autotools and modern compiler hurdles. | 58% / 74% |
| jq-static-musl: Produce a musl-linked static jq 1.8.1; toolchain and portability challenges. | 39% / 53% |
| jq-windows: Compile the jq package for amd64 Windows and install it to /home/peter/result/jq.exe. Build it statically; static linking flags, dependency closure, and toolchain differences. | 37% / 47% |
| coreutils-old-version-alpine: Build coreutils 5.0 and surface a working sha1sum; legacy autotools and modern compiler hurdles. | 8% / 11% |
| curl-ssl-arm64-static2: Build curl 8.16.0 with SSL support (TLS v1.3), brotli, zlib and zstd. The binary should be statically compiled for arm64. Do a trial run via qemu-aarch64-static, downloading https://google.com; autotools setup and library detection. | 5% / 11% |
| curl-ssl-arm64-static: Build curl 8.16.0 with SSL support (TLS v1.3), brotli, zlib and zstd. The binary should be statically compiled for arm64; autotools setup and library detection. | 3% / 5% |

| # | Model | Sum of cheapest 7 | pass@1 / pass@2 |
|---|---|---|---|
| 1 | | $0.02 | 37% / 47% |
| 2 | | $0.03 | 67% / 73% |
| 3 | | $0.04 | 47% / 60% |
| 4 | | $0.04 | 47% / 60% |
| 5 | | $0.06 | 50% / 60% |
| 6 | | $0.07 | 47% / 53% |
| 7 | | $0.08 | 83% / 87% |
| 8 | | $0.13 | 50% / 67% |
| 9 | | $0.13 | 43% / 67% |
| 10 | | $0.21 | 57% / 73% |
| 11 | | $0.23 | 60% / 67% |
| 12 | | $0.27 | 57% / 60% |
| 13 | | $0.28 | 40% / 53% |
| 14 | | $0.49 | 83% / 93% |
| 15 | | $0.54 | 57% / 67% |
| 16 | | $0.84 | 70% / 87% |
| 17 | | $0.94 | 80% / 80% |
| 18 | | $0.99 | 80% / 80% |
| 19 | | $3.65 | 57% / 80% |

| # | Model | Sum of fastest 7 | pass@1 / pass@2 |
|---|---|---|---|
| 1 | | 4m55s | 60% / 67% |
| 2 | | 5m18s | 43% / 67% |
| 3 | | 5m35s | 47% / 60% |
| 4 | | 5m44s | 47% / 60% |
| 5 | | 6m26s | 57% / 73% |
| 6 | | 6m37s | 50% / 67% |
| 7 | | 6m38s | 37% / 47% |
| 8 | | 7m32s | 67% / 73% |
| 9 | | 7m38s | 47% / 53% |
| 10 | | 7m44s | 50% / 60% |
| 11 | | 8m44s | 57% / 67% |
| 12 | | 9m14s | 57% / 60% |
| 13 | | 9m25s | 80% / 80% |
| 14 | | 10m43s | 40% / 53% |
| 15 | | 12m22s | 80% / 80% |
| 16 | | 12m41s | 83% / 87% |
| 17 | | 13m14s | 57% / 80% |
| 18 | | 15m38s | 83% / 93% |
| 19 | | 17m36s | 70% / 87% |

| # | Model | Total cost | LLM inference time | Command execution time | Total time | Tokens used |
|---|---|---|---|---|---|---|
| 1 | | $0.18 | 15m58s | 20m37s | 37m29s | 231k |
| 2 | | $0.62 | 55m9s | 1h27m37s | 2h52m14s | 540k |
| 3 | | $0.93 | 21m10s | 24m2s | 46m13s | 406k |
| 4 | | $1.56 | 2h11m49s | 39m4s | 2h51m37s | 891k |
| 5 | | $1.90 | 32m46s | 31m9s | 1h04m36s | 466k |
| 6 | | $1.99 | 35m22s | 58m54s | 1h34m58s | 671k |
| 7 | | $2.16 | 22m9s | 1h23m26s | 1h46m29s | 575k |
| 8 | | $2.20 | 34m15s | 21m10s | 56m7s | 328k |
| 9 | | $3.06 | 21m3s | 1h07m42s | 1h29m36s | 688k |
| 10 | | $5.67 | 2h05m46s | 25m19s | 2h31m46s | 751k |
| 11 | | $7.36 | 47m9s | 1h00m11s | 1h47m59s | 558k |
| 12 | | $9.22 | 39m27s | 30m17s | 1h10m31s | 582k |
| 13 | | $10.78 | 24m3s | 21m48s | 46m32s | 536k |
| 14 | | $13.39 | 1h18m32s | 57m1s | 2h16m06s | 807k |
| 15 | | $14.05 | 1h00m15s | 28m21s | 1h29m07s | 828k |
| 16 | | $18.37 | 1h17m52s | 1h06m00s | 2h24m14s | 590k |
| 17 | | $32.13 | 2h37m22s | 1h04m59s | 3h43m18s | 811k |
| 18 | | $38.66 | 36m13s | 44m23s | 1h21m45s | 723k |
| 19 | | $50.39 | 1h46m28s | 23m58s | 2h11m02s | 669k |
| Total | | $214.62 | 18h42m48s | 14h15m57s | 33h41m38s | 11.7M |

| Model | Task | Status | Error |
|---|---|---|---|
| | coreutils | Failure | exceeded max tool calls (50) |
| | coreutils | Success | - |
| | coreutils-old-version | Failure | exceeded max tool calls (70) |
| | coreutils-old-version | Success | - |
| | coreutils-old-version-alpine | Success | - |
| | coreutils-old-version-alpine | Success | - |
| | coreutils-static | Failure | exceeded max tool calls (50) |
| | coreutils-static | Failure | task failed: kill missing at /home/peter/result/kill or not executable |
| | coreutils-static-alpine | Failure | exceeded max tool calls (50) |
| | coreutils-static-alpine | Success | - |
| | cowsay | Success | - |
| | cowsay | Success | - |
| | curl | Success | - |
| | curl | Success | - |
| | curl-ssl | Success | - |
| | curl-ssl | Success | - |
| | curl-ssl-arm64-static | Failure | task failed: curl-arm64 is not statically linked |
| | curl-ssl-arm64-static | Failure | task failed: curl HTTPS request to google.com did not return content-type: text/html but instead: } [2 bytes data] * SSL... |
| | curl-ssl-arm64-static2 | Failure | task failed: curl HTTPS request to google.com did not return content-type: text/html but instead: } [2 bytes data] * SSL... |
| | curl-ssl-arm64-static2 | Failure | task failed: curl HTTPS request to google.com did not return content-type: text/html but instead: } [2 bytes data] * SSL... |
| | jq | Failure | exceeded max tool calls (50) |
| | jq | Success | - |
| | jq-static | Failure | exceeded max tool calls (50) |
| | jq-static | Success | - |
| | jq-static-musl | Failure | task failed: jq is not statically linked |
| | jq-static-musl | Success | - |
| | jq-windows | Failure | task failed: jq help does not contain expected string |
| | jq-windows | Success | - |
| | jq-windows2 | Success | - |
| | jq-windows2 | Success | - |
| | coreutils | Success | - |
| | coreutils | Success | - |
| | coreutils-old-version | Success | - |
| | coreutils-old-version | Success | - |
| | coreutils-old-version-alpine | Failure | exceeded max tool calls (100) |
| | coreutils-old-version-alpine | Failure | task failed: df missing at /home/peter/result/df or not executable |
| | coreutils-static | Success | - |
| | coreutils-static | Success | - |
| | coreutils-static-alpine | Success | - |
| | coreutils-static-alpine | Success | - |
| | cowsay | Success | - |
| | cowsay | Success | - |
| | curl | Success | - |
| | curl | Success | - |
| | curl-ssl | Success | - |
| | curl-ssl | Success | - |
| | curl-ssl-arm64-static | Failure | task failed: curl-arm64 is not statically linked |
| | curl-ssl-arm64-static | Failure | task failed: curl HTTPS request to google.com did not return content-type: text/html but instead: } [2 bytes data] * SSL... |
| | curl-ssl-arm64-static2 | Failure | task failed: curl HTTPS request to google.com did not return content-type: text/html but instead: } [2 bytes data] * SSL... |
| | curl-ssl-arm64-static2 | Failure | task failed: curl-arm64 is not statically linked |
| | jq | Success | - |
| | jq | Success | - |
| | jq-static | Success | - |
| | jq-static | Success | - |
| | jq-static-musl | Success | - |
| | jq-static-musl | Success | - |
| | jq-windows | Success | - |
| | jq-windows | Success | - |
| | jq-windows2 | Success | - |
| | jq-windows2 | Success | - |
| | coreutils | Success | - |
| | coreutils | Success | - |
| | coreutils-old-version | Success | - |
| | coreutils-old-version | Success | - |
| | coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist |
| | coreutils-old-version-alpine | Failure | task failed: df missing at /home/peter/result/df or not executable |
| | coreutils-static | Success | - |
| | coreutils-static | Success | - |
| | coreutils-static-alpine | Success | - |
| | coreutils-static-alpine | Success | - |
| | cowsay | Success | - |
| | cowsay | Success | - |
| | curl | Success | - |
| | curl | Success | - |
| | curl-ssl | Success | - |
| | curl-ssl | Success | - |
| | curl-ssl-arm64-static | Failure | task failed: curl-arm64 is not statically linked |
| | curl-ssl-arm64-static | Failure | task failed: curl HTTPS request to google.com did not return content-type: text/html but instead: } [2 bytes data] * SSL... |
| | curl-ssl-arm64-static2 | Failure | task failed: curl HTTPS request to google.com did not return content-type: text/html but instead: } [2 bytes data] * SSL... |
| | curl-ssl-arm64-static2 | Failure | task failed: curl HTTPS request to google.com did not return content-type: text/html but instead: } [2 bytes data] * SSL... |
| | jq | Success | - |
| | jq | Success | - |
| | jq-static | Success | - |
| | jq-static | Success | - |
| | jq-static-musl | Success | - |
| | jq-static-musl | Success | - |
| | jq-windows | Success | - |
| | jq-windows | Success | - |
| | jq-windows2 | Success | - |
| | jq-windows2 | Success | - |
| | coreutils | Success | - |
| | coreutils | Success | - |
| | coreutils-old-version | Failure | unexpected end of JSON input |
| | coreutils-old-version | Success | - |
| | coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist |
| | coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist |
| | coreutils-static | Success | - |
| | coreutils-static | Failure | task failed: uptime missing at /home/peter/result/uptime or not executable |
| | coreutils-static-alpine | Success | - |
| | coreutils-static-alpine | Success | - |
| | cowsay | Success | - |
| | cowsay | Success | - |
| | curl | Success | - |
| | curl | Success | - |
| | curl-ssl | Success | - |
| | curl-ssl | Success | - |
| | curl-ssl-arm64-static | Failure | task failed: curl binary does not exist |
| | curl-ssl-arm64-static | Failure | task failed: curl binary does not exist |
| | curl-ssl-arm64-static2 | Failure | task failed: curl-arm64 is not statically linked |
| | curl-ssl-arm64-static2 | Failure | task failed: curl HTTPS request to google.com did not return content-type: text/html but instead: * Protocol "https" not... |
| | jq | Success | - |
| | jq | Success | - |
| | jq-static | Success | - |
| | jq-static | Failure | task failed: jq binary does not exist |
| | jq-static-musl | Success | - |
| | jq-static-musl | Failure | task failed: jq binary does not exist |
| | jq-windows | Failure | task failed: jq help does not contain expected string |
| | jq-windows | Success | - |
| | jq-windows2 | Failure | task failed: jq help does not contain expected string |
| | jq-windows2 | Failure | task failed: jq help does not contain expected string |
| | coreutils | Success | - |
| | coreutils | Success | - |
| | coreutils-old-version | Failure | task failed: chroot missing at /home/peter/result/chroot or not executable |
| | coreutils-old-version | Failure | task failed: chroot missing at /home/peter/result/chroot or not executable |
| | coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist |
| | coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist |
| | coreutils-static | Success | - |
| | coreutils-static | Success | - |
| | coreutils-static-alpine | Failure | task failed: sha1sum binary does not exist |
| | coreutils-static-alpine | Success | - |
| | cowsay | Success | - |
| | cowsay | Success | - |
| | curl | Success | - |
| | curl | Failure | task failed: curl binary does not exist |
| | curl-ssl | Success | - |
| | curl-ssl | Success | - |
| | curl-ssl-arm64-static | Failure | task failed: curl-arm64 is not aarch64 architecture |
| | curl-ssl-arm64-static | Failure | task failed: curl binary does not exist |
| | curl-ssl-arm64-static2 | Failure | task failed: curl-arm64 is not aarch64 architecture |
| | curl-ssl-arm64-static2 | Failure | task failed: curl binary does not exist |
| | jq | Success | - |
| | jq | Success | - |
| | jq-static | Success | - |
| | jq-static | Failure | task failed: jq is not statically linked |
| | jq-static-musl | Failure | task failed: jq is not statically linked |
| | jq-static-musl | Failure | task failed: jq is not statically linked |
| | jq-windows | Failure | task failed: jq help does not contain expected string |
| | jq-windows | Failure | task failed: jq help does not contain expected string |
| | jq-windows2 | Success | - |
| | jq-windows2 | Success | - |
| | coreutils | Success | - |
| | coreutils | Success | - |
| | coreutils-old-version | Failure | task failed: seq missing at /home/peter/result/seq or not executable |
| | coreutils-old-version | Failure | failed to unmarshal shell-harness response: unexpected end of JSON input |
| | coreutils-old-version-alpine | Failure | exceeded max tool calls (100) |
| | coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist |
| | coreutils-static | Failure | task failed: sha1sum binary does not exist |
| | coreutils-static | Success | - |
| | coreutils-static-alpine | Success | - |
| | coreutils-static-alpine | Success | - |
| | cowsay | Success | - |
| | cowsay | Success | - |
| | curl | Success | - |
| | curl | Success | - |
| | curl-ssl | Success | - |
| | curl-ssl | Success | - |
| | curl-ssl-arm64-static | Failure | task failed: curl binary does not exist |
| | curl-ssl-arm64-static | Failure | task failed: curl binary does not exist |
| | curl-ssl-arm64-static2 | Failure | task failed: curl binary does not exist |
| | curl-ssl-arm64-static2 | Failure | task failed: curl-arm64 is not aarch64 architecture |
| | jq | Success | - |
| | jq | Success | - |
| | jq-static | Failure | task failed: jq is not statically linked |
| | jq-static | Failure | exceeded max tool calls (50) |
| | jq-static-musl | Failure | exceeded max tool calls (50) |
| | jq-static-musl | Failure | task failed: jq binary does not exist |
| | jq-windows | Failure | task failed: jq.exe binary does not exist |
| | jq-windows | Failure | task failed: jq help does not contain expected string |
| | jq-windows2 | Failure | task failed: jq.exe binary does not exist |
| | jq-windows2 | Success | - |
| | coreutils | Success | - |
| | coreutils | Success | - |
| | coreutils-old-version | Failure | task failed: sha1sum binary does not exist |
| | coreutils-old-version | Failure | task failed: chroot missing at /home/peter/result/chroot or not executable |
| | coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist |
| | coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist |
| | coreutils-static | Success | - |
| | coreutils-static | Success | - |
| | coreutils-static-alpine | Success | - |
| | coreutils-static-alpine | Success | - |
| | cowsay | Success | - |
| | cowsay | Success | - |
| | curl | Success | - |
| | curl | Success | - |
| | curl-ssl | Success | - |
| | curl-ssl | Success | - |
| | curl-ssl-arm64-static | Failure | task failed: curl-arm64 is not statically linked |
| | curl-ssl-arm64-static | Failure | task failed: curl-arm64 is not statically linked |
| | curl-ssl-arm64-static2 | Failure | task failed: curl binary does not exist |
| | curl-ssl-arm64-static2 | Failure | task failed: curl binary does not exist |
| | jq | Success | - |
| | jq | Success | - |
| | jq-static | Success | - |
| | jq-static | Success | - |
| | jq-static-musl | Failure | task failed: jq is not statically linked |
| | jq-static-musl | Failure | task failed: jq binary does not exist |
| | jq-windows | Failure | task failed: jq.exe binary does not exist |
| | jq-windows | Failure | task failed: jq help does not contain expected string |
| | jq-windows2 | Failure | task failed: jq.exe binary does not exist |
| | jq-windows2 | Success | - |
| | coreutils | Success | - |
| | coreutils | Success | - |
| | coreutils-old-version | Failure | task failed: sha1sum binary does not exist |
| | coreutils-old-version | Success | - |
| | coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist |
| | coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist |
| | coreutils-static | Failure | task failed: sha1sum binary does not exist |
| | coreutils-static | Success | - |
| | coreutils-static-alpine | Success | - |
| | coreutils-static-alpine | Success | - |
| | cowsay | Success | - |
| | cowsay | Success | - |
| | curl | Success | - |
| | curl | Failure | task failed: curl did not download the expected local file content, but instead: curl: (1) Protocol "file" not supported |
| | curl-ssl | Failure | task failed: curl binary does not exist |
| | curl-ssl | Success | - |
| | curl-ssl-arm64-static | Failure | task failed: curl binary does not exist |
| | curl-ssl-arm64-static | Failure | task failed: curl binary does not exist |
| | curl-ssl-arm64-static2 | Failure | task failed: curl binary does not exist |
| | curl-ssl-arm64-static2 | Failure | task failed: curl binary does not exist |
| | jq | Failure | task failed: jq binary does not exist |
| | jq | Failure | task failed: jq binary does not exist |
| | jq-static | Success | - |
| | jq-static | Success | - |
| | jq-static-musl | Failure | task failed: jq is not statically linked |
| | jq-static-musl | Failure | task failed: jq is not statically linked |
| | jq-windows | Failure | task failed: jq help does not contain expected string |
| | jq-windows | Failure | task failed: jq.exe binary does not exist |
| | jq-windows2 | Failure | task failed: jq.exe binary does not exist |
| | jq-windows2 | Failure | task failed: jq help does not contain expected string |
| | coreutils | Success | - |
| | coreutils | Success | - |
| | coreutils-old-version | Failure | task failed: install missing at /home/peter/result/install or not executable |
| | coreutils-old-version | Success | - |
| | coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist |
| | coreutils-old-version-alpine | Failure | task failed: df missing at /home/peter/result/df or not executable |
| | coreutils-static | Success | - |
| | coreutils-static | Success | - |
| | coreutils-static-alpine | Success | - |
| | coreutils-static-alpine | Success | - |
| | cowsay | Success | - |
| | cowsay | Success | - |
| | curl | Success | - |
| | curl | Success | - |
| | curl-ssl | Success | - |
| | curl-ssl | Success | - |
| | curl-ssl-arm64-static | Failure | task failed: curl binary does not exist |
| | curl-ssl-arm64-static | Failure | task failed: curl binary does not exist |
| | curl-ssl-arm64-static2 | Failure | task failed: curl binary does not exist |
| | curl-ssl-arm64-static2 | Failure | task failed: curl binary does not exist |
| | jq | Success | - |
| | jq | Success | - |
| | jq-static | Success | - |
| | jq-static | Success | - |
| | jq-static-musl | Failure | task failed: jq is not statically linked |
| | jq-static-musl | Failure | task failed: jq is not statically linked |
| | jq-windows | Failure | task failed: jq help does not contain expected string |
| | jq-windows | Failure | task failed: jq help does not contain expected string |
| | jq-windows2 | Failure | task failed: jq help does not contain expected string |
| | jq-windows2 | Success | - |
| | coreutils | Success | - |
| | coreutils | Failure | task failed: sha1sum binary does not exist |
| | coreutils-old-version | Success | - |
| | coreutils-old-version | Success | - |
| | coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist |
| | coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist |
| | coreutils-static | Success | - |
| | coreutils-static | Failure | task failed: install missing at /home/peter/result/install or not executable |
| | coreutils-static-alpine | Success | - |
| | coreutils-static-alpine | Success | - |
| | cowsay | Success | - |
| | cowsay | Success | - |
| | curl | Failure | exceeded max tool calls (50) |
| | curl | Success | - |
| | curl-ssl | Success | - |
| | curl-ssl | Success | - |
| | curl-ssl-arm64-static | Failure | task failed: curl binary does not exist |
| | curl-ssl-arm64-static | Failure | task failed: curl binary does not exist |
| | curl-ssl-arm64-static2 | Failure | task failed: curl binary does not exist |
| | curl-ssl-arm64-static2 | Failure | task failed: curl binary does not exist |
| | jq | Success | - |
| | jq | Success | - |
| | jq-static | Failure | task failed: jq is not statically linked |
| | jq-static | Failure | task failed: jq is not statically linked |
| | jq-static-musl | Failure | task failed: jq is not statically linked |
| | jq-static-musl | Failure | task failed: jq is not statically linked |
| | jq-windows | Failure | task failed: jq help does not contain expected string |
| | jq-windows | Failure | task failed: jq help does not contain expected string |
| | jq-windows2 | Failure | task failed: jq help does not contain expected string |
| | jq-windows2 | Success | - |
| | coreutils | Success | - |
| | coreutils | Success | - |
| | coreutils-old-version | Success | - |
| | coreutils-old-version | Success | - |
| | coreutils-old-version-alpine | Failure | exceeded max tool calls (100) |
| | coreutils-old-version-alpine | Success | - |
| | coreutils-static | Success | - |
| | coreutils-static | Success | - |
| | coreutils-static-alpine | Success | - |
| | coreutils-static-alpine | Success | - |
| | cowsay | Success | - |
| | cowsay | Success | - |
| | curl | Success | - |
| | curl | Failure | task failed: curl did not download the expected local file content, but instead: curl: (1) Protocol "file" not supported |
| | curl-ssl | Success | - |
| | curl-ssl | Success | - |
| | curl-ssl-arm64-static | Failure | task failed: curl HTTPS request to google.com did not return content-type: text/html but instead: } [2 bytes data] * SSL... |
| | curl-ssl-arm64-static | Failure | task failed: curl HTTPS request to google.com did not return content-type: text/html but instead: } [2 bytes data] * SSL... |
| | curl-ssl-arm64-static2 | Success | - |
| | curl-ssl-arm64-static2 | Failure | task failed: curl HTTPS request to google.com did not return content-type: text/html but instead: } [2 bytes data] * SSL... |
| | jq | Success | - |
| | jq | Success | - |
| | jq-static | Success | - |
| | jq-static | Success | - |
| | jq-static-musl | Success | - |
| | jq-static-musl | Success | - |
| | jq-windows | Success | - |
| | jq-windows | Success | - |
| | jq-windows2 | Success | - |
| | jq-windows2 | Success | - |
| | coreutils | Success | - |
| | coreutils | Success | - |
| | coreutils-old-version | Success | - |
| | coreutils-old-version | Success | - |
| | coreutils-old-version-alpine | Failure | task failed: df missing at /home/peter/result/df or not executable |
| | coreutils-old-version-alpine | Failure | task failed: df missing at /home/peter/result/df or not executable |
| | coreutils-static | Success | - |
| | coreutils-static | Success | - |
| | coreutils-static-alpine | Success | - |
| | coreutils-static-alpine | Success | - |
| | cowsay | Success | - |
| | cowsay | Success | - |
| | curl | Success | - |
| | curl | Success | - |
| | curl-ssl | Success | - |
| | curl-ssl | Success | - |
| | curl-ssl-arm64-static | Success | - |
| | curl-ssl-arm64-static | Failure | task failed: curl HTTPS request to google.com did not return content-type: text/html but instead: } [2 bytes data] * SSL... |
| | curl-ssl-arm64-static2 | Failure | task failed: curl HTTPS request to google.com did not return content-type: text/html but instead: } [2 bytes data] * SSL... |
| | curl-ssl-arm64-static2 | Failure | task failed: curl HTTPS request to google.com did not return content-type: text/html but instead: } [2 bytes data] * SSL... |
| | jq | Success | - |
| | jq | Success | - |
| | jq-static | Success | - |
| | jq-static | Success | - |
| | jq-static-musl | Success | - |
| | jq-static-musl | Success | - |
| | jq-windows | Success | - |
| | jq-windows | Success | - |
| | jq-windows2 | Success | - |
| | jq-windows2 | Success | - |
| | coreutils | Success | - |
| | coreutils | Success | - |
| | coreutils-old-version | Success | - |
| | coreutils-old-version | Failure | task failed: sha1sum binary does not exist |
| | coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist |
| | coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist |
| | coreutils-static | Success | - |
| | coreutils-static | Success | - |
| | coreutils-static-alpine | Success | - |
| | coreutils-static-alpine | Failure | task failed: tail missing at /home/peter/result/tail or not executable |
| | cowsay | Success | - |
| | cowsay | Success | - |
| | curl | Failure | task failed: curl binary does not exist |
| | curl | Failure | task failed: curl binary does not exist |
| | curl-ssl | Success | - |
| | curl-ssl | Failure | task failed: curl binary does not exist |
| | curl-ssl-arm64-static | Failure | task failed: curl binary does not exist |
| | curl-ssl-arm64-static | Failure | task failed: curl binary does not exist |
| | curl-ssl-arm64-static2 | Failure | task failed: curl binary does not exist |
| | curl-ssl-arm64-static2 | Failure | task failed: curl binary does not exist |
| | jq | Success | - |
| | jq | Success | - |
| | jq-static | Failure | task failed: jq is not statically linked |
| | jq-static | Failure | task failed: jq binary does not exist |
| | jq-static-musl | Failure | task failed: jq is not statically linked |
| | jq-static-musl | Failure | task failed: jq is not statically linked |
| | jq-windows | Failure | task failed: jq help does not contain expected string |
| | jq-windows | Failure | task failed: jq help does not contain expected string |
| | jq-windows2 | Failure | task failed: jq help does not contain expected string |
| | jq-windows2 | Failure | task failed: jq help does not contain expected string |
| | coreutils | Success | - |
| | coreutils | Failure | task failed: false missing at /home/peter/result/false or not executable |
| | coreutils-old-version | Success | - |
| | coreutils-old-version | Success | - |
| | coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist |
| | coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist |
| | coreutils-static | Success | - |
| | coreutils-static | Failure | task failed: kill missing at /home/peter/result/kill or not executable |
| | coreutils-static-alpine | Failure | task failed: kill missing at /home/peter/result/kill or not executable |
| | coreutils-static-alpine | Failure | task failed: groups missing at /home/peter/result/groups or not executable |
| | cowsay | Success | - |
| | cowsay | Success | - |
| | curl | Success | - |
| | curl | Success | - |
| | curl-ssl | Success | - |
| | curl-ssl | Success | - |
| | curl-ssl-arm64-static | Failure | task failed: curl-arm64 is not statically linked |
| | curl-ssl-arm64-static | Failure | task failed: curl-arm64 is not statically linked |
| | curl-ssl-arm64-static2 | Failure | task failed: curl HTTPS request to google.com did not return content-type: text/html but instead: } [2 bytes data] * SSL... |
| | curl-ssl-arm64-static2 | Failure | task failed: curl-arm64 is not statically linked |
| | jq | Success | - |
| | jq | Success | - |
| | jq-static | Failure | task failed: jq is not statically linked |
| | jq-static | Success | - |
| | jq-static-musl | Success | - |
| | jq-static-musl | Failure | task failed: jq is not statically linked |
| | jq-windows | Failure | task failed: jq help does not contain expected string |
| | jq-windows | Failure | task failed: jq help does not contain expected string |
| | jq-windows2 | Failure | task failed: jq help does not contain expected string |
| | jq-windows2 | Success | - |
| | coreutils | Failure | task failed: sha1sum binary does not exist |
| | coreutils | Success | - |
| | coreutils-old-version | Failure | task failed: sha1sum binary does not exist |
| | coreutils-old-version | Failure | task failed: sha1sum binary does not exist |
| | coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist |
| | coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist |
| | coreutils-static | Success | - |
| | coreutils-static | Success | - |
| | coreutils-static-alpine | Success | - |
| | coreutils-static-alpine | Success | - |
| | cowsay | Success | - |
| | cowsay | Success | - |
| | curl | Success | - |
| | curl | Failure | task failed: curl did not download the expected local file content, but instead: curl: (1) Protocol "file" not supported |
| | curl-ssl | Success | - |
| | curl-ssl | Success | - |
| | curl-ssl-arm64-static | Failure | task failed: curl binary does not exist |
| | curl-ssl-arm64-static | Failure | task failed: curl binary does not exist |
| | curl-ssl-arm64-static2 | Failure | unknown tool: |
| | curl-ssl-arm64-static2 | Failure | task failed: curl binary does not exist |
| | jq | Success | - |
| | jq | Success | - |
| | jq-static | Success | - |
| | jq-static | Failure | task failed: jq binary does not exist |
| | jq-static-musl | Failure | task failed: jq is not statically linked |
| | jq-static-musl | Failure | task failed: jq binary does not exist |
| | jq-windows | Failure | task failed: jq help does not contain expected string |
| | jq-windows | Failure | task failed: jq help does not contain expected string |
| | jq-windows2 | Failure | unknown tool: |
| | jq-windows2 | Success | - |
| | coreutils | Success | - |
| | coreutils | Success | - |
| | coreutils-old-version | Success | - |
| | coreutils-old-version | Success | - |
| | coreutils-old-version-alpine | Failure | context deadline exceeded |
| | coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist |
| | coreutils-static | Failure | task failed: sha1sum is not statically linked |
| | coreutils-static | Success | - |
| | coreutils-static-alpine | Success | - |
| | coreutils-static-alpine | Failure | task failed: install missing at /home/peter/result/install or not executable |
| | cowsay | Success | - |
| | cowsay | Success | - |
| | curl | Success | - |
| | curl | Success | - |
| | curl-ssl | Success | - |
| | curl-ssl | Success | - |
| | curl-ssl-arm64-static | Failure | task failed: curl HTTPS request to google.com did not return content-type: text/html but instead: } [2 bytes data] * SSL... |
| | curl-ssl-arm64-static | Failure | task failed: curl binary does not exist |
| | curl-ssl-arm64-static2 | Success | - |
| | curl-ssl-arm64-static2 | Failure | task failed: curl-arm64 is not statically linked |
| | jq | Success | - |
| | jq | Success | - |
| | jq-static | Success | - |
| | jq-static | Failure | task failed: jq binary does not exist |
| | jq-static-musl | Failure | task failed: jq binary does not exist |
| | jq-static-musl | Success | - |
| | jq-windows | Success | - |
| | jq-windows | Success | - |
| | jq-windows2 | Success | - |
| | jq-windows2 | Success | - |
|
|
| coreutils | Success | - |
| coreutils | Success | - |
| coreutils-old-version | Success | - |
| coreutils-old-version | Success | - |
| coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist |
| coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist |
| coreutils-static | Success | - |
| coreutils-static | Success | - |
| coreutils-static-alpine | Success | - |
| coreutils-static-alpine | Success | - |
| cowsay | Success | - |
| cowsay | Success | - |
| curl | Success | - |
| curl | Success | - |
| curl-ssl | Success | - |
| curl-ssl | Success | - |
| curl-ssl-arm64-static | Failure | task failed: curl-arm64 is not statically linked |
| curl-ssl-arm64-static | Failure | task failed: curl-arm64 is not statically linked |
| curl-ssl-arm64-static2 | Failure | task failed: curl binary does not exist |
| curl-ssl-arm64-static2 | Failure | failed to unmarshal shell-harness response: unexpected end of JSON input |
| jq | Success | - |
| jq | Success | - |
| jq-static | Success | - |
| jq-static | Failure | task failed: jq is not statically linked |
| jq-static-musl | Success | - |
| jq-static-musl | Failure | task failed: jq is not statically linked |
| jq-windows | Failure | task failed: jq help does not contain expected string |
| jq-windows | Failure | task failed: jq help does not contain expected string |
| jq-windows2 | Success | - |
| jq-windows2 | Success | - |
| coreutils | Failure | task failed: sha1sum binary does not exist |
| coreutils | Success | - |
| coreutils-old-version | Failure | unknown tool: run_system_cmd |
| coreutils-old-version | Failure | exceeded max tool calls (70) |
| coreutils-old-version-alpine | Failure | invalid character '*' in string escape code |
| coreutils-old-version-alpine | Failure | task failed: df missing at /home/peter/result/df or not executable |
| coreutils-static | Success | - |
| coreutils-static | Success | - |
| coreutils-static-alpine | Success | - |
| coreutils-static-alpine | Success | - |
| cowsay | Failure | task failed: Cowsay binary does not exist |
| cowsay | Success | - |
| curl | Success | - |
| curl | Success | - |
| curl-ssl | Success | - |
| curl-ssl | Success | - |
| curl-ssl-arm64-static | Failure | task failed: curl-arm64 is not statically linked |
| curl-ssl-arm64-static | Failure | task failed: curl-arm64 is not statically linked |
| curl-ssl-arm64-static2 | Failure | invalid character '\\' after object key:value pair |
| curl-ssl-arm64-static2 | Failure | task failed: curl-arm64 is not statically linked |
| jq | Success | - |
| jq | Success | - |
| jq-static | Success | - |
| jq-static | Success | - |
| jq-static-musl | Failure | task failed: jq binary does not exist |
| jq-static-musl | Failure | task failed: jq is not statically linked |
| jq-windows | Failure | invalid character '.' in string escape code |
| jq-windows | Success | - |
| jq-windows2 | Success | - |
| jq-windows2 | Success | - |
| coreutils | Success | - |
| coreutils | Success | - |
| coreutils-old-version | Success | - |
| coreutils-old-version | Failure | exceeded max cost dollars (max=$3.00, current=3.01) |
| coreutils-old-version-alpine | Failure | POST "https://openrouter.ai/api/v1/chat/completions": 400 Bad Request {"message":"Provider returned error","code":400,"m... |
| coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist |
| coreutils-static | Success | - |
| coreutils-static | Failure | task failed: install missing at /home/peter/result/install or not executable |
| coreutils-static-alpine | Success | - |
| coreutils-static-alpine | Failure | LLM call failed: POST "https://openrouter.ai/api/v1/chat/completions": 429 Too Many Requests {"message":"Rate limit exce... |
| cowsay | Failure | task failed: Cowsay does not contain expected string (eyes) |
| cowsay | Failure | task failed: Cowsay does not contain expected string (eyes) |
| curl | Success | - |
| curl | Failure | LLM call failed: POST "https://openrouter.ai/api/v1/chat/completions": 429 Too Many Requests {"message":"Rate limit exce... |
| curl-ssl | Success | - |
| curl-ssl | Failure | LLM call failed: POST "https://openrouter.ai/api/v1/chat/completions": 429 Too Many Requests {"message":"Rate limit exce... |
| curl-ssl-arm64-static | Failure | task failed: curl-arm64 is not aarch64 architecture |
| curl-ssl-arm64-static | Failure | task failed: curl-arm64 is not statically linked |
| curl-ssl-arm64-static2 | Failure | task failed: curl brotli compression test failed - content-encoding: br not found |
| curl-ssl-arm64-static2 | Failure | task failed: curl-arm64 is not aarch64 architecture |
| jq | Success | - |
| jq | Success | - |
| jq-static | Failure | task failed: jq is not statically linked |
| jq-static | Failure | task failed: jq is not statically linked |
| jq-static-musl | Success | - |
| jq-static-musl | Success | - |
| jq-windows | Success | - |
| jq-windows | Failure | LLM call failed: POST "https://openrouter.ai/api/v1/chat/completions": 429 Too Many Requests {"message":"Rate limit exce... |
| jq-windows2 | Success | - |
| jq-windows2 | Failure | task failed: jq.exe binary does not exist |