tools/oss-benchmark.mjs.
Default cohort
The default cohort uses GitHub Trending repositories from these language tracks:typescriptpythongorustrubyphpjava
10 repositories per language.
Deterministic scan command
Each benchmark scan runs:Workflow
Build the CLI first:Output layout
Generated benchmark data is ignored by git and written undertools/benchmark-data/.
| Path | Contents |
|---|---|
cohorts/*.json | Frozen repository lists |
repos/<language>/<owner>__<repo>/ | Cached clones |
runs/<run-id>/summary.json | Machine-readable aggregate report |
runs/<run-id>/summary.md | Human review report |
runs/<run-id>/repos/.../scan.json | Raw per-repo scan JSON |
runs/<run-id>/repos/.../stdout.txt | Captured stdout |
runs/<run-id>/repos/.../stderr.txt | Captured stderr |
runs/<run-id>/repos/.../metadata.json | Reproduction metadata |
Review loop
- Check failures first so the cohort is complete.
- Check the lowest-score repositories.
- Check the highest-volume rules across many repositories.
- Open per-repo
scan.jsonfiles for likely false positives. - Fix the rules and add regression tests.
- Re-run the same manifest as the next iteration.
