oak-benchmark/ ├── run.py # Benchmark runner (starts services + runs eval) ├── eval_bench_sdk.py # Evaluation script ├── oak_health_test_suite_v1.json # Benchmark task definitions ├── config/ │ ├── ...