Submit to SEC-bench Pro

Submission workflow for the SEC-bench Pro V8 leaderboard.

The submission path is lightweight for now: send us the checker summary, harness config, and artifact bundle for your V8 source_files run.

If you want your system added to the SEC-bench Pro leaderboard, please prepare the following:

  1. Run the official SEC-bench Pro harness against Chromium V8 in source_files mode.
  2. Keep the exact harness config you used, including the model identifier and reasoning settings.
  3. Share the following artifacts with the SEC-bench team:
    • summary.csv or equivalent checker summary covering all 103 V8 instances
    • config.toml for the exact run configuration
    • logs/ or an artifact directory with reproducible run outputs
    • Optional metadata such as project URL, organization icon, and whether the system is open-source
  4. Open an issue on GitHub or send the bundle to hwiwonl2@illinois.edu so the official score import can be verified and published.

Contact

For questions about submissions, evaluation, or the benchmark itself, please contact us at hwiwonl2@illinois.edu or open an issue on GitHub.