
Eval Directory

This directory contains prompts for various software engineering tasks that can be executed by an agent. Each subdirectory represents a single, self-contained task.

The tasks serve two purposes: they form an eval set for regression testing on a Chromium CI builder, and they act as a reference when writing new tasks.

Subdirectory Structure

Each subdirectory should contain all the necessary files and information for the agent to perform the task. This includes:

  • eval.md: A file describing the task and its outcome.
  • prompt.md: The prompt that initiates the task.
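The layout described above can be checked mechanically. The following is a minimal sketch, not part of the Chromium tooling: the function name `find_incomplete_tasks` and the `eval_root` parameter are hypothetical, and it assumes only that every directory directly under the eval root is a task.

```python
from pathlib import Path

# File names taken from the list above.
REQUIRED_FILES = ("eval.md", "prompt.md")


def find_incomplete_tasks(eval_root):
    """Map each task subdirectory name to the required files it lacks.

    Hypothetical helper: assumes every directory directly under
    eval_root is a task directory.
    """
    missing = {}
    for task_dir in sorted(Path(eval_root).iterdir()):
        if not task_dir.is_dir():
            continue  # skip README.md and other top-level files
        absent = [
            name for name in REQUIRED_FILES
            if not (task_dir / name).is_file()
        ]
        if absent:
            missing[task_dir.name] = absent
    return missing
```

An empty result means every task directory contains both files.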

eval.md Format

Note: this format is subject to change as a regression test suite is implemented.

The eval.md file in each subdirectory should follow this format:

  • Owner: Mailing list or person to contact if the task stops reproducing.
  • Description: A brief description of the task.
  • Git-Revision: The git revision on which the task was successfully performed. This is used for reproducibility.
  • Result: A summary of what the agent accomplished.
  • Modified files: A list of the files that were modified by the agent during the task.

This structure keeps each task documented well enough to be understood and reproduced.
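Because the format may still change, a lightweight check for the five field names can catch drift early. The sketch below is an assumption-laden illustration: the README does not fix the exact syntax, so it assumes each field appears as `Field: value` at the start of a line, optionally after a list marker. The name `missing_eval_fields` is hypothetical.

```python
import re

# Field names taken from the list above.
REQUIRED_FIELDS = (
    "Owner",
    "Description",
    "Git-Revision",
    "Result",
    "Modified files",
)

# Assumed layout: "Field: value" at the start of a line, optionally
# preceded by a list marker. The README does not specify the syntax.
_FIELD_RE = re.compile(
    r"^\s*(?:[-*•]\s+)?([A-Za-z][A-Za-z -]*?)\s*:",
    re.MULTILINE,
)


def missing_eval_fields(text):
    """Return the required field names that do not appear in text."""
    found = {m.group(1).strip().lower() for m in _FIELD_RE.finditer(text)}
    return [f for f in REQUIRED_FIELDS if f.lower() not in found]
```

Passing the contents of an eval.md file returns the fields that still need to be filled in, in the order listed above.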