examples: minor tweak on llm_as_a_judge example #1284

mshsheikh · 2025-07-28T14:10:40Z

Added trailing spaces between adjacent string literals for clearer agent prompts
Inserted missing “the” in evaluator instructions for grammatical accuracy

* Introduced `max_attempts` counter to prevent infinite judging loops (defaults to 5) * Added trailing spaces between adjacent string literals for clearer agent prompts * Inserted missing “the” in evaluator instructions for grammatical accuracy

seratch · 2025-07-28T23:51:23Z

examples/agent_patterns/llm_as_a_judge.py

@@ -30,9 +30,9 @@ class EvaluationFeedback:
 evaluator = Agent[None](
    name="evaluator",
    instructions=(
-        "You evaluate a story outline and decide if it's good enough."
-        "If it's not good enough, you provide feedback on what needs to be improved."
-        "Never give it a pass on the first try. After 5 attempts, you can give it a pass if story outline is good enough - do not go for perfection"


the instructions allow running 5+ times, so the changes in this PR are inconsistent. if you remove the max_attempts etc., we are happy to merge other changes.

Thanks for pointing that out. The max_attempts logic has been removed, and only the instruction formatting fixes are kept.

seratch · 2025-07-29T04:09:05Z

examples/agent_patterns/llm_as_a_judge.py

@@ -46,6 +46,8 @@ async def main() -> None:

    # We'll run the entire workflow in a single trace
    with trace("LLM as a judge"):
+        max_attempts = 5


as i mentioned above, please remove this additional logic, which is not necessary

Apologies, I clicked the review request by mistake before reverting the changes.

* Reverted the `max_attempts` safeguard logic to restore the intended behavior of the LLM-as-a-judge pattern. The evaluator instructions use the phrase “After 5 attempts, you *can* give it a pass,” indicating discretion rather than a strict limit. * Retained fixes to instruction string formatting to avoid run-together text (e.g., “input.If there” → “input. If there”). * Corrected a minor grammatical issue by adding “the” before “story outline” for proper English usage.

seratch requested changes Jul 28, 2025

View reviewed changes

seratch added the documentation Improvements or additions to documentation label Jul 28, 2025

mshsheikh requested a review from seratch July 29, 2025 04:07

seratch reviewed Jul 29, 2025

View reviewed changes

seratch approved these changes Jul 29, 2025

View reviewed changes

seratch changed the title ~~feat: add loop safeguard and fix instruction spacing/grammar~~ examples: minor tweak on llm_as_a_judge example Jul 29, 2025

seratch merged commit 4cb07d5 into openai:main Jul 29, 2025
5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

examples: minor tweak on llm_as_a_judge example #1284

examples: minor tweak on llm_as_a_judge example #1284

Uh oh!

mshsheikh commented Jul 28, 2025 •

edited

Loading

Uh oh!

seratch Jul 28, 2025

Uh oh!

mshsheikh Jul 29, 2025

Uh oh!

seratch Jul 29, 2025

Uh oh!

mshsheikh Jul 29, 2025

Uh oh!

Uh oh!

Uh oh!

examples: minor tweak on llm_as_a_judge example #1284

examples: minor tweak on llm_as_a_judge example #1284

Uh oh!

Conversation

mshsheikh commented Jul 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

seratch Jul 28, 2025

Choose a reason for hiding this comment

Uh oh!

mshsheikh Jul 29, 2025

Choose a reason for hiding this comment

Uh oh!

seratch Jul 29, 2025

Choose a reason for hiding this comment

Uh oh!

mshsheikh Jul 29, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

mshsheikh commented Jul 28, 2025 •

edited

Loading