feat: Add AI-powered issue triage and PR review bot by sudsali · Pull Request #260 · awslabs/python-deequ

sudsali · 2026-04-20T23:12:25Z

Summary

Adds a GitHub Actions bot that automatically triages issues and reviews PRs using Amazon Bedrock. Runs on issue open/reopen, issue comments, and PR open/reopen/synchronize events.

Issue Handling

Classifies issues (bug, question, feature request, off-topic) using a two-phase LLM call
First pass classifies and selects source files to read; second pass responds with full code context
Follows up on user replies (capped at 2 bot replies per issue)
Detects user dissatisfaction and escalates to maintainers via Slack
Escalates reopened issues directly to Slack
Labels issues automatically

PR Review

Posts inline review comments on specific diff lines via GitHub review API
Validates line numbers against diff hunks before submitting
Uses REQUEST_CHANGES when findings exist
Re-reviews on new commits (synchronize trigger)
Reads existing review comments to avoid duplicates

Architecture

GitHub Event → Job 1: analyze (read-only, AWS creds)
                ├── Load KB from S3
                ├── Generate codebase map
                ├── Fetch issue/PR + comments
                ├── Bedrock Converse API (system prompt cached, user content guarded)
                ├── Structured JSON output
                ├── Sanitize output
                └── Write JSON artifact
              → Job 2: act (write-only, no AWS creds)
                ├── Post comment/review to GitHub
                ├── Add labels
                └── Notify Slack

Security

Bedrock Guardrails — prompt injection detection + sensitive info blocking (replaces regex-based sanitizer)
Trusted/untrusted split — prompt instructions go as system prompt (cached, not guarded); user content (issue body, PR diff, comments) wrapped in guardContent and run through guardrail.
Template injection prevention — string.Template.safe_substitute instead of .format()
Two-job architecture — read-only analyze job (AWS creds, no GitHub write), write-only act job (GitHub write, no AWS creds)
Fork PR blocking — fork PRs are not processed
Output sanitization — backup injection markers + #N references wrapped in backticks to prevent auto-linking

Files

File	Purpose
`issue-bot.yml`	GitHub Actions workflow
`main.py`	Two-phase orchestrator (`analyze`/`act`)
`config.py`	Environment variable loading + allowed labels
`bedrock_client.py`	Bedrock Converse API with prompt caching, guardrails, structured output, circuit breaker
`github_client.py`	GitHub API + local file reads + codebase map + diff line validation
`knowledge_base.py`	S3-backed KB with relevance truncation
`slack_client.py`	Slack webhook notifications
`sanitizer.py`	Backup injection blocking + issue ref formatting
`prompts.py`	Prompt loading from env vars
`schemas/*.json`	JSON schemas for structured LLM output (issue, PR review, followup)
`tests/test_bot.py`	42 unit tests for parsing, validation, and sanitization

Required GitHub Secrets

AWS_ROLE_ARN, KB_S3_BUCKET, KB_S3_KEY, BEDROCK_MODEL_ID, BEDROCK_API_VERSION, SLACK_WEBHOOK_URL, ISSUE_CLASSIFY_PROMPT, ISSUE_RESPOND_PROMPT, PR_FILE_REVIEW_PROMPT, FOLLOWUP_PROMPT, GUARDRAIL_ID

Testing

Supports workflow_dispatch with dry_run: true. Tested on the fork against bug reports, usage questions, feature requests, follow-ups, dissatisfaction escalation, prompt injection, duplicate runs, PR inline reviews, and re-reviews on new commits.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

… retries, smoke tests

…render injection, pagination

…eduplicate API calls

…ials false, metadata footer, os.walk, retention 30d

…n, remove dead code

…, URL validation

… boto3 from act job

…ontaining security strings

…tives on security code

… diffs Bedrock Guardrails scan all content (system prompt AND user message). PR diffs containing the bot's own security code (e.g. 'ignore previous instructions' in sanitizer.py) trigger guardrail false positives. Fix: add use_guardrail parameter to invoke(). PR reviews pass use_guardrail=False since diffs are code, not user-generated attacks. Issue/followup prompts still use the guardrail (default True). Reverts _split_prompt to the original simple <knowledge_base> split.

…pt to avoid guardrail false positives

…lse positives

…havior

SamPom100 reviewed Apr 21, 2026

View reviewed changes

feat: Add AI-powered issue triage and PR review bot

32dfbe2

sudsali force-pushed the master branch from 89e4e41 to 32dfbe2 Compare April 21, 2026 21:10

sudsali added 14 commits April 22, 2026 14:30

fix: Address review - path traversal, guardrail fallback, elif chain,…

69af5db

… retries, smoke tests

fix: Address review round 2 - sanitization dispatch, path traversal, …

276d752

…render injection, pagination

fix: Review round 3 - empty response handling, inline sanitization, d…

ea4b111

…eduplicate API calls

fix: Pin actions to SHAs, add secret scanning backup, persist-credent…

2dd3ec2

…ials false, metadata footer, os.walk, retention 30d

fix: P0 - uuid render tokens, specific exceptions, artifact validatio…

175ee51

…n, remove dead code

test: Add artifact validation tests for action enum, title truncation…

9b53d70

…, URL validation

chore: Remove dead code (_parse_file_review, has_bot_commented), drop…

be5b256

… boto3 from act job

fix: Handle empty GUARDRAIL_VERSION env var defaulting to DRAFT

6c81302

fix: Restore boto3 in act job - needed for module import chain

14325d6

fix: Three-part prompt split to prevent guardrail blocking PR diffs c…

9feed00

…ontaining security strings

fix: Move context/diff to system prompt to avoid guardrail false posi…

f04a19d

…tives on security code

fix: Split at user_input marker - KB and diffs must be in system prom…

bb7958c

…pt to avoid guardrail false positives

test: Update split_prompt tests to match user_input marker

ae624be

sudsali force-pushed the master branch from f8dcffd to ae624be Compare April 22, 2026 23:20

sudsali added 2 commits April 22, 2026 19:23

fix: Re-enable guardrail for all paths - user_input split prevents fa…

a00cbe3

…lse positives

docs: Update _split_prompt docstring to explain guardrail scanning be…

65e128c

…havior

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Add AI-powered issue triage and PR review bot#260

feat: Add AI-powered issue triage and PR review bot#260
sudsali wants to merge 17 commits intoawslabs:masterfrom
sudsali:master

sudsali commented Apr 20, 2026 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

sudsali commented Apr 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Issue Handling

PR Review

Architecture

Security

Files

Required GitHub Secrets

Testing

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

sudsali commented Apr 20, 2026 •

edited

Loading