
Building a Production AI Code Review Assistant with Google ADK

1. The Late Night Code Review

It's 2:00 AM

You've been debugging for hours. The function looks right, but something is off. You know the feeling - when code should work but doesn't, and you can no longer see why because you've been staring at it for too long.

def dfs_search_v1(graph, start, target):
    """Find if target is reachable from start."""
    visited = set()
    stack = start  # Looks innocent enough...
   
    while stack:
        current = stack.pop()
       
        if current == target:
            return True
           
        if current not in visited:
            visited.add(current)
           
            for neighbor in graph[current]:
                if neighbor not in visited:
                    stack.append(neighbor)
   
    return False

The AI Developer Journey

If you're reading this, you've probably experienced the transformation AI has brought to coding. Tools like Gemini Code Assist, Claude Code, and Cursor have changed how we write code. They're excellent for generating boilerplate, suggesting implementations, and accelerating development.

But you're here because you want to go deeper. You want to understand how to build these AI systems, not just use them. You want to create something that:

Has predictable, traceable behavior
Can be deployed to production with confidence
Delivers consistent results you can rely on
Shows you exactly how it makes decisions

From Consumer to Creator

[Figure: architecture diagram]

Today, you'll make the leap from using AI tools to building them. You'll build a multi-agent system that:

Analyzes code structure deterministically
Runs real tests to verify behavior
Validates style compliance with real linters
Synthesizes findings into actionable feedback
Deploys to Google Cloud with full observability

2. Your First Agent Deployment

The Developer's Question

"I understand LLMs, I've used the APIs, but how do I go from a Python script to a production AI agent that scales?"

Let's answer this by setting up your environment properly, then building a simple agent to understand the basics before moving on to production patterns.

Essential Setup First

Before creating any agents, let's make sure your Google Cloud environment is ready.


Click Activate Cloud Shell at the top of the Google Cloud console (it's the terminal-shaped icon at the top of the Cloud Shell pane).

Find your Google Cloud Project ID:

Open the Google Cloud Console: https://console.cloud.google.com
Select the project you want to use for this workshop from the project dropdown at the top of the page.
Your Project ID is displayed in the Project info card on the Dashboard.

Step 1: Set Your Project ID

In Cloud Shell, the gcloud command-line tool is already configured. Run the following command to set your active project. It uses the $GOOGLE_CLOUD_PROJECT environment variable, which is set automatically in your Cloud Shell session.

gcloud config set project $GOOGLE_CLOUD_PROJECT

Step 2: Verify Your Setup

Next, run the following commands to confirm that your project is set correctly and that you are authenticated.

# Confirm project is set
echo "Current project: $(gcloud config get-value project)"

# Check authentication status
gcloud auth list

You should see your Project ID printed, and your user account listed with (ACTIVE) next to it.

If your account is not listed as active, or if you get an authentication error, run the following command to log in:

gcloud auth application-default login

Step 3: Enable Essential APIs

We need at least these APIs for the basic agent:

gcloud services enable \
    aiplatform.googleapis.com \
    compute.googleapis.com

This may take a minute or two. You'll see:

Operation "operations/..." finished successfully.

Step 4: Install ADK

# Install the ADK CLI
pip install google-adk --upgrade

# Verify installation
adk --version

You should see a version number such as 1.15.0 or higher.

Now Create Your Basic Agent

With the environment ready, let's create that simple agent.

Step 5: Use ADK Create

adk create my_first_agent

Follow the interactive prompts:

Choose a model for the root agent:
1. gemini-2.5-flash
2. Other models (fill later)
Choose model (1, 2): 1

1. Google AI
2. Vertex AI
Choose a backend (1, 2): 2

Enter Google Cloud project ID [auto-detected-from-gcloud]:
Enter Google Cloud region [us-central1]:

Step 6: Examine What Was Created

cd my_first_agent
ls -la

You'll find three files:

.env          # Configuration (auto-populated with your project)
__init__.py   # Package marker
agent.py      # Your agent definition

Step 7: Quick Configuration Check

# Verify the .env was created correctly
cat .env

# Should show something like:
# GOOGLE_CLOUD_PROJECT=your-project-id
# GOOGLE_CLOUD_LOCATION=us-central1
# GOOGLE_GENAI_USE_VERTEXAI=1

If the project ID is missing or incorrect, edit the .env file:

nano .env  # or use your preferred editor
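
If you would rather check the file programmatically than by eye, the following is a minimal sketch of such a check. The helper names parse_env and missing_keys are hypothetical (they are not part of ADK), and the sketch assumes the simple KEY=VALUE layout shown above, with your-project-id treated as an unfilled placeholder.

```python
from typing import Dict, List

# Keys the generated .env is expected to contain (taken from the listing above).
REQUIRED_KEYS = (
    "GOOGLE_CLOUD_PROJECT",
    "GOOGLE_CLOUD_LOCATION",
    "GOOGLE_GENAI_USE_VERTEXAI",
)


def parse_env(text: str) -> Dict[str, str]:
    """Parse simple KEY=VALUE lines, skipping blanks and comments."""
    values: Dict[str, str] = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def missing_keys(values: Dict[str, str]) -> List[str]:
    """Return required keys that are absent, empty, or still a placeholder."""
    return [
        key for key in REQUIRED_KEYS
        if not values.get(key) or values[key] == "your-project-id"
    ]


# Hypothetical file contents; in practice you would read the real .env file.
sample = """
# generated by adk create
GOOGLE_CLOUD_PROJECT=your-project-id
GOOGLE_CLOUD_LOCATION=us-central1
GOOGLE_GENAI_USE_VERTEXAI=1
"""
print(missing_keys(parse_env(sample)))
```

Here the placeholder project ID is reported as missing, which is the case the editing step above is meant to fix.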

Step 8: Look at the Agent Code

cat agent.py

from google.adk.agents.llm_agent import Agent

root_agent = Agent(
    model='gemini-2.5-flash',
    name='root_agent',
    description='A helpful assistant for user questions.',
    instruction='Answer user questions to the best of your knowledge',
)

Simple, clean, minimal. This is your "Hello World" of agents.

Test Your Basic Agent

Step 9: Run Your Agent

cd ..
adk run my_first_agent

You should see something like this:

Log setup complete: /tmp/agents_log/agent.20250930_162430.log
To access latest log: tail -F /tmp/agents_log/agent.latest.log
[user]:

Step 10: Try Some Queries

In the terminal where adk run is running, you'll see a prompt. Type your queries:

[user]: Hey, what can you do?
[root_agent]: I can help answer questions, generate text, explain concepts, assist with problem-solving, and provide information on a wide range of topics. What would you like to know or discuss?

[user]: What's the current weather in Mountain View?
[root_agent]: I'm sorry, but I cannot provide real-time weather information as I don't have access to current data or weather services. To get the current weather in Mountain View, I recommend checking a weather website or app like weather.com or using a search engine.

Notice the limitation - it can't access current data. Let's push further:

[user]: Review this Python code: def add(a, b): return a + b
[root_agent]: This is a simple Python function that adds two numbers. Here's my review:

**Strengths:**
- Clean and concise implementation
- Follows Python conventions for simple operations
- Clear function name that describes its purpose

**Suggestions for improvement:**
- Add a docstring to document the function
- Consider adding type hints for better code clarity
- You might want to handle edge cases

Here's an enhanced version:
def add(a: float, b: float) -> float:
    """Add two numbers and return their sum."""
    return a + b

The agent can discuss code, but can it:

Actually parse the AST to understand structure?
Run tests to verify that the code works?
Check style compliance?
Remember your previous reviews?

No. This is where we need architecture.

🏃🚪 Press Ctrl+C to exit when you are done exploring.

3. Preparing Your Production Workspace

The Solution: A Production-Ready Architecture

That simple agent showed the starting point, but a production system needs a robust structure. We'll now set up a complete project that embodies production principles.

Setting Up the Foundation

You've already configured your Google Cloud project for the basic agent. Now let's prepare the full workspace with all the tools, patterns, and infrastructure a real system needs.

Step 1: Get the Structured Project

First, exit any running adk run with Ctrl+C and clean up:

# Clean up the basic agent
cd ~  # Make sure you're not inside my_first_agent
rm -rf my_first_agent

# Get the production scaffold
git clone https://github.com/ayoisio/adk-code-review-assistant.git
cd adk-code-review-assistant
git checkout codelab

Step 2: Create and Activate a Virtual Environment

# Create the virtual environment
python -m venv .venv

# Activate it
# On macOS/Linux:
source .venv/bin/activate
# On Windows:
# .venv\Scripts\activate

Verify: Your prompt should now show (.venv) at the beginning.

Step 3: Install Dependencies

pip install -r code_review_assistant/requirements.txt

# Install the package in editable mode (enables imports)
pip install -e .

This installs:

google-adk - The ADK framework
pycodestyle - For PEP 8 checking
vertexai - For cloud deployment
Other production dependencies

The -e flag lets you import code_review_assistant modules from anywhere.

Step 4: Configure Your Environment

# Copy the example environment file
cp .env.example .env

# Edit .env and replace the placeholders:
# - GOOGLE_CLOUD_PROJECT=your-project-id → your actual project ID
# - Keep other defaults as-is

Verify: Check your configuration:

cat .env

It should show:

GOOGLE_CLOUD_PROJECT=your-actual-project-id
GOOGLE_CLOUD_LOCATION=us-central1
GOOGLE_GENAI_USE_VERTEXAI=TRUE

Step 5: Ensure Authentication

Since you already ran gcloud auth earlier, let's just verify:

# Check current authentication
gcloud auth list

# Should show your account with (ACTIVE)
# If not, run:
gcloud auth application-default login

Step 6: Enable Additional Production APIs

We already enabled the basic APIs. Now add the production ones:

gcloud services enable \
    sqladmin.googleapis.com \
    run.googleapis.com \
    cloudbuild.googleapis.com \
    artifactregistry.googleapis.com \
    storage.googleapis.com \
    cloudtrace.googleapis.com

This enables:

SQL Admin: For Cloud SQL if using Cloud Run
Cloud Run: For serverless deployment
Cloud Build: For automated deployments
Artifact Registry: For container images
Cloud Storage: For artifacts and staging
Cloud Trace: For observability

Step 7: Create an Artifact Registry Repository

Our deployment will build container images that need a home:

gcloud artifacts repositories create code-review-assistant-repo \
    --repository-format=docker \
    --location=us-central1 \
    --description="Docker repository for Code Review Assistant"

You should see:

Created repository [code-review-assistant-repo].

If the repository already exists (perhaps from a previous attempt), that's fine - you'll see an error message that you can ignore.

Step 8: Grant IAM Permissions

# Get your project number
PROJECT_NUMBER=$(gcloud projects describe $GOOGLE_CLOUD_PROJECT \
    --format="value(projectNumber)")

# Define the service account
SERVICE_ACCOUNT="${PROJECT_NUMBER}@cloudbuild.gserviceaccount.com"

# Grant necessary roles
gcloud projects add-iam-policy-binding $GOOGLE_CLOUD_PROJECT \
    --member="serviceAccount:${SERVICE_ACCOUNT}" \
    --role="roles/run.admin"

gcloud projects add-iam-policy-binding $GOOGLE_CLOUD_PROJECT \
    --member="serviceAccount:${SERVICE_ACCOUNT}" \
    --role="roles/iam.serviceAccountUser"

gcloud projects add-iam-policy-binding $GOOGLE_CLOUD_PROJECT \
    --member="serviceAccount:${SERVICE_ACCOUNT}" \
    --role="roles/cloudsql.admin"

gcloud projects add-iam-policy-binding $GOOGLE_CLOUD_PROJECT \
    --member="serviceAccount:${SERVICE_ACCOUNT}" \
    --role="roles/storage.admin"

Each command will output:

Updated IAM policy for project [your-project-id].

What You've Accomplished

Your production workspace is now fully prepared:

✅ Google Cloud project configured and authenticated
✅ Basic agent tested to understand its limitations
✅ Project code ready, with placeholders marking where you'll add code
✅ Dependencies isolated in a virtual environment
✅ All necessary APIs enabled
✅ Container registry ready for deployments
✅ IAM permissions properly configured
✅ Environment variables set correctly

Now you're ready to build a real AI system with deterministic tools, state management, and proper architecture.

4. Building Your First Agent

[Figure: building your first agent]

What Makes Tools Different from LLMs

When you ask an LLM "how many functions are in this code?", it relies on pattern matching and estimation. When you use a tool that calls Python's ast.parse(), it parses the actual syntax tree - no guessing, and the same result every time.

This section builds a tool that analyzes code structure deterministically, then connects it to an agent that knows when to invoke it.
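
To make "same result every time" concrete, here is a minimal sketch that uses only the standard library. The function name count_definitions and the sample source are illustrative assumptions, not part of the project scaffold.

```python
import ast

# Illustrative input: one module-level function and one class with a method.
SOURCE = '''
def add(a, b):
    return a + b

class Calculator:
    def multiply(self, x, y):
        return x * y
'''


def count_definitions(source: str) -> dict:
    """Count function and class definitions by walking the parsed AST."""
    tree = ast.parse(source)
    function_types = (ast.FunctionDef, ast.AsyncFunctionDef)
    return {
        "functions": sum(isinstance(n, function_types) for n in ast.walk(tree)),
        "classes": sum(isinstance(n, ast.ClassDef) for n in ast.walk(tree)),
    }


first = count_definitions(SOURCE)
second = count_definitions(SOURCE)
print(first)            # {'functions': 2, 'classes': 1}
print(first == second)  # True
```

Because the count comes from the parsed tree rather than from a model's reading of the text, repeated runs on the same input cannot disagree.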

Step 1: Understanding the Scaffold

Let's examine the structure you'll be filling in.

👉 Open code_review_assistant/tools.py

You'll see the analyze_code_structure function with placeholder comments marking where you'll add code. The function already has the basic structure - you'll enhance it step by step.

Step 2: Add State Storage

State storage lets other agents in the pipeline access your tool's results without re-running the analysis.

👉 Find:

        # MODULE_4_STEP_2_ADD_STATE_STORAGE

👉 Replace that line with this code:

        # Store code and analysis for other agents to access
        tool_context.state[StateKeys.CODE_TO_REVIEW] = code
        tool_context.state[StateKeys.CODE_ANALYSIS] = analysis
        tool_context.state[StateKeys.CODE_LINE_COUNT] = len(code.splitlines())

Return Value vs. State: What's the Difference?

Return value: What the LLM sees immediately. Use it for status messages, summaries, and information the LLM needs right now.
State storage: What other agents and tools can read later. Use it for detailed data that must persist through the pipeline.

Think of the return value as "speaking out loud" and state as "writing on a shared whiteboard" that every agent can see.
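
The split can be sketched without ADK at all. In the toy example below, FakeToolContext is a hypothetical stand-in that models only the state attribute of a tool context, and the string keys are illustrative; the point is simply which data goes where.

```python
import asyncio
from typing import Any, Dict


class FakeToolContext:
    """Hypothetical stand-in for a tool context; only `state` is modeled."""

    def __init__(self) -> None:
        self.state: Dict[str, Any] = {}


async def count_lines(code: str, tool_context: FakeToolContext) -> Dict[str, Any]:
    """Toy tool: the short summary is returned, the detail goes to state."""
    lines = code.splitlines()
    # "Writing on the whiteboard": detailed data for later agents and tools.
    tool_context.state["code_to_review"] = code
    tool_context.state["code_line_count"] = len(lines)
    # "Speaking out loud": the compact result the LLM sees right away.
    return {"status": "success", "summary": f"{len(lines)} lines received"}


ctx = FakeToolContext()
result = asyncio.run(count_lines("x = 1\ny = 2\n", ctx))
print(result["summary"])             # 2 lines received
print(ctx.state["code_line_count"])  # 2
```

The returned dictionary stays small on purpose, while the full source text is kept in state for whoever needs it next.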

Why Use StateKeys Constants?

Notice that we use StateKeys.CODE_TO_REVIEW instead of the string "code_to_review":

# Without constants - prone to typos
tool_context.state["code_to_review"] = code
tool_context.state.get("code_to_reveiw")  # Typo! Returns None silently

# With constants - typos caught by IDE
tool_context.state[StateKeys.CODE_TO_REVIEW] = code
tool_context.state[StateKeys.CODE_TO_REVEIW]  # Error immediately!

The constants are defined in code_review_assistant/constants.py:

class StateKeys:
    CODE_TO_REVIEW = "code_to_review"
    CODE_ANALYSIS = "code_analysis"
    # ... more keys

This prevents bugs that would otherwise only surface in production. When multiple agents share state (as in Module 5), a single typo in a string key can silently break the whole pipeline. With constants, the same typo becomes an error that your IDE flags immediately.
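
A small sketch of the difference, using a plain dict as a stand-in for session state (an assumption made to keep the example self-contained):

```python
class StateKeys:
    """Subset of the constants shown above."""
    CODE_TO_REVIEW = "code_to_review"
    CODE_ANALYSIS = "code_analysis"


state = {}  # plain dict standing in for session state
state[StateKeys.CODE_TO_REVIEW] = "def add(a, b): return a + b"

# A misspelled string key is a perfectly legal lookup: .get() returns None.
silent_miss = state.get("code_to_reveiw")

# A misspelled constant fails the moment the line runs.
try:
    StateKeys.CODE_TO_REVEIW
    typo_detected = False
except AttributeError:
    typo_detected = True

print(silent_miss, typo_detected)  # None True
```

The string typo produces a quiet None that only shows up later as missing data, whereas the attribute typo stops execution at the exact line that is wrong.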

Step 3: Add Async Parsing with Thread Pools

Our tool needs to parse the AST without blocking other operations. Let's add asynchronous execution using thread pools.

👉 Find:

        # MODULE_4_STEP_3_ADD_ASYNC

👉 Replace that line with this code:

        # Parse in thread pool to avoid blocking the event loop
        loop = asyncio.get_event_loop()
        with ThreadPoolExecutor() as executor:
            tree = await loop.run_in_executor(executor, ast.parse, code)

Building Non-Blocking Tools

This pattern keeps the tool from stalling other operations. Here's how each part works:

The async def function signature (already in the scaffold):

Allows the tool to use await
Lets ADK run multiple tools concurrently
Essential for building efficient, non-blocking agents. The ADK framework can wrap a standard synchronous function, but such a tool blocks all other concurrent operations until it completes. For production-ready agents, async def is the standard.

The run_in_executor pattern (what you added):

loop = asyncio.get_event_loop()
with ThreadPoolExecutor() as executor:
    tree = await loop.run_in_executor(executor, ast.parse, code)

Runs the CPU-heavy ast.parse in a separate thread
The await pauses this tool while the thread works
Other tools can run during that pause
Prevents the event loop from being blocked

Why both are needed:

# Just async def - still blocks everything!
async def my_tool():
    tree = ast.parse(code)  # Blocks for 100ms, nothing else runs

# With thread pool - work happens in background
async def my_tool():
    tree = await loop.run_in_executor(executor, ast.parse, code)
    # Other tools run while ast.parse works in the thread

This is ADK's recommended pattern for CPU-intensive operations, as documented in the performance guide.
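
You can observe the effect with a small experiment that does not depend on ADK. The sketch below is illustrative only: blocking_work uses time.sleep as a stand-in for a slow synchronous call, and heartbeat and run_demo are hypothetical names. It counts how often a background coroutine gets to run while the slow call is in progress.

```python
import asyncio
import time
from concurrent.futures import ThreadPoolExecutor


def blocking_work(seconds: float) -> str:
    """Stand-in for a slow synchronous call (time.sleep releases the GIL)."""
    time.sleep(seconds)
    return "done"


async def heartbeat(ticks: list) -> None:
    """Record a tick every 10 ms for as long as the event loop is free."""
    while True:
        ticks.append(time.perf_counter())
        await asyncio.sleep(0.01)


async def run_demo(use_executor: bool) -> int:
    """Return how many heartbeat ticks happened during the slow call."""
    ticks: list = []
    beat = asyncio.create_task(heartbeat(ticks))
    await asyncio.sleep(0)  # give the heartbeat a chance to start
    before = len(ticks)
    if use_executor:
        loop = asyncio.get_running_loop()
        with ThreadPoolExecutor(max_workers=1) as executor:
            await loop.run_in_executor(executor, blocking_work, 0.2)
    else:
        blocking_work(0.2)  # runs on the event loop thread and blocks it
    during = len(ticks) - before
    beat.cancel()
    return during


blocked = asyncio.run(run_demo(use_executor=False))
responsive = asyncio.run(run_demo(use_executor=True))
print(blocked, responsive)
```

The direct call leaves no room for the heartbeat, while the executor version lets it tick many times. One caveat: in CPython the global interpreter lock still limits how much pure CPU-bound work can overlap across threads, so the gain is clearest for calls that release the lock.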

Step 4: Extract Comprehensive Information

Now let's extract classes, imports, and detailed metrics - everything we need for a complete code review.

👉 Find:

        # MODULE_4_STEP_4_EXTRACT_DETAILS

👉 Replace that line with this code:

        # Extract comprehensive structural information
        analysis = await loop.run_in_executor(
            executor, _extract_code_structure, tree, code
        )

👉 Verify: the analyze_code_structure function in tools.py has a central body that looks like this:

# Parse in thread pool to avoid blocking the event loop
loop = asyncio.get_event_loop()
with ThreadPoolExecutor() as executor:
    tree = await loop.run_in_executor(executor, ast.parse, code)

    # Extract comprehensive structural information
    analysis = await loop.run_in_executor(
        executor, _extract_code_structure, tree, code
    )

# Store code and analysis for other agents to access
tool_context.state[StateKeys.CODE_TO_REVIEW] = code
tool_context.state[StateKeys.CODE_ANALYSIS] = analysis
tool_context.state[StateKeys.CODE_LINE_COUNT] = len(code.splitlines())

👉 Now scroll to the bottom of tools.py and find:

# MODULE_4_STEP_4_HELPER_FUNCTION

👉 Replace that line with the complete helper function:

def _extract_code_structure(tree: ast.AST, code: str) -> Dict[str, Any]:
    """
    Helper function to extract structural information from AST.
    Runs in thread pool for CPU-bound work.
    """
    functions = []
    classes = []
    imports = []
    docstrings = []

    for node in ast.walk(tree):
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef)):
            func_info = {
                'name': node.name,
                'args': [arg.arg for arg in node.args.args],
                'lineno': node.lineno,
                'has_docstring': ast.get_docstring(node) is not None,
                'is_async': isinstance(node, ast.AsyncFunctionDef),
                'decorators': [d.id for d in node.decorator_list
                               if isinstance(d, ast.Name)]
            }
            functions.append(func_info)

            if func_info['has_docstring']:
                docstrings.append(f"{node.name}: {ast.get_docstring(node)[:50]}...")

        elif isinstance(node, ast.ClassDef):
            methods = []
            for item in node.body:
                if isinstance(item, (ast.FunctionDef, ast.AsyncFunctionDef)):
                    methods.append(item.name)

            class_info = {
                'name': node.name,
                'lineno': node.lineno,
                'methods': methods,
                'has_docstring': ast.get_docstring(node) is not None,
                'base_classes': [base.id for base in node.bases
                                 if isinstance(base, ast.Name)]
            }
            classes.append(class_info)

        elif isinstance(node, ast.Import):
            for alias in node.names:
                imports.append({
                    'module': alias.name,
                    'alias': alias.asname,
                    'type': 'import'
                })
        elif isinstance(node, ast.ImportFrom):
            imports.append({
                'module': node.module or '',
                'names': [alias.name for alias in node.names],
                'type': 'from_import',
                'level': node.level
            })

    return {
        'functions': functions,
        'classes': classes,
        'imports': imports,
        'docstrings': docstrings,
        'metrics': {
            'line_count': len(code.splitlines()),
            'function_count': len(functions),
            'class_count': len(classes),
            'import_count': len(imports),
            'has_main': any(f['name'] == 'main' for f in functions),
            'has_if_main': '__main__' in code,
            'avg_function_length': _calculate_avg_function_length(tree)
        }
    }


def _calculate_avg_function_length(tree: ast.AST) -> float:
    """Calculate average function length in lines."""
    function_lengths = []

    for node in ast.walk(tree):
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef)):
            if hasattr(node, 'end_lineno') and hasattr(node, 'lineno'):
                length = node.end_lineno - node.lineno + 1
                function_lengths.append(length)

    if function_lengths:
        return sum(function_lengths) / len(function_lengths)
    return 0.0

What Information Are We Extracting?

For each function, we capture:

Name and arguments (for documentation)
Line number (for error reporting)
Whether it has a docstring (style check)
Decorators (identifying special patterns)

For classes, we extract:

Methods defined in the class
Base classes (inheritance analysis)
Documentation status

This detailed analysis lets the style checker, test runner, and feedback synthesizer provide specific, actionable insights.
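
One detail is worth knowing when walking the tree: in Python's ast module, ast.AsyncFunctionDef is a separate node class and not a subclass of ast.FunctionDef, so a check against ast.FunctionDef alone never matches async def functions. The following is a minimal sketch of per-function extraction that covers both kinds; summarize_functions and the sample source are illustrative assumptions, not project code.

```python
import ast

# Illustrative input: one regular function with a docstring, one coroutine.
SOURCE = '''
def cached(n):
    """Return n unchanged."""
    return n

async def fetch(url):
    return url
'''

FUNCTION_NODES = (ast.FunctionDef, ast.AsyncFunctionDef)


def summarize_functions(source: str) -> list:
    """Collect name, args, line, docstring flag and async flag per function."""
    summary = []
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, FUNCTION_NODES):
            summary.append({
                "name": node.name,
                "args": [arg.arg for arg in node.args.args],
                "lineno": node.lineno,
                "has_docstring": ast.get_docstring(node) is not None,
                "is_async": isinstance(node, ast.AsyncFunctionDef),
            })
    return summary


for info in summarize_functions(SOURCE):
    print(info["name"], info["args"], info["has_docstring"], info["is_async"])
```

Both functions are reported, and only fetch is flagged as async.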

Step 5: Connect to an Agent

Now we connect the tool to an agent that knows when to use it and how to interpret its results.

👉 Open code_review_assistant/sub_agents/review_pipeline/code_analyzer.py

👉 Find:

# MODULE_4_STEP_5_CREATE_AGENT

👉 Replace that line with the complete production agent:

code_analyzer_agent = Agent(
    name="CodeAnalyzer",
    model=config.worker_model,
    description="Analyzes Python code structure and identifies components",
    instruction="""You are a code analysis specialist responsible for understanding code structure.

Your task:
1. Take the code submitted by the user (it will be provided in the user message)
2. Use the analyze_code_structure tool to parse and analyze it
3. Pass the EXACT code to your tool - do not modify, fix, or "improve" it
4. Identify all functions, classes, imports, and structural patterns
5. Note any syntax errors or structural issues
6. Store the analysis in state for other agents to use

CRITICAL:
- Pass the code EXACTLY as provided to the analyze_code_structure tool
- Do not fix syntax errors, even if obvious
- Do not add missing imports or fix indentation
- The goal is to analyze what IS there, not what SHOULD be there

When calling the tool, pass the code as a string to the 'code' parameter.
If the analysis fails due to syntax errors, clearly report the error location and type.

Provide a clear summary including:
- Number of functions and classes found
- Key structural observations
- Any syntax errors or issues detected
- Overall code organization assessment""",
    tools=[FunctionTool(func=analyze_code_structure)],
    output_key="structure_analysis_summary"
)

Why Such Detailed Instructions?

The instruction's emphasis on "EXACT code" and "do not modify" is crucial because:

LLMs naturally want to be helpful and fix obvious errors
But this is a REVIEW agent, not a FIX agent - separation of concerns
We want an honest analysis of what is actually there
The fix pipeline (Module 6) will handle corrections

The numbered steps guide the LLM clearly through the workflow, reducing ambiguity about what to do and when.

Common Agent Mistake: Overly Helpful LLMs

Without the "EXACT code" instruction, you would get misleading results:

# User submits:
def add(a,b):return a+b  # Missing spaces, wrong style

# Without instruction, LLM "helpfully" calls tool with:
def add(a, b):
    return a + b

# Style checker analyzes the "fixed" code
# Reports: "Perfect! No issues found!"
# User gets completely wrong feedback

The explicit instruction prevents this by telling the LLM: your job is to analyze, not to improve. Pass along exactly what you receive.
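
The reason this matters can be shown with the standard library alone. In the sketch below (the helper names and the comma check are toy assumptions, not the project's style checker), the submitted code and its tidied version parse to the same AST, yet only the original text still contains the style problem.

```python
import ast

submitted = "def add(a,b):return a+b"
tidied = "def add(a, b):\n    return a + b\n"


def same_structure(left: str, right: str) -> bool:
    """True when both sources parse to the same AST (formatting ignored)."""
    return ast.dump(ast.parse(left)) == ast.dump(ast.parse(right))


def commas_without_space(source: str) -> int:
    """Toy style probe: count commas that are not followed by a space."""
    return sum(
        1 for i, ch in enumerate(source)
        if ch == "," and source[i + 1:i + 2] != " "
    )


print(same_structure(submitted, tidied))  # True
print(commas_without_space(submitted), commas_without_space(tidied))  # 1 0
```

Structural analysis would look identical either way, so a silently reformatted input is easy to miss; only a check on the exact submitted text can report the original formatting issue.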

Test Your Code Analyzer

Now verify that your analyzer works correctly.

👉 Run the test script:

python tests/test_code_analyzer.py

The test script automatically loads configuration from your .env file using python-dotenv, so you don't need to set environment variables manually.

Expected output:

INFO:code_review_assistant.config:Code Review Assistant Configuration Loaded:
INFO:code_review_assistant.config:  - GCP Project: your-project-id
INFO:code_review_assistant.config:  - Artifact Bucket: gs://your-project-artifacts
INFO:code_review_assistant.config:  - Models: worker=gemini-2.5-flash, critic=gemini-2.5-pro
Testing code analyzer...
INFO:code_review_assistant.tools:Tool: Analysis complete - 2 functions, 1 classes

=== Analyzer Response ===
The analysis of the provided code shows the following:

* **Functions Found:** 2
    * `add(a, b)`: A global function at line 2.
    * `multiply(self, x, y)`: A method within the `Calculator` class.

* **Classes Found:** 1
    * `Calculator`: A class defined at line 5. Contains one method, `multiply`.

* **Imports:** 0

* **Structural Patterns:** The code defines one global function and one class 
  with a single method. Both are simple, each with a single return statement.

* **Syntax Errors/Issues:** No syntax errors detected.

* **Overall Code Organization:** The code is well-organized for its small size, 
  clearly defining a function and a class with a method.

What just happened:

The test script loaded your .env configuration automatically
Your analyze_code_structure() tool parsed the code using Python's AST
The _extract_code_structure() helper extracted functions, classes, and metrics
Results were stored in session state using StateKeys constants
The Code Analyzer agent interpreted the results and provided a summary

Troubleshooting:

"No module named 'code_review_assistant'": Run pip install -e . from the project root
"Missing key inputs argument": Verify that your .env contains GOOGLE_CLOUD_PROJECT, GOOGLE_CLOUD_LOCATION, and GOOGLE_GENAI_USE_VERTEXAI=true

What You've Built

You now have a production-ready code analyzer that:

✅ Parses the actual Python AST - deterministic, not pattern matching
✅ Stores results in state - other agents can access the analysis
✅ Runs asynchronously - doesn't block other tools
✅ Extracts comprehensive information - functions, classes, imports, metrics
✅ Handles errors gracefully - reports syntax errors with line numbers
✅ Connects to an agent - the LLM knows when and how to use it

Key Concepts Mastered

Tools vs. Agents:

Tools do deterministic work (AST parsing)
Agents decide when to use tools and interpret the results

Return Value vs. State:

Return: what the LLM sees immediately
State: what persists for other agents

State Keys Constants:

Prevent typos in multi-agent systems
Act as contracts between agents
Critical when agents share data

Async + Thread Pools:

async def allows tools to pause execution
Thread pools run CPU-bound work in the background
Together they keep the event loop responsive

Helper Functions:

Separate synchronous helpers from async tools
Make code testable and reusable

Agent Instructions:

Detailed instructions prevent common LLM mistakes
Be explicit about what NOT to do (don't fix the code)
Spell out workflow steps clearly for consistency

What's Next?

In Module 5, you'll add:

A style checker that reads code from state
A test runner that actually executes tests
A feedback synthesizer that combines all the analysis

You'll see how state flows through a sequential pipeline, and why the constants pattern matters when multiple agents read and write the same data.

5. Building a Pipeline: Multiple Agents Working Together

[Figure: building a pipeline, multiple agents working together]

Introduction

In Module 4, you built a single agent that analyzes code structure. But a comprehensive code review requires more than parsing - you need style checking, test execution, and intelligent feedback synthesis.

This module builds a pipeline of 4 agents that work together sequentially, each contributing specialized analysis:

Code Analyzer (from Module 4) - Parses structure
Style Checker - Identifies style violations
Test Runner - Executes and validates tests
Feedback Synthesizer - Combines everything into actionable feedback

Key concept: state as the communication channel. Each agent reads what previous agents wrote to state, adds its own analysis, and passes the enriched state to the next agent. The constants pattern from Module 4 becomes critical when multiple agents share data.

Preview of what you'll build: submit messy code → watch state flow through 4 agents → receive a comprehensive report with personalized feedback based on past patterns.
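
Before wiring up real agents, it can help to see the state-passing idea in plain Python. The sketch below is a simplified model, not ADK code: each step is an ordinary function, the pipeline is a loop, and key names such as style_issue_count and final_feedback are hypothetical.

```python
from typing import Any, Callable, Dict, List

State = Dict[str, Any]


def analyzer(state: State) -> None:
    """First step: record a basic structural fact about the submitted code."""
    code = state["code_to_review"]
    state["code_line_count"] = len(code.splitlines())


def style_checker(state: State) -> None:
    """Second step: read the code from state, write a toy issue count."""
    code = state["code_to_review"]
    state["style_issue_count"] = sum(
        1 for line in code.splitlines() if len(line) > 79
    )


def synthesizer(state: State) -> None:
    """Last step: read what earlier steps wrote and build a summary."""
    state["final_feedback"] = (
        f"{state['code_line_count']} lines, "
        f"{state['style_issue_count']} style issues"
    )


def run_pipeline(steps: List[Callable[[State], None]], state: State) -> State:
    """Run each step in order against the same state dictionary."""
    for step in steps:
        step(state)
    return state


state = run_pipeline(
    [analyzer, style_checker, synthesizer],
    {"code_to_review": "def add(a, b):\n    return a + b\n"},
)
print(state["final_feedback"])  # 2 lines, 0 style issues
```

Notice that synthesizer never receives arguments from the other steps directly; it depends entirely on the keys they wrote, which is why consistent key names matter so much.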

Step 1: Add the Style Checker Tool + Agent

The style checker identifies PEP 8 violations using pycodestyle - a deterministic linter, not an LLM-based interpretation.
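
pycodestyle's default report prints one violation per line in the form path:row:col: CODE text. As a minimal sketch of how such a line can be turned into structured data, consider the parser below. The function name parse_report_line and the sample line are illustrative, and the sketch assumes a path without extra colons (so not a Windows drive path).

```python
from typing import Dict, Optional, Union


def parse_report_line(line: str) -> Optional[Dict[str, Union[int, str]]]:
    """Parse one 'path:row:col: CODE text' line; return None if malformed."""
    parts = line.split(":", 3)
    if len(parts) != 4:
        return None
    _path, row, col, rest = parts
    # The remainder looks like "E231 missing whitespace after ','".
    code, _, message = rest.strip().partition(" ")
    try:
        return {"line": int(row), "column": int(col),
                "code": code, "message": message}
    except ValueError:
        return None


# Illustrative report line, not captured from a real run.
sample = "example.py:1:10: E231 missing whitespace after ','"
print(parse_report_line(sample))
print(parse_report_line("not a report line"))  # None
```

Each parsed entry carries a line, a column, and a stable error code, which is what makes the linter's findings reproducible and easy to aggregate into a score.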

Add the Style Checker Tool

👉 Open code_review_assistant/tools.py

👉 Find:

# MODULE_5_STEP_1_STYLE_CHECKER_TOOL

👉 Replace that line with this code:

async def check_code_style(code: str, tool_context: ToolContext) -> Dict[str, Any]:
    """
    Checks code style compliance using pycodestyle (PEP 8).

    Args:
        code: Python source code to check (or will retrieve from state)
        tool_context: ADK tool context

    Returns:
        Dictionary containing style score and issues
    """
    logger.info("Tool: Checking code style...")

    try:
        # Retrieve code from state if not provided
        if not code:
            code = tool_context.state.get(StateKeys.CODE_TO_REVIEW, '')
            if not code:
                return {
                    "status": "error",
                    "message": "No code provided or found in state"
                }

        # Run style check in thread pool
        loop = asyncio.get_event_loop()
        with ThreadPoolExecutor() as executor:
            result = await loop.run_in_executor(
                executor, _perform_style_check, code
            )

        # Store results in state
        tool_context.state[StateKeys.STYLE_SCORE] = result['score']
        tool_context.state[StateKeys.STYLE_ISSUES] = result['issues']
        tool_context.state[StateKeys.STYLE_ISSUE_COUNT] = result['issue_count']

        logger.info(f"Tool: Style check complete - Score: {result['score']}/100, "
                    f"Issues: {result['issue_count']}")

        return result

    except Exception as e:
        error_msg = f"Style check failed: {str(e)}"
        logger.error(f"Tool: {error_msg}", exc_info=True)

        # Set default values on error
        tool_context.state[StateKeys.STYLE_SCORE] = 0
        tool_context.state[StateKeys.STYLE_ISSUES] = []

        return {
            "status": "error",
            "message": error_msg,
            "score": 0
        }

👉 Now scroll to the end of the file and find:

# MODULE_5_STEP_1_STYLE_HELPERS

👉 Replace that line with the helper functions:

def _perform_style_check(code: str) -> Dict[str, Any]:
    """Helper to perform style check in thread pool."""
    import io
    import sys

    with tempfile.NamedTemporaryFile(mode='w', suffix='.py', delete=False) as tmp:
        tmp.write(code)
        tmp_path = tmp.name

    try:
        # Capture stdout to get pycodestyle output. Note: sys.stdout is
        # process-wide, so concurrent checks would share this redirect.
        old_stdout = sys.stdout
        sys.stdout = captured_output = io.StringIO()

        try:
            style_guide = pycodestyle.StyleGuide(
                quiet=False,  # We want output
                max_line_length=100,
                ignore=['E501', 'W503']
            )

            style_guide.check_files([tmp_path])
        finally:
            # Restore stdout even if the check raises
            sys.stdout = old_stdout
        # Parse captured output
        output = captured_output.getvalue()
        issues = []

        for line in output.strip().split('\n'):
            if line and ':' in line:
                # Format: path:line:col: CODE message. Split at most 3 times
                # so colons inside the message text are preserved.
                parts = line.split(':', 3)
                if len(parts) == 4:
                    try:
                        text = parts[3].strip()
                        issues.append({
                            'line': int(parts[1]),
                            'column': int(parts[2]),
                            'code': text.split()[0] if text else 'E000',
                            'message': text or 'Unknown error'
                        })
                    except (ValueError, IndexError):
                        pass

        # Add naming convention checks
        try:
            tree = ast.parse(code)
            naming_issues = _check_naming_conventions(tree)
            issues.extend(naming_issues)
        except SyntaxError:
            pass  # Syntax errors will be caught elsewhere

        # Calculate weighted score
        score = _calculate_style_score(issues)

        return {
            "status": "success",
            "score": score,
            "issue_count": len(issues),
            "issues": issues[:10],  # First 10 issues
            "summary": f"Style score: {score}/100 with {len(issues)} violations"
        }

    finally:
        if os.path.exists(tmp_path):
            os.unlink(tmp_path)


def _check_naming_conventions(tree: ast.AST) -> List[Dict[str, Any]]:
    """Check PEP 8 naming conventions."""
    naming_issues = []

    for node in ast.walk(tree):
        if isinstance(node, ast.FunctionDef):
            # Skip private/protected methods and __main__
            if not node.name.startswith('_') and node.name != node.name.lower():
                naming_issues.append({
                    'line': node.lineno,
                    'column': node.col_offset,
                    'code': 'N802',
                    'message': f"N802 function name '{node.name}' should be lowercase"
                })
        elif isinstance(node, ast.ClassDef):
            # Check if class name follows CapWords convention
            if not node.name[0].isupper() or '_' in node.name:
                naming_issues.append({
                    'line': node.lineno,
                    'column': node.col_offset,
                    'code': 'N801',
                    'message': f"N801 class name '{node.name}' should use CapWords convention"
                })

    return naming_issues


def _calculate_style_score(issues: List[Dict[str, Any]]) -> int:
    """Calculate weighted style score based on violation severity."""
    if not issues:
        return 100

    # Define weights by error type
    weights = {
        'E1': 10,  # Indentation errors
        'E2': 3,  # Whitespace errors
        'E3': 5,  # Blank line errors
        'E4': 8,  # Import errors
        'E5': 5,  # Line length
        'E7': 7,  # Statement errors
        'E9': 10,  # Syntax errors
        'W2': 2,  # Whitespace warnings
        'W3': 2,  # Blank line warnings
        'W5': 3,  # Line break warnings
        'N8': 7,  # Naming conventions
    }

    total_deduction = 0
    for issue in issues:
        code_prefix = issue['code'][:2] if len(issue['code']) >= 2 else 'E2'
        weight = weights.get(code_prefix, 3)
        total_deduction += weight

    # Cap at 100 points deduction
    return max(0, 100 - min(total_deduction, 100))

Production Pattern: Helper Function Separation

Notice the structure:

Main tool (check_code_style): async; handles state management and error handling
Helper (_perform_style_check): synchronous, pure logic; runs in the thread pool
Sub-helpers (_check_naming_conventions, _calculate_style_score): focused utilities

This separation provides:

Testability: helpers can be tested independently
Reusability: other tools can use the same logic
Thread-pool friendly: synchronous helpers run in worker threads without blocking the event loop
Maintainability: each function has a single responsibility
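
A minimal sketch of this split, assuming nothing about the real tool beyond its shape. The names `count_long_lines` and `check_line_lengths` are hypothetical and not part of the codelab; the point is only that the synchronous helper can be called directly in a test, while the async wrapper hands it to a thread pool.

```python
import asyncio
from concurrent.futures import ThreadPoolExecutor
from typing import Any, Dict


def count_long_lines(code: str, limit: int = 79) -> Dict[str, Any]:
    """Sync helper (hypothetical): pure logic, no event loop, no shared state."""
    long_lines = [
        number
        for number, line in enumerate(code.splitlines(), start=1)
        if len(line) > limit
    ]
    return {"status": "success", "long_lines": long_lines}


async def check_line_lengths(code: str) -> Dict[str, Any]:
    """Async wrapper (hypothetical): offloads the helper to a worker thread."""
    loop = asyncio.get_running_loop()
    with ThreadPoolExecutor(max_workers=1) as executor:
        return await loop.run_in_executor(executor, count_long_lines, code)


# Line 1 is short; line 2 is 125 characters long.
sample = "x = 1\n" + "y = " + "1 + " * 30 + "1\n"

# The helper can be exercised directly, with no event loop involved.
direct = count_long_lines(sample)

# The wrapper returns the same result when awaited.
via_wrapper = asyncio.run(check_line_lengths(sample))
```

Because the logic lives in the plain function, a unit test never needs an ADK context or an event loop.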

The weighted scoring system (_calculate_style_score) penalizes serious violations (indentation, syntax) more heavily than minor ones (whitespace), which gives a more nuanced quality assessment than a simple violation count.
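
To make the arithmetic concrete, here is a small worked sketch. It reuses a subset of the weights listed above; the flat-penalty baseline `count_score` and its 5-point penalty are assumptions added only for comparison.

```python
from typing import List

# Subset of the weight table from _calculate_style_score
WEIGHTS = {"E1": 10, "E2": 3, "E7": 7, "N8": 7}
DEFAULT_WEIGHT = 3


def weighted_score(codes: List[str]) -> int:
    """Deduct a severity-based weight per violation, capped at 100 points."""
    deduction = sum(WEIGHTS.get(code[:2], DEFAULT_WEIGHT) for code in codes)
    return max(0, 100 - min(deduction, 100))


def count_score(codes: List[str], per_issue: int = 5) -> int:
    """Naive baseline (hypothetical): the same flat penalty for every violation."""
    return max(0, 100 - per_issue * len(codes))


whitespace_only = ["E231", "E225", "E203"]  # 3 + 3 + 3 = 9 points deducted
serious = ["E111", "E701", "N802"]          # 10 + 7 + 7 = 24 points deducted

print(weighted_score(whitespace_only), weighted_score(serious))
print(count_score(whitespace_only), count_score(serious))
```

Both submissions have three violations, so a flat count scores them identically, while the weighted version separates them.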

State Retrieval Pattern

The tool checks whether code was passed in and, if it was not, retrieves it from state:

if not code:
    code = tool_context.state.get(StateKeys.CODE_TO_REVIEW, '')

This makes the tool flexible:

Pipeline use: the agent reads from state automatically
Standalone use: code can be passed in directly for testing
Error handling: confirms that code exists before processing

This pattern appears throughout production tools: always provide fallbacks for the different ways a tool can be used.
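
The three cases can be sketched in isolation. This is an illustration only: a plain dict stands in for tool_context.state, `resolve_code` is a hypothetical name, and the key "code_to_review" is an assumption about what StateKeys.CODE_TO_REVIEW resolves to.

```python
from typing import Any, Dict


def resolve_code(code: str, state: Dict[str, Any]) -> Dict[str, Any]:
    """Prefer the explicit argument, fall back to state, then report an error."""
    if not code:
        # Assumed key name; the real tool uses StateKeys.CODE_TO_REVIEW
        code = state.get("code_to_review", "")
    if not code:
        return {"status": "error", "message": "No code provided or found in state"}
    return {"status": "success", "code": code}


pipeline_state = {"code_to_review": "def add(a, b):\n    return a + b\n"}

from_state = resolve_code("", pipeline_state)        # pipeline use
from_argument = resolve_code("x = 1", pipeline_state)  # standalone use
missing = resolve_code("", {})                       # nothing available
```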

Adding the Style Checker Agent

👉 Open

code_review_assistant/sub_agents/review_pipeline/style_checker.py

👉 Find:

# MODULE_5_STEP_1_INSTRUCTION_PROVIDER

👉 Replace that line with this code:

async def style_checker_instruction_provider(context: ReadonlyContext) -> str:
    """Dynamic instruction provider that injects state variables."""
    template = """You are a code style expert focused on PEP 8 compliance.

Your task:
1. Use the check_code_style tool to validate PEP 8 compliance
2. The tool will retrieve the ORIGINAL code from state automatically
3. Report violations exactly as found
4. Present the results clearly and confidently

CRITICAL:
- The tool checks the code EXACTLY as provided by the user
- Do not suggest the code was modified or fixed
- Report actual violations found in the original code
- If there are style issues, they should be reported honestly

Call the check_code_style tool with an empty string for the code parameter,
as the tool will retrieve the code from state automatically.

When presenting results based on what the tool returns:
- State the exact score from the tool results
- If score >= 90: "Excellent style compliance!"
- If score 70-89: "Good style with minor improvements needed"
- If score 50-69: "Style needs attention"
- If score < 50: "Significant style improvements needed"

List the specific violations found (the tool will provide these):
- Show line numbers, error codes, and messages
- Focus on the top 10 most important issues

Previous analysis: {structure_analysis_summary}

Format your response as:
## Style Analysis Results
- Style Score: [exact score]/100
- Total Issues: [count]
- Assessment: [your assessment based on score]

## Top Style Issues
[List issues with line numbers and descriptions]

## Recommendations
[Specific fixes for the most critical issues]"""

    return await instructions_utils.inject_session_state(template, context)

👉 Find:

# MODULE_5_STEP_1_STYLE_CHECKER_AGENT

👉 Replace that line with this code:

style_checker_agent = Agent(
    name="StyleChecker",
    model=config.worker_model,
    description="Checks Python code style against PEP 8 guidelines",
    instruction=style_checker_instruction_provider,
    tools=[FunctionTool(func=check_code_style)],
    output_key="style_check_summary"
)

Dynamic Instruction Providers

Notice the pattern:

async def style_checker_instruction_provider(context: ReadonlyContext) -> str:
    template = """..."""
    return await instructions_utils.inject_session_state(template, context)

Why dynamic instructions rather than static ones?

Static (what you might expect):

instruction="Check the code style and report issues"

Problem: generic, with no context about what the previous agents found

Dynamic (production pattern):

instruction=style_checker_instruction_provider

Runs on every invocation, so it picks up the latest state
Injects values such as {structure_analysis_summary} from state
Adapts the instructions to the current review context
The LLM sees concrete data about this specific code

The call to instructions_utils.inject_session_state replaces {key_name} placeholders with the actual values from context.state.
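
The substitution idea can be sketched with a toy stand-in. This is not ADK's implementation: `fill_template` is a hypothetical helper, and raising KeyError on a missing key is a choice made for this sketch rather than a statement about how inject_session_state behaves.

```python
import re
from typing import Any, Mapping

# Matches {identifier} placeholders; braces around other text are left alone
_PLACEHOLDER = re.compile(r"\{([A-Za-z_][A-Za-z0-9_]*)\}")


def fill_template(template: str, state: Mapping[str, Any]) -> str:
    """Toy stand-in for state injection (hypothetical, not the ADK function)."""

    def replace(match: "re.Match[str]") -> str:
        key = match.group(1)
        if key not in state:
            raise KeyError(f"State key not found: {key}")
        return str(state[key])

    return _PLACEHOLDER.sub(replace, template)


state = {"structure_analysis_summary": "2 functions, 0 classes"}
prompt = fill_template("Previous analysis: {structure_analysis_summary}", state)
print(prompt)
```

Because the provider runs on each invocation, the same template yields a different prompt whenever the state changes.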

Step 2: Add the Test Runner Agent

The test runner generates comprehensive tests and executes them with the built-in code executor.

👉 Open

code_review_assistant/sub_agents/review_pipeline/test_runner.py

👉 Find:

# MODULE_5_STEP_2_INSTRUCTION_PROVIDER

👉 Replace that line with this code:

async def test_runner_instruction_provider(context: ReadonlyContext) -> str:
    """Dynamic instruction provider that injects the code_to_review directly."""
    template = """You are a testing specialist who creates and runs tests for Python code.

THE CODE TO TEST IS:
{code_to_review}

YOUR TASK:
1. Understand what the function appears to do based on its name and structure
2. Generate comprehensive tests (15-20 test cases)
3. Execute the tests using your code executor
4. Analyze results to identify bugs vs expected behavior
5. Output a detailed JSON analysis

TESTING METHODOLOGY:
- Test with the most natural interpretation first
- When something fails, determine if it's a bug or unusual design
- Test edge cases, boundaries, and error scenarios
- Document any surprising behavior

Execute your tests and output ONLY valid JSON with this structure:
- "test_summary": object with "total_tests_run", "tests_passed", "tests_failed", "tests_with_errors", "critical_issues_found"
- "critical_issues": array of objects, each with "type", "description", "example_input", "expected_behavior", "actual_behavior", "severity"
- "test_categories": object with "basic_functionality", "edge_cases", "error_handling" (each containing "passed", "failed", "errors" counts)
- "function_behavior": object with "apparent_purpose", "actual_interface", "unexpected_requirements"
- "verdict": object with "status" (WORKING/BUGGY/BROKEN), "confidence" (high/medium/low), "recommendation"

Do NOT output the test code itself, only the JSON analysis."""

    return await instructions_utils.inject_session_state(template, context)

👉 Find:

# MODULE_5_STEP_2_TEST_RUNNER_AGENT

👉 Replace that line with this code:

test_runner_agent = Agent(
    name="TestRunner",
    model=config.critic_model,
    description="Generates and runs tests for Python code using safe code execution",
    instruction=test_runner_instruction_provider,
    code_executor=BuiltInCodeExecutor(),
    output_key="test_execution_summary"
)

Why the Critic Model for Testing?

Notice model=config.critic_model instead of worker_model:

test_runner_agent = Agent(
    model=config.critic_model,  # More capable model
    ...
)

Worker vs. critic model selection:

Worker (gemini-2.5-flash): fast, cheaper, suited to mechanical tasks
Critic (gemini-2.5-pro): slower, more expensive, better reasoning

Testing requires:

Understanding the function's intent from its name and structure
Generating 15 to 20 meaningful test cases
Distinguishing bugs from design choices
Analyzing failure patterns

This level of reasoning justifies the more capable (and more expensive) model. The analyzer and the style checker use the worker model because their tasks are more mechanical.

The Power of Code Execution

BuiltInCodeExecutor is what separates a real code reviewer from an AI that only talks about code. When the TestRunner agent generates test cases, it actually executes them in a secure Python sandbox. This means:

Real validation: tests actually run, catching runtime errors that static analysis misses
Proof, not guesswork: when it reports "TypeError on line 4", that is because it ran the code and saw the error
Consistent results: the same tests produce the same results every time
Safe execution: the sandbox is isolated from your system

The built-in executor suits our use case well: testing pure algorithmic code such as data structures, sorting algorithms, and computational logic.
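
As an illustration of the kind of pure-Python check such a sandbox can run, the sketch below executes a few cases against the dfs_search_v1 function from the opening of this codelab and tallies them using the test_summary field names from the prompt above. The `run_cases` harness and the test cases themselves are assumptions; the agent writes its own tests.

```python
from typing import Any, Callable, Dict, List, Tuple


def dfs_search_v1(graph, start, target):
    """The function from the opening of this codelab, bug included."""
    visited = set()
    stack = start  # a bare node instead of a list
    while stack:
        current = stack.pop()
        if current == target:
            return True
        if current not in visited:
            visited.add(current)
            for neighbor in graph[current]:
                if neighbor not in visited:
                    stack.append(neighbor)
    return False


def run_cases(func: Callable[..., Any],
              cases: List[Tuple[tuple, Any]]) -> Dict[str, int]:
    """Hypothetical harness: run each case and tally pass, fail, and crash."""
    summary = {"total_tests_run": 0, "tests_passed": 0,
               "tests_failed": 0, "tests_with_errors": 0}
    for args, expected in cases:
        summary["total_tests_run"] += 1
        try:
            actual = func(*args)
        except Exception:
            summary["tests_with_errors"] += 1  # the code crashed
            continue
        if actual == expected:
            summary["tests_passed"] += 1
        else:
            summary["tests_failed"] += 1       # wrong output
    return summary


graph = {"A": ["B"], "B": ["C"], "C": [], "D": []}
cases = [
    ((graph, "A", "C"), True),    # natural call: start is a node
    ((graph, "A", "D"), False),
    ((graph, "A", "A"), True),
    ((graph, ["A"], "C"), True),  # only works when start is wrapped in a list
]
summary = run_cases(dfs_search_v1, cases)
print(summary)
```

The three natural calls crash because a string has no pop method, and only the list-wrapped call passes. That is the sort of surprising requirement the "function_behavior" section of the JSON output is meant to capture.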

Understanding Sandbox Limitations

The BuiltInCodeExecutor sandbox has intentional limitations:

No package installation: pip install is not available
No network access: code cannot make HTTP requests or call APIs
Limited execution time: long-running code times out

These are not defects; they are security features. By restricting the sandbox to pure Python, we can run untrusted code safely.

Production alternative: for production deployments on GKE, consider GkeCodeExecutor, which runs code in isolated Kubernetes pods with gVisor.

Step 3: Understanding Memory for Cross-Session Learning

Before building the feedback synthesizer, you need to understand the difference between state and memory: two storage mechanisms for two different purposes.

State vs. Memory: The Key Distinction

Let's make this concrete with a code review example:

State (current session only):

# Data from THIS review session
tool_context.state[StateKeys.STYLE_ISSUES] = [
    {"line": 5, "code": "E231", "message": "missing whitespace"},
    {"line": 12, "code": "E701", "message": "multiple statements"}
]

Scope: this conversation only
Purpose: passing data between agents in the current pipeline
Lives in: the Session object
Lifetime: discarded when the session ends

Memory (all past sessions):

# Learned from 50 previous reviews
"User frequently forgets docstrings on helper functions"
"User tends to write long functions (avg 45 lines)"
"User improved error handling after feedback in session #23"

Scope: all past sessions for this user
Purpose: learning patterns and providing personalized feedback
Lives in: MemoryService
Lifetime: persists across sessions and is searchable

Why feedback needs both:

Imagine the synthesizer generating feedback:

Using state only (current review):

"Function `calculate_total` has no docstring."

Generic, mechanical feedback.

Using state + memory (current + past patterns):

"Function `calculate_total` has no docstring. This is the 4th review
where helper functions lacked documentation. Consider adding docstrings
as you write functions, not afterwards - you mentioned in our last
session that you find it easier that way."

Personalized, contextual feedback that references improvement over time.

For production deployments, you have these options:

Option 1: VertexAiMemoryBankService (Advanced)

What it does: uses an LLM to extract meaningful facts from conversations
Search: semantic search (understands meaning, not just keywords)
Memory management: automatically consolidates and updates memories over time
Requires: a Google Cloud project + Agent Engine setup
Use when: you want sophisticated, evolving, personalized memories
Example: "User prefers functional programming" (extracted from 10 conversations about code style)

Option 2: Continue with InMemoryMemoryService + Persistent Sessions

What it does: stores the full conversation history for keyword search
Search: basic keyword matching across past sessions
Memory management: you control what gets stored (via add_session_to_memory)
Requires: only a persistent SessionService (such as VertexAiSessionService or DatabaseSessionService)
Use when: you need simple search across past conversations without LLM processing
Example: searching for "docstring" returns every session that mentions the word
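
To show what "basic keyword matching" means in practice, here is a toy store. `KeywordMemory` is hypothetical and is not how InMemoryMemoryService is implemented; it only illustrates why a keyword search finds exact words but misses related ones, which is the gap semantic search in Option 1 is meant to close.

```python
from typing import Dict, List


class KeywordMemory:
    """Toy keyword store (hypothetical; not ADK's InMemoryMemoryService)."""

    def __init__(self) -> None:
        self._sessions: Dict[str, List[str]] = {}

    def add_session(self, session_id: str, messages: List[str]) -> None:
        """Keep the full text of a finished session."""
        self._sessions[session_id] = list(messages)

    def search(self, query: str) -> List[str]:
        """Return ids of sessions that share at least one word with the query."""
        words = set(query.lower().split())
        hits = []
        for session_id, messages in self._sessions.items():
            if any(words & set(message.lower().split()) for message in messages):
                hits.append(session_id)
        return hits


memory = KeywordMemory()
memory.add_session("s1", ["Add a docstring to parse_input"])
memory.add_session("s2", ["Rename the loop variable"])
memory.add_session("s3", ["Missing docstring again"])

exact = memory.search("docstring")        # word appears in s1 and s3
related = memory.search("documentation")  # related meaning, but no shared word
```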

Understanding the Service Relationship

Think of it this way:

SessionService (manages conversations):

Stores: events and state for the current conversation
Examples: VertexAiSessionService, DatabaseSessionService, InMemorySessionService
Use: persisting the current conversation

MemoryService (cross-session knowledge):

Stores: information drawn from past conversations
Examples:
- InMemoryMemoryService: stores the full history, keyword search
- VertexAiMemoryBankService: knowledge extraction, semantic search
Use: retrieving context from past sessions

They work together:

# After code review completes
session = await session_service.get_session(...)

# Add session to memory for future reference
await memory_service.add_session_to_memory(session)

# Future reviews can search memory
results = await memory_service.search_memory(
    app_name=session.app_name, user_id=session.user_id, query="docstring patterns"
)

You can use VertexAiSessionService without Memory Bank (session persistence only), but Memory Bank needs sessions to extract from.

How Memory Gets Populated

After each code review completes:

# At the end of a session (typically in your application code)
await memory_service.add_session_to_memory(session)

What happens:

InMemoryMemoryService: stores the full session events for keyword search
VertexAiMemoryBankService: an LLM extracts key facts and merges them with existing memories

Future sessions can then query:

# In a tool, search for relevant past feedback
results = await tool_context.search_memory("feedback about docstrings")

State, Memory, and Artifacts: When to Use Each

You now have three storage mechanisms:

State:

Type: structured data (dicts, lists, numbers, strings)
Example: {"style_score": 75, "test_pass_rate": 0.8}
Use when: other agents in this pipeline need the data
Access: tool_context.state[StateKeys.STYLE_SCORE]

Memory:

Type: searchable text from past sessions
Example: "User struggles with error handling in API calls"
Use when: learning patterns to improve future sessions
Access: await tool_context.search_memory("error handling patterns")

Artifacts:

Type: binary files (PDFs, images, Excel files)
Example: the final code review report with formatting
Use when: users need to download or view files
Access: await tool_context.save_artifact("report.pdf", pdf_part)

The synthesizer uses all three:

Reads state to get the current analysis
Searches memory for past patterns
Saves an artifact for the final report

Step 4: Add the Feedback Synthesizer Tools and Agent

The feedback synthesizer is the most sophisticated agent in the pipeline. It orchestrates three tools, uses dynamic instructions, and combines state, memory, and artifacts.

Add the Three Synthesizer Tools

👉 Open

code_review_assistant/tools.py

👉 Find:

# MODULE_5_STEP_4_SEARCH_PAST_FEEDBACK

👉 Replace with Tool 1 - Memory Search (production version):

async def search_past_feedback(developer_id: str, tool_context: ToolContext) -> Dict[str, Any]:
    """
    Search for past feedback in memory service.

    Args:
        developer_id: ID of the developer (defaults to "default_user")
        tool_context: ADK tool context with potential memory service access

    Returns:
        Dictionary containing feedback search results
    """
    logger.info(f"Tool: Searching for past feedback for developer {developer_id}...")

    try:
        # Default developer ID if not provided
        if not developer_id:
            developer_id = tool_context.state.get(StateKeys.USER_ID, 'default_user')

        # Check if memory service is available
        if hasattr(tool_context, 'search_memory'):
            try:
                # Perform structured searches
                queries = [
                    f"developer:{developer_id} code review feedback",
                    f"developer:{developer_id} common issues",
                    f"developer:{developer_id} improvements"
                ]

                all_feedback = []
                patterns = {
                    'common_issues': [],
                    'improvements': [],
                    'strengths': []
                }

                for query in queries:
                    search_result = await tool_context.search_memory(query)

                    if search_result and hasattr(search_result, 'memories'):
                        for memory in search_result.memories[:5]:
                            memory_text = memory.text if hasattr(memory, 'text') else str(memory)
                            all_feedback.append(memory_text)

                            # Extract patterns
                            if 'style' in memory_text.lower():
                                patterns['common_issues'].append('style compliance')
                            if 'improved' in memory_text.lower():
                                patterns['improvements'].append('showing improvement')
                            if 'excellent' in memory_text.lower():
                                patterns['strengths'].append('consistent quality')

                # Store in state
                tool_context.state[StateKeys.PAST_FEEDBACK] = all_feedback
                tool_context.state[StateKeys.FEEDBACK_PATTERNS] = patterns

                logger.info(f"Tool: Found {len(all_feedback)} past feedback items")

                return {
                    "status": "success",
                    "feedback_found": bool(all_feedback),
                    "count": len(all_feedback),
                    "summary": " | ".join(all_feedback[:3]) if all_feedback else "No feedback",
                    "patterns": patterns
                }

            except Exception as e:
                logger.warning(f"Tool: Memory search error: {e}")

        # Fallback: Check state for cached feedback
        cached_feedback = tool_context.state.get(StateKeys.USER_PAST_FEEDBACK_CACHE, [])
        if cached_feedback:
            tool_context.state[StateKeys.PAST_FEEDBACK] = cached_feedback
            return {
                "status": "success",
                "feedback_found": True,
                "count": len(cached_feedback),
                "summary": "Using cached feedback",
                "patterns": {}
            }

        # No feedback found
        tool_context.state[StateKeys.PAST_FEEDBACK] = []
        logger.info("Tool: No past feedback found")

        return {
            "status": "success",
            "feedback_found": False,
            "message": "No past feedback available - this appears to be a first submission",
            "patterns": {}
        }

    except Exception as e:
        error_msg = f"Feedback search error: {str(e)}"
        logger.error(f"Tool: {error_msg}", exc_info=True)

        tool_context.state[StateKeys.PAST_FEEDBACK] = []

        return {
            "status": "error",
            "message": error_msg,
            "feedback_found": False
        }

Production Pattern: Graceful Degradation

Notice the three-tier fallback strategy:

# 1. Try memory service if available
if hasattr(tool_context, 'search_memory'):
    # Search multiple queries, extract patterns

# 2. Fall back to cached feedback in state
cached_feedback = tool_context.state.get(StateKeys.USER_PAST_FEEDBACK_CACHE, [])
if cached_feedback:
    # Use cached data

# 3. Gracefully handle no feedback
return {"feedback_found": False, "message": "...first submission"}

This pattern ensures the tool never crashes the pipeline:

Memory service unavailable? Use the cache
Cache empty? Return "no feedback found"
Always returns a valid response and never raises an exception to the caller

Production tools prioritize resilience over perfection.
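
The fallback chain can be exercised without ADK. In this sketch `find_feedback`, the injected `search` callable, the "past_feedback_cache" key, and the "source" field are all assumptions made for illustration; only the three-tier order mirrors the tool above.

```python
from typing import Any, Callable, Dict, List, Optional


def find_feedback(search: Optional[Callable[[str], List[str]]],
                  state: Dict[str, Any],
                  query: str) -> Dict[str, Any]:
    """Try memory, then the cache in state, then report that nothing was found."""
    # Tier 1: memory service, if one was supplied
    if search is not None:
        try:
            items = search(query)
            return {"status": "success", "feedback_found": bool(items),
                    "source": "memory", "items": items}
        except Exception:
            pass  # fall through to the next tier instead of failing

    # Tier 2: cached feedback in state (assumed key name)
    cached = state.get("past_feedback_cache", [])
    if cached:
        return {"status": "success", "feedback_found": True,
                "source": "cache", "items": cached}

    # Tier 3: nothing available, but still a valid response
    return {"status": "success", "feedback_found": False,
            "source": "none", "items": []}


def broken_search(query: str) -> List[str]:
    """Simulates an unavailable memory service."""
    raise RuntimeError("memory service unavailable")


from_memory = find_feedback(lambda q: ["Watch line length"], {}, "style")
from_cache = find_feedback(broken_search,
                           {"past_feedback_cache": ["Add docstrings"]}, "style")
nothing = find_feedback(broken_search, {}, "style")
```

Every path returns a dictionary with the same keys, so the calling agent never has to handle an exception.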

👉 Find:

# MODULE_5_STEP_4_UPDATE_GRADING_PROGRESS

👉 Replace with Tool 2 - Grading Tracker (production version):

async def update_grading_progress(tool_context: ToolContext) -> Dict[str, Any]:
    """
    Updates grading progress counters and metrics in state.
    """
    logger.info("Tool: Updating grading progress...")

    try:
        current_time = datetime.now().isoformat()

        # Build all state changes
        state_updates = {}

        # Temporary (invocation-level) state
        state_updates[StateKeys.TEMP_PROCESSING_TIMESTAMP] = current_time

        # Session-level state
        attempts = tool_context.state.get(StateKeys.GRADING_ATTEMPTS, 0) + 1
        state_updates[StateKeys.GRADING_ATTEMPTS] = attempts
        state_updates[StateKeys.LAST_GRADING_TIME] = current_time

        # User-level persistent state
        lifetime_submissions = tool_context.state.get(StateKeys.USER_TOTAL_SUBMISSIONS, 0) + 1
        state_updates[StateKeys.USER_TOTAL_SUBMISSIONS] = lifetime_submissions
        state_updates[StateKeys.USER_LAST_SUBMISSION_TIME] = current_time

        # Calculate improvement metrics
        current_style_score = tool_context.state.get(StateKeys.STYLE_SCORE, 0)
        last_style_score = tool_context.state.get(StateKeys.USER_LAST_STYLE_SCORE, 0)
        score_improvement = current_style_score - last_style_score

        state_updates[StateKeys.USER_LAST_STYLE_SCORE] = current_style_score
        state_updates[StateKeys.SCORE_IMPROVEMENT] = score_improvement

        # Track test results if available
        test_results = tool_context.state.get(StateKeys.TEST_EXECUTION_SUMMARY, {})

        # Parse if it's a string
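        if isinstance(test_results, str):
            try:
                test_results = json.loads(test_results)
            except json.JSONDecodeError:
                test_results = {}
        # Guard against valid JSON that is not an object (for example a list)
        if not isinstance(test_results, dict):
            test_results = {}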

        if test_results and test_results.get('test_summary', {}).get('total_tests_run', 0) > 0:
            summary = test_results['test_summary']
            total = summary.get('total_tests_run', 0)
            passed = summary.get('tests_passed', 0)
            if total > 0:
                pass_rate = (passed / total) * 100
                state_updates[StateKeys.USER_LAST_TEST_PASS_RATE] = pass_rate

        # Apply all collected updates in one pass (not a true atomic write)
        for key, value in state_updates.items():
            tool_context.state[key] = value

        logger.info(f"Tool: Progress updated - Attempt #{attempts}, "
                    f"Lifetime: {lifetime_submissions}")

        return {
            "status": "success",
            "session_attempts": attempts,
            "lifetime_submissions": lifetime_submissions,
            "timestamp": current_time,
            "improvement": {
                "style_score_change": score_improvement,
                "direction": ("improved" if score_improvement > 0
                              else "declined" if score_improvement < 0
                              else "unchanged")
            },
            "summary": f"Attempt #{attempts} recorded, {lifetime_submissions} total submissions"
        }

    except Exception as e:
        error_msg = f"Progress update error: {str(e)}"
        logger.error(f"Tool: {error_msg}", exc_info=True)

        return {
            "status": "error",
            "message": error_msg
        }

Production Pattern: Multi-Tier State Management

This tool demonstrates ADK's three-tier state model:

# Temporary (invocation-level) - cleared after this turn
state_updates[StateKeys.TEMP_PROCESSING_TIMESTAMP] = current_time

# Session-level - persists during this conversation
state_updates[StateKeys.GRADING_ATTEMPTS] = attempts

# User-level - persists across all sessions
state_updates[StateKeys.USER_TOTAL_SUBMISSIONS] = lifetime_submissions

Why three tiers?

Temporary: debugging info and timestamps, not needed after this turn
Session: data for the current review, needed until the review ends
User: lifetime metrics, needed for personalization across sessions
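
In ADK, the tier of a state key is selected by its prefix: "temp:" for the current invocation, "user:" for the user across sessions, and no prefix for the session (there is also an "app:" prefix, which this tool does not use). The sketch below classifies keys by that convention. The example key strings are assumptions about what the StateKeys constants contain, since their values are not shown here.

```python
def state_tier(key: str) -> str:
    """Classify a state key by its ADK scope prefix."""
    if key.startswith("temp:"):
        return "temporary"
    if key.startswith("user:"):
        return "user"
    if key.startswith("app:"):
        return "app"
    return "session"


# Assumed values for illustration; check your StateKeys definitions
example_keys = [
    "temp:processing_timestamp",
    "grading_attempts",
    "user:total_submissions",
]
tiers = {key: state_tier(key) for key in example_keys}
print(tiers)
```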

The tool calculates improvement by comparing the current score with the score from the previous submission:

current_style_score = tool_context.state.get(StateKeys.STYLE_SCORE, 0)
last_style_score = tool_context.state.get(StateKeys.USER_LAST_STYLE_SCORE, 0)
score_improvement = current_style_score - last_style_score

This enables feedback such as "Your style improved by 15 points since the last review!"
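
A small sketch of turning that difference into a message. `describe_change` and its wording are hypothetical. Note one consequence of the default of 0 used above: on a first submission there is no previous score, so a score of 80 would read as a gain of 80 points unless that case is handled separately.

```python
def describe_change(current: int, previous: int) -> str:
    """Turn a score difference into a short feedback sentence (hypothetical)."""
    change = current - previous
    if change > 0:
        return f"Your style improved by {change} points since the last review!"
    if change < 0:
        return f"Your style score dropped by {-change} points since the last review."
    return "Your style score is unchanged since the last review."


improved = describe_change(90, 75)
declined = describe_change(60, 75)
unchanged = describe_change(75, 75)
```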

👉 Find:

# MODULE_5_STEP_4_SAVE_GRADING_REPORT

👉 Replace with Tool 3 - Artifact Saver (production version):

async def save_grading_report(feedback_text: str, tool_context: ToolContext) -> Dict[str, Any]:
    """
    Saves a detailed grading report as an artifact.

    Args:
        feedback_text: The feedback text to include in the report
        tool_context: ADK tool context for state management

    Returns:
        Dictionary containing save status and details
    """
    logger.info("Tool: Saving grading report...")

    try:
        # Gather all relevant data from state
        code = tool_context.state.get(StateKeys.CODE_TO_REVIEW, '')
        analysis = tool_context.state.get(StateKeys.CODE_ANALYSIS, {})
        style_score = tool_context.state.get(StateKeys.STYLE_SCORE, 0)
        style_issues = tool_context.state.get(StateKeys.STYLE_ISSUES, [])

        # Get test results
        test_results = tool_context.state.get(StateKeys.TEST_EXECUTION_SUMMARY, {})

        # Parse if it's a string
        if isinstance(test_results, str):
            try:
                test_results = json.loads(test_results)
            except json.JSONDecodeError:
                test_results = {}

        timestamp = datetime.now().isoformat()

        # Create comprehensive report dictionary
        report = {
            'timestamp': timestamp,
            'grading_attempt': tool_context.state.get(StateKeys.GRADING_ATTEMPTS, 1),
            'code': {
                'content': code,
                'line_count': len(code.splitlines()),
                'hash': hashlib.md5(code.encode()).hexdigest()
            },
            'analysis': analysis,
            'style': {
                'score': style_score,
                'issues': style_issues[:5]  # First 5 issues
            },
            'tests': test_results,
            'feedback': feedback_text,
            'improvements': {
                'score_change': tool_context.state.get(StateKeys.SCORE_IMPROVEMENT, 0),
                'from_last_score': tool_context.state.get(StateKeys.USER_LAST_STYLE_SCORE, 0)
            }
        }

        # Convert report to JSON string
        report_json = json.dumps(report, indent=2)
        report_part = types.Part.from_text(text=report_json)

        # Try to save as artifact if the service is available
        if hasattr(tool_context, 'save_artifact'):
            try:
                # Generate filename with timestamp (replace colons for filesystem compatibility)
                filename = f"grading_report_{timestamp.replace(':', '-')}.json"

                # Save the main report
                version = await tool_context.save_artifact(filename, report_part)

                # Also save a "latest" version for easy access
                await tool_context.save_artifact("latest_grading_report.json", report_part)

                logger.info(f"Tool: Report saved as {filename} (version {version})")

                # Store report in state as well for redundancy
                tool_context.state[StateKeys.USER_LAST_GRADING_REPORT] = report

                return {
                    "status": "success",
                    "artifact_saved": True,
                    "filename": filename,
                    "version": str(version),
                    "size": len(report_json),
                    "summary": f"Report saved as {filename}"
                }

            except Exception as artifact_error:
                logger.warning(f"Artifact service error: {artifact_error}, falling back to state storage")
                # Continue to fallback below

        # Fallback: Store in state if artifact service is not available or failed
        tool_context.state[StateKeys.USER_LAST_GRADING_REPORT] = report
        logger.info("Tool: Report saved to state (artifact service not available)")

        return {
            "status": "success",
            "artifact_saved": False,
            "message": "Report saved to state only",
            "size": len(report_json),
            "summary": "Report saved to session state"
        }

    except Exception as e:
        error_msg = f"Report save error: {str(e)}"
        logger.error(f"Tool: {error_msg}", exc_info=True)

        # Still try to save minimal data to state
        try:
            tool_context.state[StateKeys.USER_LAST_GRADING_REPORT] = {
                'error': error_msg,
                'feedback': feedback_text,
                'timestamp': datetime.now().isoformat()
            }
        except Exception:
            pass

        return {
            "status": "error",
            "message": error_msg,
            "artifact_saved": False,
            "summary": f"Failed to save report: {error_msg}"
        }

Production Pattern: Comprehensive Report

The report aggregates data from multiple sources:

report = {
    'code': {...},           # Original submission
    'analysis': {...},       # From code_analyzer
    'style': {...},          # From style_checker
    'tests': {...},          # From test_runner
    'feedback': {...},       # From this agent
    'improvements': {...}    # Calculated from history
}

This creates a complete audit trail of the review process.
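
The serialization step can be tried on its own. This is a reduced sketch with hypothetical helpers `build_report` and `report_filename`; it keeps only a few of the report fields and uses a fixed timestamp so the result is reproducible.

```python
import hashlib
import json
from typing import Any, Dict


def build_report(code: str, style_score: int, feedback: str,
                 timestamp: str) -> Dict[str, Any]:
    """Assemble a reduced version of the report dictionary."""
    return {
        "timestamp": timestamp,
        "code": {
            "content": code,
            "line_count": len(code.splitlines()),
            # Fingerprint for matching identical submissions, not for security
            "hash": hashlib.md5(code.encode()).hexdigest(),
        },
        "style": {"score": style_score},
        "feedback": feedback,
    }


def report_filename(timestamp: str) -> str:
    """Replace colons so the name is valid on common filesystems."""
    return f"grading_report_{timestamp.replace(':', '-')}.json"


timestamp = "2025-01-15T10:30:00"  # fixed value for the example
report = build_report("def f():\n    return 1\n", 85, "Add a docstring.", timestamp)
report_json = json.dumps(report, indent=2)
filename = report_filename(timestamp)
```

Because the report contains only JSON-compatible types, it survives a round trip through json.dumps and json.loads unchanged, which is what makes it usable both as an artifact and as a state value.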

Dual storage strategy:

# Try artifact service first (persistent, downloadable)
if hasattr(tool_context, 'save_artifact'):
    await tool_context.save_artifact(filename, report_part)

# Fall back to state (always works)
tool_context.state[StateKeys.USER_LAST_GRADING_REPORT] = report

Production systems need this redundancy - if artifact storage fails, the data isn't lost.
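The fallback flow can be sketched without ADK. In the sketch below, `FakeToolContext`, `save_report`, the `report.json` filename, and the `"last_grading_report"` key are hypothetical stand-ins rather than ADK APIs. Only the try-artifact-then-state shape mirrors the tool above, and it assumes state is written only when the artifact save fails.

```python
import asyncio
import json


class FakeToolContext:
    """Hypothetical stand-in for ADK's ToolContext (illustration only)."""

    def __init__(self, artifact_ok: bool):
        self.state = {}
        self.artifacts = {}
        self._artifact_ok = artifact_ok

    async def save_artifact(self, filename: str, data: bytes) -> int:
        if not self._artifact_ok:
            raise RuntimeError("artifact service unavailable")
        self.artifacts[filename] = data
        return 0  # pretend version number


async def save_report(ctx: FakeToolContext, report: dict) -> dict:
    """Try artifact storage first; fall back to state on failure."""
    payload = json.dumps(report).encode("utf-8")
    try:
        version = await ctx.save_artifact("report.json", payload)
        return {"status": "success", "artifact_saved": True, "version": version}
    except Exception as err:
        # Fallback: keep the report in session state so it is not lost.
        ctx.state["last_grading_report"] = report
        return {"status": "success", "artifact_saved": False, "reason": str(err)}


ctx = FakeToolContext(artifact_ok=False)
print(asyncio.run(save_report(ctx, {"score": 88})))
```

Either way the caller gets a `"success"` status, and the `artifact_saved` flag tells it which storage path was used.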

Creating the Synthesizer Agent

👉 Open

code_review_assistant/sub_agents/review_pipeline/feedback_synthesizer.py

👉 Find:

# MODULE_5_STEP_4_INSTRUCTION_PROVIDER

👉 Replace with the production instruction provider:

async def feedback_instruction_provider(context: ReadonlyContext) -> str:
    """Dynamic instruction provider that injects state variables."""
    template = """You are an expert code reviewer and mentor providing constructive, educational feedback.

CONTEXT FROM PREVIOUS AGENTS:
- Structure analysis summary: {structure_analysis_summary}
- Style check summary: {style_check_summary}  
- Test execution summary: {test_execution_summary}

YOUR TASK requires these steps IN ORDER:
1. Call search_past_feedback tool with developer_id="default_user"
2. Call update_grading_progress tool with no parameters
3. Carefully analyze the test results to understand what really happened
4. Generate comprehensive feedback following the structure below
5. Call save_grading_report tool with the feedback_text parameter
6. Return the feedback as your final output

CRITICAL - Understanding Test Results:
The test_execution_summary contains structured JSON. Parse it carefully:
- tests_passed = Code worked correctly
- tests_failed = Code produced wrong output
- tests_with_errors = Code crashed
- critical_issues = Fundamental problems with the code

If critical_issues array contains items, these are serious bugs that need fixing.
Do NOT count discovering bugs as test successes.

FEEDBACK STRUCTURE TO FOLLOW:

## 📊 Summary
Provide an honest assessment. Be encouraging but truthful about problems found.

## ✅ Strengths  
List 2-3 things done well, referencing specific code elements.

## 📈 Code Quality Analysis

### Structure & Organization
Comment on code organization, readability, and documentation.

### Style Compliance
Report the actual style score and any specific issues.

### Test Results
Report the actual test results accurately:
- If critical_issues exist, report them as bugs to fix
- Be clear: "X tests passed, Y critical issues were found"
- List each critical issue
- Don't hide or minimize problems

## 💡 Recommendations for Improvement
Based on the analysis, provide specific actionable fixes.
If critical issues exist, fixing them is top priority.

## 🎯 Next Steps
Prioritized action list based on severity of issues.

## 💬 Encouragement
End with encouragement while being honest about what needs fixing.

Remember: Complete ALL steps including calling save_grading_report."""

    return await instructions_utils.inject_session_state(template, context)

👉 Find:

# MODULE_5_STEP_4_SYNTHESIZER_AGENT

👉 Replace with:

feedback_synthesizer_agent = Agent(
    name="FeedbackSynthesizer",
    model=config.critic_model,
    description="Synthesizes all analysis into constructive, personalized feedback",
    instruction=feedback_instruction_provider,
    tools=[
        FunctionTool(func=search_past_feedback),
        FunctionTool(func=update_grading_progress),
        FunctionTool(func=save_grading_report)
    ],
    output_key="final_feedback"
)

Production Pattern: Structured Feedback with Tool Orchestration

The synthesizer orchestrates three tools in sequence:

# 1. Search memory for patterns
search_past_feedback(developer_id="default_user")

# 2. Update progress metrics (no params - reads from state)
update_grading_progress()

# 3. Save comprehensive report
save_grading_report(feedback_text=generated_feedback)

Why this order matters:

Search memory first - gathers historical context before writing feedback
Update progress in the middle - records metrics during synthesis
Save the report last - captures the complete feedback after it has been generated

The instruction explicitly tells the LLM to call these tools in order, which keeps its behavior consistent from run to run.

Critic Model for Synthesis

Like the test runner, the synthesizer uses the more capable model:

model=config.critic_model,

Why the expensive model? The synthesizer must:

Parse JSON from the test runner (structured data)
Distinguish bug reports from test successes
Integrate memory patterns with current results
Generate personalized, encouraging feedback
Balance honesty with encouragement

This level of nuance and reasoning justifies the cost. The mechanical agents (analyzer, style checker) save money by using the worker model.

Step 5: Wire Up the Pipeline

Now connect all four agents into a sequential pipeline and create the root agent.

👉 Open

code_review_assistant/agent.py

👉 Add the necessary imports at the top of the file (after the existing imports):

from google.adk.agents import Agent, SequentialAgent
from code_review_assistant.sub_agents.review_pipeline.code_analyzer import code_analyzer_agent
from code_review_assistant.sub_agents.review_pipeline.style_checker import style_checker_agent
from code_review_assistant.sub_agents.review_pipeline.test_runner import test_runner_agent
from code_review_assistant.sub_agents.review_pipeline.feedback_synthesizer import feedback_synthesizer_agent

Your file should now look like this:

"""
Main agent orchestration for the Code Review Assistant.
"""

from google.adk.agents import Agent, SequentialAgent
from .config import config
from code_review_assistant.sub_agents.review_pipeline.code_analyzer import code_analyzer_agent
from code_review_assistant.sub_agents.review_pipeline.style_checker import style_checker_agent
from code_review_assistant.sub_agents.review_pipeline.test_runner import test_runner_agent
from code_review_assistant.sub_agents.review_pipeline.feedback_synthesizer import feedback_synthesizer_agent

# MODULE_5_STEP_5_CREATE_PIPELINE

# MODULE_6_STEP_5_CREATE_FIX_LOOP

# MODULE_6_STEP_5_UPDATE_ROOT_AGENT

👉 Find:

# MODULE_5_STEP_5_CREATE_PIPELINE

👉 Replace that line with this code:

# Create sequential pipeline
code_review_pipeline = SequentialAgent(
    name="CodeReviewPipeline",
    description="Complete code review pipeline with analysis, testing, and feedback",
    sub_agents=[
        code_analyzer_agent,
        style_checker_agent,
        test_runner_agent,
        feedback_synthesizer_agent
    ]
)

# Root agent - coordinates the review pipeline
root_agent = Agent(
    name="CodeReviewAssistant",
    model=config.worker_model,
    description="An intelligent code review assistant that analyzes Python code and provides educational feedback",
    instruction="""You are a specialized Python code review assistant focused on helping developers improve their code quality.

When a user provides Python code for review:
1. Immediately delegate to CodeReviewPipeline and pass the code EXACTLY as it was provided by the user.
2. The pipeline will handle all analysis and feedback
3. Return ONLY the final feedback from the pipeline - do not add any commentary

When a user asks what you can do or asks general questions:
- Explain your capabilities for code review
- Do NOT trigger the pipeline for non-code messages

The pipeline handles everything for code review - just pass through its final output.""",
    sub_agents=[code_review_pipeline],
    output_key="assistant_response"
)

Sequential Pipeline Dependencies

The pipeline order is critical:

sub_agents=[
    code_analyzer_agent,       # Creates CODE_TO_REVIEW in state
    style_checker_agent,        # Reads CODE_TO_REVIEW
    test_runner_agent,          # Reads CODE_TO_REVIEW
    feedback_synthesizer_agent  # Reads all three summaries
]

What each agent needs:

Analyzer: needs only the user input → runs first
Style/Test: need CODE_TO_REVIEW from the analyzer
Synthesizer: needs all three output_key summaries

Changing the order breaks these dependencies. If the style checker ran first:

code = tool_context.state.get(StateKeys.CODE_TO_REVIEW)  # Returns None!
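A toy model makes the ordering dependency concrete. Here the agents are plain functions that mutate a shared dict. The names `analyzer`, `style_checker`, `run_in_order`, and the state keys are illustrative assumptions, not the codelab's agents.

```python
from typing import Callable, Dict, List

State = Dict[str, object]


def analyzer(state: State) -> None:
    # Writes the code under review into shared state.
    state["code_to_review"] = state["user_input"]


def style_checker(state: State) -> None:
    # Reads what the analyzer stored; gets None if the analyzer has not run yet.
    code = state.get("code_to_review")
    state["style_check_summary"] = "skipped: no code" if code is None else "checked"


def run_in_order(steps: List[Callable[[State], None]], user_input: str) -> State:
    """Run each step once, in list order, against one shared state dict."""
    state: State = {"user_input": user_input}
    for step in steps:
        step(state)
    return state


good = run_in_order([analyzer, style_checker], "def f(): pass")
bad = run_in_order([style_checker, analyzer], "def f(): pass")
print(good["style_check_summary"])  # checked
print(bad["style_check_summary"])   # skipped: no code
```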

Module 5 vs Final Production

At the end of Module 5, your agent.py has:

# Module 5 - Review pipeline only
root_agent = Agent(
    sub_agents=[code_review_pipeline]  # Single pipeline
)

In Module 6, you'll add the fix pipeline:

# Module 6 - Both pipelines
root_agent = Agent(
    sub_agents=[code_review_pipeline, code_fix_pipeline]  # Two pipelines
)

The instruction will also expand to cover offering fixes and handling user responses. For now, focus on getting the review pipeline working.

Step 6: Test the Complete Pipeline

Time to see all four agents working together.

👉 Start the system:

adk web code_review_assistant

After running the adk web command, you should see output in your terminal indicating that the ADK Web Server has started, similar to this:

+-----------------------------------------------------------------------------+
| ADK Web Server started                                                      |
|                                                                             |
| For local testing, access at http://localhost:8000.                         |
+-----------------------------------------------------------------------------+

INFO:     Application startup complete.
INFO:     Uvicorn running on http://0.0.0.0:8000 (Press CTRL+C to quit)

👉 Next, to access the ADK Dev UI from your browser:

From the Web preview icon (often an eye or a square with an arrow) in the Cloud Shell toolbar (usually top right), select Change port. In the pop-up window, set the port to 8000 and click "Change and Preview". Cloud Shell then opens a new browser tab or window displaying the ADK Dev UI.

👉 Your agent is now running. The ADK Dev UI in your browser is your direct connection to the agent.

Select your target: In the dropdown menu at the top of the UI, choose the code_review_assistant agent.

👉 Test Prompt:

Please analyze the following:
def dfs_search_v1(graph, start, target):
    """Find if target is reachable from start."""
    visited = set()
    stack = start
   
    while stack:
        current = stack.pop()
       
        if current == target:
            return True
           
        if current not in visited:
            visited.add(current)
           
            for neighbor in graph[current]:
                if neighbor not in visited:
                    stack.append(neighbor)
   
    return False

👉 See the code review pipeline in action:

When you submit the buggy dfs_search_v1 function, you're not just getting one answer. You're watching your multi-agent pipeline at work. The streaming output you see is the result of four specialized agents executing in sequence, each building on the last.

Here's a breakdown of what each agent contributes to the final, comprehensive review, turning raw data into actionable intelligence.

1. The Code Analyzer's Structural Report

First, the CodeAnalyzer agent receives the raw code. It doesn't guess what the code does; it uses the analyze_code_structure tool to perform a deterministic Abstract Syntax Tree (AST) parse.

Its output is pure, factual data about the code's structure:

The analysis of the provided code reveals the following:

Summary:
- Functions Found: 1
- Classes Found: 0

Key Structural Observations:
- A single function, dfs_search_v1, is defined.
- It includes a docstring: "Find if target is reachable from start."
- No syntax errors were detected.

Overall Code Organization Assessment:
- The code snippet is a well-defined, self-contained function.

⭐ Value: This first step gives the other agents a clean, reliable foundation. It confirms the code is valid Python and identifies the exact components that need to be reviewed.
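The kind of deterministic parse described here can be sketched with Python's standard `ast` module. `summarize_structure` is a hypothetical helper written for illustration; it is not the codelab's `analyze_code_structure` tool, and its output keys are assumptions.

```python
import ast


def summarize_structure(source: str) -> dict:
    """Count functions and classes and collect docstrings via the AST."""
    tree = ast.parse(source)  # raises SyntaxError on invalid Python
    functions = [
        node for node in ast.walk(tree)
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
    ]
    classes = [node for node in ast.walk(tree) if isinstance(node, ast.ClassDef)]
    return {
        "functions_found": len(functions),
        "classes_found": len(classes),
        "docstrings": {f.name: ast.get_docstring(f) for f in functions},
    }


sample = '''
def dfs_search_v1(graph, start, target):
    """Find if target is reachable from start."""
    return False
'''
print(summarize_structure(sample))
```

Because the result comes from the parser rather than from a model, the same input always yields the same counts.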

2. The Style Checker's PEP 8 Audit

Next, the StyleChecker agent takes over. It reads the code from the shared state and uses the check_code_style tool, which relies on the pycodestyle linter.

Its output is a quantifiable quality score and specific violations:

Style Analysis Results
- Style Score: 88/100
- Total Issues: 6
- Assessment: Good style with minor improvements needed

Top Style Issues
- Line 5, W293: blank line contains whitespace
- Line 19, W292: no newline at end of file

⭐ Value: This agent provides objective, non-negotiable feedback based on established community standards (PEP 8). The weighted scoring system immediately tells the user how severe the issues are.
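One way a weighted score could work is to subtract a per-issue penalty that depends on the issue class. The weights and the `style_score` helper below are assumptions made for illustration; the codelab's actual scoring formula is not shown in this section.

```python
from typing import Iterable

# Hypothetical weights per pycodestyle code prefix: errors cost more than warnings.
WEIGHTS = {"E": 5, "W": 2}


def style_score(issue_codes: Iterable[str]) -> int:
    """Start at 100 and subtract a weight per issue, never going below 0."""
    penalty = sum(WEIGHTS.get(code[:1], 1) for code in issue_codes)
    return max(0, 100 - penalty)


print(style_score(["W293", "W292"]))  # 96
print(style_score(["E501", "W293"]))  # 93
```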

3. The Test Runner's Critical Bug Discovery

This is where the system goes beyond surface-level analysis. The TestRunner agent generates and executes a comprehensive suite of tests to validate the code's behavior.

Its output is a structured JSON object that contains a damning verdict:

{
  "critical_issues": [
    {
      "type": "Critical Bug",
      "description": "The function's initialization `stack = start` is incorrect... When a common input like a string... is provided... the function crashes with an AttributeError.",
      "severity": "Critical"
    }
  ],
  "verdict": {
    "status": "BROKEN",
    "confidence": "high",
    "recommendation": "The function is fundamentally broken... the stack initialization line `stack = start` must be changed to `stack = [start]`."
  }
}

⭐ Value: This is the most critical insight. The agent didn't just guess; it proved the code was broken by running it. It uncovered a subtle but critical runtime bug that a human reviewer might easily miss, and it pinpointed the exact cause and the required fix.
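The bug is easy to reproduce by hand. The sketch below runs the original function and a corrected copy on a small graph. The name `dfs_search_fixed` and the sample graph are illustrative, and the `graph.get(current, [])` guard is an extra hardening step beyond the one-line fix reported above.

```python
def dfs_search_v1(graph, start, target):
    """Buggy version from the walkthrough: stack is not a list."""
    visited = set()
    stack = start  # bug: a str has no pop() method
    while stack:
        current = stack.pop()
        if current == target:
            return True
        if current not in visited:
            visited.add(current)
            for neighbor in graph[current]:
                if neighbor not in visited:
                    stack.append(neighbor)
    return False


def dfs_search_fixed(graph, start, target):
    """Same algorithm with the stack initialized as a list."""
    visited = set()
    stack = [start]
    while stack:
        current = stack.pop()
        if current == target:
            return True
        if current not in visited:
            visited.add(current)
            for neighbor in graph.get(current, []):
                if neighbor not in visited:
                    stack.append(neighbor)
    return False


graph = {"A": ["B"], "B": ["C"], "C": []}
try:
    dfs_search_v1(graph, "A", "C")
except AttributeError as err:
    print(f"v1 crashed: {err}")
print(dfs_search_fixed(graph, "A", "C"))  # True
```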

4. The Feedback Synthesizer's Final Report

Finally, the FeedbackSynthesizer agent acts as the conductor. It takes the structured data from the previous three agents and crafts a single, user-friendly report that is both analytical and encouraging.

Its output is the final, polished review you see:

📊 Summary
Great effort on implementing the Depth-First Search algorithm! ... However, a critical bug in the initialization of the stack prevents the function from working correctly...

✅ Strengths
- Good Algorithm Structure
- Correct Use of `visited` Set

📈 Code Quality Analysis
...
### Style Compliance
The style analysis returned a good score of 88/100.
...
### Test Results
The automated testing revealed a critical issue... The line `stack = start` directly assigns the input... which results in an `AttributeError`.

💡 Recommendations for Improvement
**Fix the Critical Stack Initialization Bug:**
- Incorrect Code: `stack = start`
- Correct Code: `stack = [start]`

💬 Encouragement
You are very close to a perfect implementation! The core logic of your DFS algorithm is sound, which is the hardest part.

⭐ Value: This agent transforms technical data into a helpful, educational experience. It prioritizes the most important issue (the bug), explains it clearly, provides the exact solution, and does so in an encouraging tone. It successfully integrates the findings from all previous stages into a cohesive and valuable whole.

This multi-stage process demonstrates the power of an agentic pipeline. Instead of a single, monolithic response, you get a layered analysis where each agent performs a specialized, verifiable task. This leads to a review that is not only insightful but also deterministic, reliable, and deeply educational.

👉💻 Once you're done testing, return to your Cloud Shell Editor terminal and press Ctrl+C to stop the ADK Dev UI.

What You've Built

You now have a complete code review pipeline that:

✅ Parses code structure - deterministic AST analysis with helper functions
✅ Checks style - weighted scoring with naming conventions
✅ Runs tests - comprehensive test generation with structured JSON output
✅ Synthesizes feedback - integrates state + memory + artifacts
✅ Tracks progress - multi-tier state across invocations/sessions/users
✅ Learns over time - memory service for cross-session patterns
✅ Provides artifacts - downloadable JSON reports with complete audit trail

Key Concepts Mastered

Sequential Pipelines:

Four agents executing in strict order
Each enriches state for the next
Dependencies determine execution sequence

Production Patterns:

Helper function separation (sync in thread pools)
Graceful degradation (fallback strategies)
Multi-tier state management (temp/session/user)
Dynamic instruction providers (context-aware)
Dual storage (artifacts + state redundancy)

State as Communication:

Constants prevent typos across agents
output_key writes agent summaries to state
Later agents read via StateKeys
State flows linearly through pipeline

Memory vs State:

State: current session data
Memory: patterns across sessions
Different purposes, different lifetimes

Tool Orchestration:

Single-tool agents (analyzer, style_checker)
Built-in executors (test_runner)
Multi-tool coordination (synthesizer)

Model Selection Strategy:

Worker model: mechanical tasks (parsing, linting, routing)
Critic model: reasoning tasks (testing, synthesis)
Cost optimization through appropriate selection

What's Next?

In Module 6, you'll build the fix pipeline:

LoopAgent architecture for iterative fixing
Exit conditions via escalation
State accumulation across iterations
Validation and retry logic
Integration with review pipeline to offer fixes

You'll see how the same state patterns scale to complex iterative workflows where agents attempt multiple times until success, and how to coordinate multiple pipelines in a single application.

6. Adding the Fix Pipeline: Loop Architecture

Introduction

In Module 5, you built a sequential review pipeline that analyzes code and provides feedback. But identifying problems is only half the solution - developers need help fixing them.

This module builds an automated fix pipeline that:

Generates fixes based on review results
Validates fixes by running comprehensive tests
Retries automatically if fixes don't work (up to 3 attempts)
Reports results with before/after comparisons

Key concept: LoopAgent for automatic retry. Unlike sequential agents that run once, a LoopAgent repeats its sub-agents until an exit condition is met or max_iterations is reached. Tools signal success by setting tool_context.actions.escalate = True.

Preview of what you'll build: Submit buggy code → review identifies issues → fix loop generates corrections → tests validate → retries if needed → final comprehensive report.

Core Concepts: LoopAgent vs Sequential

Sequential Pipeline (Module 5):

SequentialAgent(sub_agents=[A, B, C])
# Executes: A → B → C → Done

One-way flow
Each agent runs exactly once
No retry logic

Loop Pipeline (Module 6):

LoopAgent(sub_agents=[A, B, C], max_iterations=3)
# Executes: A → B → C → (check exit) → A → B → C → (check exit) → ...

Cyclic flow
Agents can run multiple times
Exits when:
- A tool sets tool_context.actions.escalate = True (success)
- max_iterations reached (safety limit)
- Unhandled exception occurs (error)
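These exit rules can be modeled in a few lines of plain Python. This is a toy simulation, not ADK's LoopAgent implementation: `run_loop`, `make_validator`, and the `SimpleNamespace` stand-in for the actions object are all assumptions made for illustration.

```python
from types import SimpleNamespace
from typing import Callable, List


def run_loop(steps: List[Callable], max_iterations: int) -> int:
    """Repeat steps until one sets escalate or the cap is hit.

    Returns the number of the iteration in which the loop stopped.
    """
    for iteration in range(1, max_iterations + 1):
        for step in steps:
            actions = SimpleNamespace(escalate=False)
            step(iteration, actions)
            if actions.escalate:
                return iteration  # exit signal: stop looping
    return max_iterations  # safety limit reached


def noop(iteration: int, actions: SimpleNamespace) -> None:
    """Placeholder for the generate and test steps."""


def make_validator(succeeds_on: int) -> Callable:
    """Build a validator step that escalates from a given iteration onward."""
    def validate(iteration: int, actions: SimpleNamespace) -> None:
        if iteration >= succeeds_on:
            actions.escalate = True
    return validate


print(run_loop([noop, noop, make_validator(2)], max_iterations=3))  # 2
print(run_loop([noop, noop, make_validator(9)], max_iterations=3))  # 3
```

The first call stops early because the validator escalates on the second pass. The second call never escalates, so the iteration cap ends the loop.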

Why loops for code fixing:

Code fixes often need multiple attempts:

First attempt: Fix obvious bugs (wrong variable types)
Second attempt: Fix secondary issues revealed by tests (edge cases)
Third attempt: Fine-tune and validate all tests pass

Without a loop, you'd need complex conditional logic in agent instructions. With LoopAgent, retry is automatic.

Architecture comparison:

Sequential (Module 5):
User → Review Pipeline → Feedback → Done

Loop (Module 6):
User → Review Pipeline → Feedback → Fix Pipeline
                                         ↓
                          ┌──────────────┴──────────────┐
                          │   Fix Attempt Loop (1-3x)   │
                          │  ┌─────────────────────┐    │
                          │  │ 1. Generate Fixes   │    │
                          │  │ 2. Test Fixes       │    │
                          │  │ 3. Validate & Exit? │────┼─→ If escalate=True
                          │  └─────────────────────┘    │      exit loop
                          │         ↓ If not            │
                          │    Try Again (max 3)        │
                          └─────────────────────────────┘
                                     ↓
                          4. Synthesize Final Report → Done

Step 1: Add Code Fixer Agent

The code fixer generates corrected Python code based on review results.

👉 Open

code_review_assistant/sub_agents/fix_pipeline/code_fixer.py

👉 Find:

# MODULE_6_STEP_1_CODE_FIXER_INSTRUCTION_PROVIDER

👉 Replace that single line with:

async def code_fixer_instruction_provider(context: ReadonlyContext) -> str:
    """Dynamic instruction provider that injects state variables."""
    template = """You are an expert code fixing specialist.

Original Code:
{code_to_review}

Analysis Results:
- Style Score: {style_score}/100
- Style Issues: {style_issues}
- Test Results: {test_execution_summary}

Based on the test results, identify and fix ALL issues including:
- Interface bugs (e.g., if start parameter expects wrong type)
- Logic errors (e.g., KeyError when accessing graph nodes)
- Style violations
- Missing documentation

YOUR TASK:
Generate the complete fixed Python code that addresses all identified issues.

CRITICAL INSTRUCTIONS:
- Output ONLY the corrected Python code
- Do NOT include markdown code blocks (```python)
- Do NOT include any explanations or commentary
- The output should be valid, executable Python code and nothing else

Common fixes to apply based on test results:
- If tests show AttributeError with 'pop', fix: stack = [start] instead of stack = start
- If tests show KeyError accessing graph, fix: use graph.get(current, [])
- Add docstrings if missing
- Fix any style violations identified

Output the complete fixed code now:"""

    return await instructions_utils.inject_session_state(template, context)

👉 Find:

# MODULE_6_STEP_1_CODE_FIXER_AGENT

👉 Replace that single line with:

code_fixer_agent = Agent(
    name="CodeFixer",
    model=config.worker_model,
    description="Generates comprehensive fixes for all identified code issues",
    instruction=code_fixer_instruction_provider,
    code_executor=BuiltInCodeExecutor(),
    output_key="code_fixes"
)

Why Output Raw Code Only?

The instruction explicitly says "Do NOT include markdown code blocks".

Bad output (will break downstream agents): the code wrapped in a markdown fence, that is, a ```python line before it and a ``` line after it:

def fixed_function():
    pass

Good output (raw Python):

def fixed_function():
    pass

Why? The fix_test_runner_agent needs to execute this code directly. Markdown formatting would cause syntax errors. The output_key="code_fixes" stores raw Python in state.
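The effect of a stray fence is easy to check with the standard `ast` module. The helpers below, `strip_code_fence` and `is_valid_python`, are hypothetical and not part of the codelab; they only show why fenced output fails to parse and one simple way to recover from it.

```python
import ast


def strip_code_fence(text: str) -> str:
    """Remove a surrounding markdown fence if one is present."""
    lines = text.strip().splitlines()
    if lines and lines[0].startswith("```"):
        lines = lines[1:]
    if lines and lines[-1].strip() == "```":
        lines = lines[:-1]
    return "\n".join(lines)


def is_valid_python(text: str) -> bool:
    """Return True if the text parses as Python source."""
    try:
        ast.parse(text)
        return True
    except SyntaxError:
        return False


fenced = "```python\ndef fixed_function():\n    pass\n```"
print(is_valid_python(fenced))                    # False
print(is_valid_python(strip_code_fence(fenced)))  # True
```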

Context Provider Pattern Again

Like the synthesizer in Module 5, the fixer uses dynamic instructions:

instruction=code_fixer_instruction_provider

The function reads current state each invocation:

{code_to_review} - original buggy code
{style_issues} - what to fix
{test_execution_summary} - what failed

If the loop retries, the instruction sees updated state from the previous attempt.
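The placeholder substitution can be pictured with a small stand-in. `inject_state` below is a simplified, hypothetical substitute for `instructions_utils.inject_session_state`: it is synchronous, takes a plain dict, and leaves unknown keys untouched, which the real helper may handle differently.

```python
import re


def inject_state(template: str, state: dict) -> str:
    """Replace {key} placeholders with values from state (stand-in only)."""
    def lookup(match: re.Match) -> str:
        key = match.group(1)
        return str(state.get(key, match.group(0)))  # leave unknown keys as-is
    return re.sub(r"\{(\w+)\}", lookup, template)


template = "Style Score: {style_score}/100\nTest Results: {test_execution_summary}"

# The same template renders differently as state changes between invocations.
first = inject_state(template, {"style_score": 88, "test_execution_summary": "BROKEN"})
later = inject_state(template, {"style_score": 100, "test_execution_summary": "BROKEN"})
print(first)
print(later)
```

Because the template is rendered on every invocation, each run sees whatever the state holds at that moment.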

Step 2: Add Fix Test Runner Agent

The fix test runner validates corrections by executing comprehensive tests on the fixed code.

👉 Open

code_review_assistant/sub_agents/fix_pipeline/fix_test_runner.py

👉 Find:

# MODULE_6_STEP_2_FIX_TEST_RUNNER_INSTRUCTION_PROVIDER

👉 Replace that single line with:

async def fix_test_runner_instruction_provider(context: ReadonlyContext) -> str:
    """Dynamic instruction provider that uses the clean code from the previous step."""
    template = """You are responsible for validating the fixed code by running tests.

THE FIXED CODE TO TEST:
{code_fixes}

ORIGINAL TEST RESULTS: {test_execution_summary}

YOUR TASK:
1. Understand the fixes that were applied
2. Generate the same comprehensive tests (15-20 test cases)
3. Execute the tests on the FIXED code using your code executor
4. Compare results with original test results
5. Output a detailed JSON analysis

TESTING METHODOLOGY:
- Run the same tests that revealed issues in the original code
- Verify that previously failing tests now pass
- Ensure no regressions were introduced
- Document the improvement

Execute your tests and output ONLY valid JSON with this structure:
- "passed": number of tests that passed
- "failed": number of tests that failed  
- "total": total number of tests
- "pass_rate": percentage as a number
- "comparison": object with "original_pass_rate", "new_pass_rate", "improvement"
- "newly_passing_tests": array of test names that now pass
- "still_failing_tests": array of test names still failing

Do NOT output the test code itself, only the JSON analysis."""

    return await instructions_utils.inject_session_state(template, context)

👉 Find:

# MODULE_6_STEP_2_FIX_TEST_RUNNER_AGENT

👉 Replace that single line with:

fix_test_runner_agent = Agent(
    name="FixTestRunner",
    model=config.critic_model,
    description="Runs comprehensive tests on fixed code to verify all issues are resolved",
    instruction=fix_test_runner_instruction_provider,
    code_executor=BuiltInCodeExecutor(),
    output_key="fix_test_execution_summary"
)
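Since the instruction asks for JSON only, a consumer may want to sanity-check what comes back before trusting it. The sketch below is one possible check, not part of the codelab. `parse_fix_summary` is a hypothetical helper, and the rule that passed plus failed must equal total is an assumption about the output.

```python
import json

REQUIRED_KEYS = {"passed", "failed", "total", "pass_rate"}


def parse_fix_summary(raw: str) -> dict:
    """Parse the runner's JSON and check that the counts are consistent."""
    data = json.loads(raw)
    missing = REQUIRED_KEYS - data.keys()
    if missing:
        raise ValueError(f"missing keys: {sorted(missing)}")
    if data["passed"] + data["failed"] != data["total"]:
        raise ValueError("passed + failed does not equal total")
    return data


summary = parse_fix_summary(
    '{"passed": 18, "failed": 0, "total": 18, "pass_rate": 100}'
)
print(summary["pass_rate"])  # 100
```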

Critic Model for Testing

Notice this agent uses config.critic_model:

model=config.critic_model,

This is typically a more capable model (like gemini-2.5-pro) because:

Generating 15-20 comprehensive test cases requires sophistication
Must understand edge cases and potential regressions
Needs to parse original test results and compare accurately

The fixer used worker_model because code generation is more mechanical. Testing requires critical thinking.

Step 3: Add Fix Validator Agent

The validator checks if fixes were successful and decides whether to exit the loop.

Understanding the Tools

First, add the three tools the validator needs.

👉 Open

code_review_assistant/tools.py

👉 Find:

# MODULE_6_STEP_3_VALIDATE_FIXED_STYLE

👉 Replace with Tool 1 - Style Validator:

async def validate_fixed_style(tool_context: ToolContext) -> Dict[str, Any]:
    """
    Validates style compliance of the fixed code.

    Args:
        tool_context: ADK tool context containing fixed code in state

    Returns:
        Dictionary with style validation results
    """
    logger.info("Tool: Validating style of fixed code...")

    try:
        # Get the fixed code from state
        code_fixes = tool_context.state.get(StateKeys.CODE_FIXES, '')
       
        # Try to extract from markdown if present
        if '```python' in code_fixes:
            start = code_fixes.rfind('```python') + 9
            end = code_fixes.rfind('```')
            if start < end:
                code_fixes = code_fixes[start:end].strip()

        if not code_fixes:
            return {
                "status": "error",
                "message": "No fixed code found in state"
            }

        # Store the extracted fixed code
        tool_context.state[StateKeys.CODE_FIXES] = code_fixes

        # Run style check on fixed code
        loop = asyncio.get_event_loop()
        with ThreadPoolExecutor() as executor:
            style_result = await loop.run_in_executor(
                executor, _perform_style_check, code_fixes
            )

        # Compare with original
        original_score = tool_context.state.get(StateKeys.STYLE_SCORE, 0)
        improvement = style_result['score'] - original_score

        # Store results
        tool_context.state[StateKeys.FIXED_STYLE_SCORE] = style_result['score']
        tool_context.state[StateKeys.FIXED_STYLE_ISSUES] = style_result['issues']

        logger.info(f"Tool: Fixed code style score: {style_result['score']}/100 "
                    f"(improvement: {improvement:+})")

        return {
            "status": "success",
            "fixed_style_score": style_result['score'],
            "original_style_score": original_score,
            "improvement": improvement,
            "remaining_issues": style_result['issues'],
            "perfect_style": style_result['score'] == 100
        }

    except Exception as e:
        logger.error(f"Tool: Style validation failed: {e}", exc_info=True)
        return {
            "status": "error",
            "message": str(e)
        }
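The thread-pool pattern used for the style check can be seen in isolation below. `slow_style_check` is a made-up stand-in for a blocking linter call (it only flags trailing whitespace), and `check_async` is a hypothetical wrapper. Only `run_in_executor` and `ThreadPoolExecutor` are real standard-library APIs.

```python
import asyncio
from concurrent.futures import ThreadPoolExecutor


def slow_style_check(code: str) -> dict:
    """Stand-in for a blocking, synchronous linter call."""
    issues = [
        number for number, line in enumerate(code.splitlines(), 1)
        if line != line.rstrip()  # line has trailing whitespace
    ]
    return {"score": max(0, 100 - 2 * len(issues)), "issues": issues}


async def check_async(code: str) -> dict:
    """Run the blocking check in a worker thread so the event loop stays free."""
    loop = asyncio.get_running_loop()
    with ThreadPoolExecutor(max_workers=1) as executor:
        return await loop.run_in_executor(executor, slow_style_check, code)


result = asyncio.run(check_async("x = 1 \ny = 2\n"))
print(result)  # {'score': 98, 'issues': [1]}
```

Keeping the synchronous work in its own function means it can be tested without an event loop, while the async wrapper stays a thin shell.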

👉 Find:

# MODULE_6_STEP_3_COMPILE_FIX_REPORT

👉 Replace with Tool 2 - Report Compiler:

async def compile_fix_report(tool_context: ToolContext) -> Dict[str, Any]:
    """
    Compiles comprehensive report of the fix process.

    Args:
        tool_context: ADK tool context with all fix pipeline data

    Returns:
        Comprehensive fix report
    """
    logger.info("Tool: Compiling comprehensive fix report...")

    try:
        # Gather all data
        original_code = tool_context.state.get(StateKeys.CODE_TO_REVIEW, '')
        code_fixes = tool_context.state.get(StateKeys.CODE_FIXES, '')

        # Test results
        original_tests = tool_context.state.get(StateKeys.TEST_EXECUTION_SUMMARY, {})
        fixed_tests = tool_context.state.get(StateKeys.FIX_TEST_EXECUTION_SUMMARY, {})

        # Parse if strings
        if isinstance(original_tests, str):
            try:
                original_tests = json.loads(original_tests)
            except json.JSONDecodeError:
                original_tests = {}

        if isinstance(fixed_tests, str):
            try:
                fixed_tests = json.loads(fixed_tests)
            except json.JSONDecodeError:
                fixed_tests = {}

        # Extract pass rates
        original_pass_rate = 0
        if original_tests:
            if 'pass_rate' in original_tests:
                original_pass_rate = original_tests['pass_rate']
            elif 'test_summary' in original_tests:
                # Handle test_runner_agent's JSON structure
                summary = original_tests['test_summary']
                total = summary.get('total_tests_run', 0)
                passed = summary.get('tests_passed', 0)
                if total > 0:
                    original_pass_rate = (passed / total) * 100
            elif 'passed' in original_tests and 'total' in original_tests:
                if original_tests['total'] > 0:
                    original_pass_rate = (original_tests['passed'] / original_tests['total']) * 100

        fixed_pass_rate = 0
        all_tests_pass = False
        if fixed_tests:
            if 'pass_rate' in fixed_tests:
                fixed_pass_rate = fixed_tests['pass_rate']
                all_tests_pass = fixed_tests.get('failed', 1) == 0
            elif 'passed' in fixed_tests and 'total' in fixed_tests:
                if fixed_tests['total'] > 0:
                    fixed_pass_rate = (fixed_tests['passed'] / fixed_tests['total']) * 100
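                # Require at least one test, and every test to have passed
                all_tests_pass = (fixed_tests['total'] > 0
                                  and fixed_tests['passed'] == fixed_tests['total'])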

        # Style scores
        original_style = tool_context.state.get(StateKeys.STYLE_SCORE, 0)
        fixed_style = tool_context.state.get(StateKeys.FIXED_STYLE_SCORE, 0)

        # Calculate improvements
        test_improvement = {
            'original_pass_rate': original_pass_rate,
            'fixed_pass_rate': fixed_pass_rate,
            'improvement': fixed_pass_rate - original_pass_rate,
            'all_tests_pass': all_tests_pass
        }

        style_improvement = {
            'original_score': original_style,
            'fixed_score': fixed_style,
            'improvement': fixed_style - original_style,
            'perfect_style': fixed_style == 100
        }

        # Determine overall status
        if all_tests_pass and style_improvement['perfect_style']:
            fix_status = 'SUCCESSFUL'
            status_emoji = '✅'
        elif test_improvement['improvement'] > 0 or style_improvement['improvement'] > 0:
            fix_status = 'PARTIAL'
            status_emoji = '⚠️'
        else:
            fix_status = 'FAILED'
            status_emoji = '❌'

        # Build comprehensive report
        report = {
            'status': fix_status,
            'status_emoji': status_emoji,
            'timestamp': datetime.now().isoformat(),
            'original_code': original_code,
            'code_fixes': code_fixes,
            'improvements': {
                'tests': test_improvement,
                'style': style_improvement
            },
            'summary': f"{status_emoji} Fix Status: {fix_status}\n"
                      f"Tests: {original_pass_rate:.1f}% → {fixed_pass_rate:.1f}%\n"
                      f"Style: {original_style}/100 → {fixed_style}/100"
        }

        # Store report in state
        tool_context.state[StateKeys.FIX_REPORT] = report
        tool_context.state[StateKeys.FIX_STATUS] = fix_status

        logger.info(f"Tool: Fix report compiled - Status: {fix_status}")
        logger.info(f"Tool: Test improvement: {original_pass_rate:.1f}% → {fixed_pass_rate:.1f}%")
        logger.info(f"Tool: Style improvement: {original_style} → {fixed_style}")

        return {
            "status": "success",
            "fix_status": fix_status,
            "report": report
        }

    except Exception as e:
        logger.error(f"Tool: Failed to compile fix report: {e}", exc_info=True)
        return {
            "status": "error",
            "message": str(e)
        }
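The status decision inside compile_fix_report can be pulled out into a pure function so that it is easy to exercise without any ADK state. The sketch below mirrors the branch order shown above; the name `classify_fix` and its parameters are illustrative and are not part of the codelab's tools.py.

```python
def classify_fix(all_tests_pass: bool, fixed_style: int,
                 test_delta: float, style_delta: float) -> str:
    """Mirror the SUCCESSFUL / PARTIAL / FAILED branches of the report tool."""
    # SUCCESSFUL requires both conditions: every test passes AND style is 100.
    if all_tests_pass and fixed_style == 100:
        return "SUCCESSFUL"
    # Any measurable improvement in tests or style counts as PARTIAL.
    if test_delta > 0 or style_delta > 0:
        return "PARTIAL"
    # No improvement (or a regression) on both axes.
    return "FAILED"


print(classify_fix(True, 100, 15.8, 12))
print(classify_fix(False, 95, 4.7, 7))
print(classify_fix(False, 80, 0.0, -5))
```

One consequence of this ordering is worth noting: a fix whose tests all pass but whose style score stays below 100 without improving falls through to FAILED, because neither of the first two branches matches.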

👉 Find:

# MODULE_6_STEP_3_EXIT_FIX_LOOP

👉 Replace with Tool 3 - Loop Exit Signal:

def exit_fix_loop(tool_context: ToolContext) -> Dict[str, Any]:
    """
    Signal that fixing is complete and should exit the loop.
   
    Args:
        tool_context: ADK tool context
       
    Returns:
        Confirmation message
    """
    logger.info("Tool: Setting escalate flag to exit fix loop")
   
    # This is the critical line that exits the LoopAgent
    tool_context.actions.escalate = True
   
    return {
        "status": "success",
        "message": "Fix complete, exiting loop"
    }

The Escalate Mechanism

The exit_fix_loop tool has one critical line:

tool_context.actions.escalate = True

This signals the LoopAgent to stop iterating:

Without escalate: Loop continues to next iteration
With escalate: Loop exits immediately after current iteration completes

Why escalate instead of returning a special value?

Any tool in the pipeline can set it (not just the last one)
Works consistently across all agent types
Clear semantic meaning: "escalate out of this loop"
Doesn't interfere with the tool's return data

The validator decides when to call this tool based on fix quality.
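Because exit_fix_loop only touches tool_context.actions.escalate, it can be checked in isolation with a stub object. This is a minimal sketch: the function body is copied from the tool above without logging or the ADK type hint, and `stub_context` is a hypothetical stand-in for a real ToolContext.

```python
from types import SimpleNamespace
from typing import Any, Dict


def exit_fix_loop(tool_context) -> Dict[str, Any]:
    """Same body as the tool above, minus logging and the ADK type hint."""
    tool_context.actions.escalate = True
    return {"status": "success", "message": "Fix complete, exiting loop"}


# Hypothetical stand-in for ToolContext: only the attribute the tool touches.
stub_context = SimpleNamespace(actions=SimpleNamespace(escalate=False))

result = exit_fix_loop(stub_context)
print(stub_context.actions.escalate, result["status"])
```

The flag lives on the context rather than in the return value, so the returned dictionary stays free for data the model should read.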

Create the Validator Agent

👉 Open

code_review_assistant/sub_agents/fix_pipeline/fix_validator.py

👉 Find:

# MODULE_6_STEP_3_FIX_VALIDATOR_INSTRUCTION_PROVIDER

👉 Replace that single line with:

async def fix_validator_instruction_provider(context: ReadonlyContext) -> str:
    """Dynamic instruction provider that injects state variables."""
    template = """You are the final validation specialist for code fixes.

You have access to:
- Original issues from initial review
- Applied fixes: {code_fixes}
- Test results after fix: {fix_test_execution_summary}
- All state data from the fix process

Your responsibilities:
1. Use validate_fixed_style tool to check style compliance of fixed code
   - Pass no arguments, it will retrieve fixed code from state
2. Use compile_fix_report tool to generate comprehensive report
   - Pass no arguments, it will gather all data from state
3. Based on the report, determine overall fix status:
   - ✅ SUCCESSFUL: All tests pass, style score 100
   - ⚠️ PARTIAL: Improvements made but issues remain
   - ❌ FAILED: Fix didn't work or made things worse

4. CRITICAL: If status is SUCCESSFUL, call the exit_fix_loop tool to stop iterations
   - This prevents unnecessary additional fix attempts
   - If not successful, the loop will continue for another attempt

5. Provide clear summary of:
   - What was fixed
   - What improvements were achieved
   - Any remaining issues requiring manual attention

Be precise and quantitative in your assessment.
"""
    return await instructions_utils.inject_session_state(template, context)

👉 Find:

# MODULE_6_STEP_3_FIX_VALIDATOR_AGENT

👉 Replace that single line with:

fix_validator_agent = Agent(
    name="FixValidator",
    model=config.worker_model,
    description="Validates fixes and generates final fix report",
    instruction=fix_validator_instruction_provider,
    tools=[
        FunctionTool(func=validate_fixed_style),
        FunctionTool(func=compile_fix_report),
        FunctionTool(func=exit_fix_loop)
    ],
    output_key="final_fix_report"
)

Step 4: Understanding LoopAgent Exit Conditions

The LoopAgent has three ways to exit:

1. Success Exit (via escalate)

# Inside any tool in the loop:
tool_context.actions.escalate = True

# Effect: Loop completes current iteration, then exits
# Use when: Fix is successful and no more attempts needed

Example flow:

Iteration 1:
  CodeFixer → generates fixes
  FixTestRunner → tests show 90% pass rate
  FixValidator → compiles report, sees PARTIAL status
  → Does NOT set escalate
  → Loop continues

Iteration 2:
  CodeFixer → refines fixes based on failures
  FixTestRunner → tests show 100% pass rate
  FixValidator → compiles report, sees SUCCESSFUL status
  → Calls exit_fix_loop() which sets escalate = True
  → Loop exits after this iteration

2. Max Iterations Exit

LoopAgent(
    name="FixAttemptLoop",
    sub_agents=[...],
    max_iterations=3  # Safety limit
)

# Effect: After 3 complete iterations, loop exits regardless of escalate
# Use when: Prevent infinite loops if fixes never succeed

Example flow:

Iteration 1: PARTIAL (continue)
Iteration 2: PARTIAL (continue)
Iteration 3: PARTIAL (but max reached)
→ Loop exits, synthesizer presents best attempt

3. Error Exit

# If any agent throws unhandled exception:
raise Exception("Unexpected error")

# Effect: Loop exits immediately with error state
# Use when: Critical failure that can't be recovered
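The three exit paths can be modelled with a toy loop runner in plain Python. This is a simplified sketch of the behavior described above (escalate is checked once the current iteration finishes); it is not ADK's implementation, and `run_loop`, `make_validator`, and `failing_step` are hypothetical names.

```python
from types import SimpleNamespace
from typing import Callable, List, Tuple

Step = Callable[[SimpleNamespace], None]


def run_loop(steps: List[Step], max_iterations: int) -> Tuple[str, int]:
    """Toy loop runner: returns (exit_reason, iterations_run)."""
    actions = SimpleNamespace(escalate=False)
    for iteration in range(1, max_iterations + 1):
        for step in steps:
            step(actions)  # an unhandled exception propagates: error exit
        if actions.escalate:
            return "escalated", iteration  # success exit
    return "max_iterations", max_iterations  # safety-limit exit


def make_validator(succeed_on: int) -> Step:
    """Build a validator step that escalates on its Nth call."""
    calls = {"n": 0}

    def validator(actions: SimpleNamespace) -> None:
        calls["n"] += 1
        if calls["n"] >= succeed_on:
            actions.escalate = True

    return validator


def failing_step(actions: SimpleNamespace) -> None:
    raise RuntimeError("Unexpected error")


print(run_loop([make_validator(2)], max_iterations=3))
print(run_loop([make_validator(99)], max_iterations=3))
try:
    run_loop([failing_step], max_iterations=3)
except RuntimeError as exc:
    print("error exit:", exc)
```

The first call stops on iteration 2 because the validator escalates, the second runs all three iterations, and the third never completes an iteration.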

State Evolution Across Iterations:

Each iteration sees updated state from the previous attempt:

# Before Iteration 1:
state = {
    "code_to_review": "def add(a,b):return a+b",  # Original
    "style_score": 40,
    "test_execution_summary": {...}
}

# After Iteration 1:
state = {
    "code_to_review": "def add(a,b):return a+b",  # Unchanged
    "code_fixes": "def add(a, b):\n    return a + b",  # NEW
    "style_score": 40,  # Unchanged
    "fixed_style_score": 100,  # NEW
    "test_execution_summary": {...},  # Unchanged
    "fix_test_execution_summary": {...}  # NEW
}

# Iteration 2 starts with all this state
# If fixes still not perfect, code_fixes gets overwritten

Why escalate Instead of Return Values:

# Bad: Using return value to signal exit
def validator_agent():
    report = compile_report()
    if report['status'] == 'SUCCESSFUL':
        return {"exit": True}  # How does loop know?

# Good: Using escalate
def validator_tool(tool_context):
    report = compile_report()
    if report['status'] == 'SUCCESSFUL':
        tool_context.actions.escalate = True  # Loop knows immediately
    return {"report": report}

Benefits:

Works from any tool, not just the last one
Doesn't interfere with return data
Clear semantic meaning
Framework handles the exit logic

Debugging Loop Iterations

To see what's happening in each iteration:

# Add to validator's state writes:
iteration_count = tool_context.state.get('loop_iteration', 0) + 1
tool_context.state['loop_iteration'] = iteration_count
tool_context.state[f'iteration_{iteration_count}_status'] = fix_status

# After loop completes, inspect:
print(f"Total iterations: {state.get('loop_iteration')}")
print(f"Iter 1: {state.get('iteration_1_status')}")
print(f"Iter 2: {state.get('iteration_2_status')}")

This helps understand when and why the loop exited.
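The same bookkeeping can be wrapped in two small helpers. In this sketch a plain dict stands in for tool_context.state, and the helper names `record_iteration` and `iteration_history` are hypothetical.

```python
from typing import Any, Dict, List


def record_iteration(state: Dict[str, Any], fix_status: str) -> int:
    """Increment the iteration counter and store this iteration's status."""
    count = state.get("loop_iteration", 0) + 1
    state["loop_iteration"] = count
    state[f"iteration_{count}_status"] = fix_status
    return count


def iteration_history(state: Dict[str, Any]) -> List[str]:
    """Return the recorded statuses in iteration order."""
    total = state.get("loop_iteration", 0)
    return [
        state.get(f"iteration_{i}_status", "UNKNOWN")
        for i in range(1, total + 1)
    ]


state: Dict[str, Any] = {}
record_iteration(state, "PARTIAL")
record_iteration(state, "SUCCESSFUL")
print(iteration_history(state))
```

Reading the history back as a list avoids hard-coding one print statement per iteration.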

Step 5: Wire the Fix Pipeline

👉 Open

code_review_assistant/agent.py

👉 Add the fix pipeline imports (after the existing imports):

from google.adk.agents import LoopAgent  # Add this to the existing Agent, SequentialAgent line
from code_review_assistant.sub_agents.fix_pipeline.code_fixer import code_fixer_agent
from code_review_assistant.sub_agents.fix_pipeline.fix_test_runner import fix_test_runner_agent
from code_review_assistant.sub_agents.fix_pipeline.fix_validator import fix_validator_agent
from code_review_assistant.sub_agents.fix_pipeline.fix_synthesizer import fix_synthesizer_agent

Your imports should now be:

from google.adk.agents import Agent, SequentialAgent, LoopAgent
from .config import config
# Review pipeline imports (from Module 5)
from code_review_assistant.sub_agents.review_pipeline.code_analyzer import code_analyzer_agent
from code_review_assistant.sub_agents.review_pipeline.style_checker import style_checker_agent
from code_review_assistant.sub_agents.review_pipeline.test_runner import test_runner_agent
from code_review_assistant.sub_agents.review_pipeline.feedback_synthesizer import feedback_synthesizer_agent
# Fix pipeline imports (NEW)
from code_review_assistant.sub_agents.fix_pipeline.code_fixer import code_fixer_agent
from code_review_assistant.sub_agents.fix_pipeline.fix_test_runner import fix_test_runner_agent
from code_review_assistant.sub_agents.fix_pipeline.fix_validator import fix_validator_agent
from code_review_assistant.sub_agents.fix_pipeline.fix_synthesizer import fix_synthesizer_agent

👉 Find:

# MODULE_6_STEP_5_CREATE_FIX_LOOP

👉 Replace that single line with:

# Create the fix attempt loop (retries up to 3 times)
fix_attempt_loop = LoopAgent(
    name="FixAttemptLoop",
    sub_agents=[
        code_fixer_agent,      # Step 1: Generate fixes
        fix_test_runner_agent, # Step 2: Validate with tests
        fix_validator_agent    # Step 3: Check success & possibly exit
    ],
    max_iterations=3  # Try up to 3 times
)

# Wrap loop with synthesizer for final report
code_fix_pipeline = SequentialAgent(
    name="CodeFixPipeline",
    description="Automated code fixing pipeline with iterative validation",
    sub_agents=[
        fix_attempt_loop,      # Try to fix (1-3 times)
        fix_synthesizer_agent  # Present final results (always runs once)
    ]
)

Why Wrap Loop with Sequential?

The structure is:

SequentialAgent([
    LoopAgent([fix, test, validate]),  # Runs 1-3 times
    synthesizer_agent                   # Runs once
])

Why not just:

LoopAgent([fix, test, validate, synthesizer])  # Bad!

Because the synthesizer should run ONCE at the end, regardless of how many loop iterations occurred. If it were inside the loop:

After iteration 1: Synthesizes PARTIAL result
After iteration 2: Synthesizes again (redundant)
After iteration 3: Synthesizes final

By wrapping, the synthesizer sees the final state after all iterations complete and creates one comprehensive report.

👉 Remove the existing root_agent definition:

root_agent = Agent(...)

👉 Find:

# MODULE_6_STEP_5_UPDATE_ROOT_AGENT

👉 Replace that single line with:

# Update root agent to include both pipelines
root_agent = Agent(
    name="CodeReviewAssistant",
    model=config.worker_model,
    description="An intelligent code review assistant that analyzes Python code and provides educational feedback",
    instruction="""You are a specialized Python code review assistant focused on helping developers improve their code quality.

When a user provides Python code for review:
1. Immediately delegate to CodeReviewPipeline and pass the code EXACTLY as it was provided by the user.
2. The pipeline will handle all analysis and feedback
3. Return ONLY the final feedback from the pipeline - do not add any commentary

After completing a review, if significant issues were identified:
- If style score < 100 OR tests are failing OR critical issues exist:
  * Add at the end: "\n\n💡 I can fix these issues for you. Would you like me to do that?"
 
- If the user responds yes or requests fixes:
  * Delegate to CodeFixPipeline
  * Return the fix pipeline's complete output AS-IS

When a user asks what you can do or general questions:
- Explain your capabilities for code review and fixing
- Do NOT trigger the pipeline for non-code messages

The pipelines handle everything for code review and fixing - just pass through their final output.""",
    sub_agents=[code_review_pipeline, code_fix_pipeline],
    output_key="assistant_response"
)

Two-Pipeline Architecture

Root Agent
  ├─ CodeReviewPipeline (Module 5)
  │    ├─ CodeAnalyzer
  │    ├─ StyleChecker
  │    ├─ TestRunner
  │    └─ FeedbackSynthesizer
  │
  └─ CodeFixPipeline (Module 6)
       ├─ FixAttemptLoop (LoopAgent, 1-3x)
       │    ├─ CodeFixer
       │    ├─ FixTestRunner
       │    └─ FixValidator (may set escalate)
       │
       └─ FixSynthesizer (runs once after loop)

Why separate pipelines?

Review is read-only, fix modifies code (different concerns)
User might only want review, not fixes
Fixes depend on review results (sequential dependency)
Clear separation makes testing easier

Step 6: Add Fix Synthesizer Agent

The synthesizer creates a user-friendly presentation of fix results after the loop completes.

👉 Open

code_review_assistant/sub_agents/fix_pipeline/fix_synthesizer.py

👉 Find:

# MODULE_6_STEP_6_FIX_SYNTHESIZER_INSTRUCTION_PROVIDER

👉 Replace that single line with:

async def fix_synthesizer_instruction_provider(context: ReadonlyContext) -> str:
    """Dynamic instruction provider that injects state variables."""
    template = """You are responsible for presenting the fix results to the user.

Based on the validation report: {final_fix_report}
Fixed code from state: {code_fixes}
Fix status: {fix_status}

Create a comprehensive yet friendly response that includes:

## 🔧 Fix Summary
[Overall status and key improvements - be specific about what was achieved]

## 📊 Metrics
- Test Results: [original pass rate]% → [new pass rate]%
- Style Score: [original]/100 → [new]/100
- Issues Fixed: X of Y

## ✅ What Was Fixed
[List each fixed issue with brief explanation of the correction made]

## 📝 Complete Fixed Code
[Include the complete, corrected code from state - this is critical]

## 💡 Explanation of Key Changes
[Brief explanation of the most important changes made and why]

[If any issues remain]
## ⚠️ Remaining Issues
[List what still needs manual attention]

## 🎯 Next Steps
[Guidance on what to do next - either use the fixed code or address remaining issues]

Save the fix report using save_fix_report tool before presenting.
Call it with no parameters - it will retrieve the report from state automatically.

Be encouraging about improvements while being honest about any remaining issues.
Focus on the educational aspect - help the user understand what was wrong and how it was fixed.
"""
    return await instructions_utils.inject_session_state(template, context)

👉 Find:

# MODULE_6_STEP_6_FIX_SYNTHESIZER_AGENT

👉 Replace that single line with:

fix_synthesizer_agent = Agent(
    name="FixSynthesizer",
    model=config.critic_model,
    description="Creates comprehensive user-friendly fix report",
    instruction=fix_synthesizer_instruction_provider,
    tools=[FunctionTool(func=save_fix_report)],
    output_key="fix_summary"
)

👉 Add the save_fix_report tool to tools.py

👉 Find:

# MODULE_6_STEP_6_SAVE_FIX_REPORT

👉 Replace with:

async def save_fix_report(tool_context: ToolContext) -> Dict[str, Any]:
    """
    Saves the fix report as an artifact.

    Args:
        tool_context: ADK tool context

    Returns:
        Save status
    """
    logger.info("Tool: Saving fix report...")

    try:
        # Get the report from state
        fix_report = tool_context.state.get(StateKeys.FIX_REPORT, {})

        if not fix_report:
            return {
                "status": "error",
                "message": "No fix report found in state"
            }

        # Convert to JSON
        report_json = json.dumps(fix_report, indent=2)
        report_part = types.Part.from_text(text=report_json)

        # Generate filename
        timestamp = datetime.now().isoformat().replace(':', '-')
        filename = f"fix_report_{timestamp}.json"

        # Try to save as artifact
        if hasattr(tool_context, 'save_artifact'):
            try:
                version = await tool_context.save_artifact(filename, report_part)
                await tool_context.save_artifact("latest_fix_report.json", report_part)

                logger.info(f"Tool: Fix report saved as {filename}")

                return {
                    "status": "success",
                    "filename": filename,
                    "version": str(version),
                    "size": len(report_json)
                }
            except Exception as e:
                logger.warning(f"Could not save as artifact: {e}")

        # Fallback: store in state
        tool_context.state[StateKeys.LAST_FIX_REPORT] = fix_report

        return {
            "status": "success",
            "message": "Fix report saved to state",
            "size": len(report_json)
        }

    except Exception as e:
        logger.error(f"Tool: Failed to save fix report: {e}", exc_info=True)
        return {
            "status": "error",
            "message": str(e)
        }
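The filename step replaces the colons in the ISO timestamp, presumably because colons are not valid in file names on some systems such as Windows. The sketch below isolates that step with a fixed datetime so the result is reproducible; `fix_report_filename` is a hypothetical helper name.

```python
from datetime import datetime


def fix_report_filename(now: datetime) -> str:
    """Build the artifact name the same way save_fix_report does."""
    timestamp = now.isoformat().replace(":", "-")
    return f"fix_report_{timestamp}.json"


print(fix_report_filename(datetime(2025, 1, 15, 14, 30, 5)))
```

For the fixed input above the result is fix_report_2025-01-15T14-30-05.json.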

Why Synthesizer Runs After Loop

The synthesizer is OUTSIDE the loop:

SequentialAgent([
    LoopAgent([...]),  # Runs 1-3 times
    synthesizer        # Runs ONCE after loop exits
])

This means:

It sees the FINAL state after all fix attempts
It knows how many iterations occurred (from state)
It presents one comprehensive report, not per-iteration reports

The instruction template references state keys that accumulate across iterations:

{code_fixes} - last attempt's code
{final_fix_report} - report from last validator run
{fix_status} - SUCCESSFUL/PARTIAL/FAILED

If synthesizer were inside the loop, it would run multiple times with incomplete data.
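To make the placeholder idea concrete, here is a simplified stand-in for the substitution step. It is only an illustration of replacing {key} with values from state; ADK's inject_session_state may handle missing keys and other placeholder forms differently, and `fill_template` is a hypothetical name.

```python
import re
from typing import Any, Mapping

PLACEHOLDER = re.compile(r"\{(\w+)\}")


def fill_template(template: str, state: Mapping[str, Any]) -> str:
    """Replace each {key} with str(state[key]); fail loudly if a key is absent."""
    def replace(match: "re.Match[str]") -> str:
        key = match.group(1)
        if key not in state:
            raise KeyError(f"State key missing: {key}")
        return str(state[key])

    return PLACEHOLDER.sub(replace, template)


state = {
    "code_fixes": "def add(a, b):\n    return a + b",
    "fix_status": "SUCCESSFUL",
}
print(fill_template("Fix status: {fix_status}", state))
```

Failing on a missing key is a design choice in this sketch: it surfaces the case where the synthesizer would run before the validator has written its report.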

Step 7: Test Complete Fix Pipeline

Time to see the entire loop in action.

👉 Start the system:

adk web code_review_assistant

After running the adk web command, you should see output in your terminal indicating that the ADK Web Server has started, similar to this:

+-----------------------------------------------------------------------------+
| ADK Web Server started                                                      |
|                                                                             |
| For local testing, access at http://localhost:8000.                         |
+-----------------------------------------------------------------------------+

INFO:     Application startup complete.
INFO:     Uvicorn running on http://0.0.0.0:8000 (Press CTRL+C to quit)

👉 Test Prompt:

Please analyze the following:
def dfs_search_v1(graph, start, target):
    """Find if target is reachable from start."""
    visited = set()
    stack = start
   
    while stack:
        current = stack.pop()
       
        if current == target:
            return True
           
        if current not in visited:
            visited.add(current)
           
            for neighbor in graph[current]:
                if neighbor not in visited:
                    stack.append(neighbor)
   
    return False

First, submit the buggy code to trigger the review pipeline. After it identifies the flaws, ask the agent to "Please fix the code", which triggers the iterative fix pipeline.

[Image: fix-pipeline-in-action]

1. The Initial Review (Finding the Flaws)

This is the first half of the process. The four-agent review pipeline analyzes the code, checks its style, and runs a generated test suite. It correctly identifies a critical AttributeError and other issues, delivering a verdict: the code is BROKEN, with a test pass rate of only 84.21%.

2. The Automated Fix (The Loop in Action)

This is the most impressive part. When you ask the agent to fix the code, it doesn't just make one change. It kicks off an iterative Fix and Validate Loop that works just like a diligent developer: it tries a fix, tests it thoroughly, and if it's not perfect, it tries again.

Iteration #1: The First Attempt (Partial Success)

The Fix: The CodeFixer agent reads the initial report and makes the most obvious corrections. It changes stack = start to stack = [start] and uses graph.get() to prevent KeyError exceptions.
The Validation: The FixTestRunner immediately re-runs the full test suite against this new code.
The Result: The pass rate improves to 88.89%. The critical bugs are gone. However, the comprehensive tests reveal two new, subtle bugs (regressions) in how the code handles None as a graph and non-list neighbor values. The system marks the fix as PARTIAL.

Iteration #2: The Final Polish (100% Success)

The Fix: Because the loop's exit condition (100% pass rate) was not met, it runs again. The CodeFixer now has more information: the two new regression failures. It generates a final, more robust version of the code that explicitly handles those edge cases.
The Validation: The FixTestRunner executes the test suite one last time against the final version of the code.
The Result: A perfect 100% pass rate. All original bugs and all regressions are resolved. The system marks the fix as SUCCESSFUL and the loop exits.

3. The Final Report: A Perfect Score

With a fully validated fix, the FixSynthesizer agent takes over to present the final report, transforming the technical data into a clear, educational summary.

Metric	Before	After	Improvement
Test Pass Rate	84.21%	100%	▲ 15.79%
Style Score	88 / 100	98 / 100	▲ 10 pts
Bugs Fixed	0 of 3	3 of 3	✅

✅ The Final, Validated Code

Here is the complete, corrected code that now passes all 19 tests, demonstrating the successful fix:

def dfs_search_v1(graph, start, target):
    """Find if target is reachable from start."""
    # Handles 'None' graph input
    if graph is None:
        return False

    visited = set()
    # Fixes the critical AttributeError
    stack = [start]

    while stack:
        current = stack.pop()

        if current == target:
            return True

        if current not in visited:
            visited.add(current)
            
            # Safely gets neighbors to prevent KeyError
            neighbors = graph.get(current)

            if neighbors is None:
                continue
            
            # Validates that neighbors are iterable
            if not isinstance(neighbors, (list, set, tuple)):
                raise TypeError(
                    f"Graph value for node '{current}' is of type "
                    f"{type(neighbors).__name__}. Expected a list, set, or tuple."
                )
            
            for neighbor in neighbors:
                if neighbor not in visited:
                    stack.append(neighbor)

    return False
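A few quick checks make the fixed behavior visible. The function below repeats the validated code (with shortened comments) so the block runs on its own; the sample graphs are made up for illustration and are not the 19 generated tests.

```python
def dfs_search_v1(graph, start, target):
    """Find if target is reachable from start."""
    if graph is None:  # handles a None graph
        return False

    visited = set()
    stack = [start]  # a list, so pop() and append() work

    while stack:
        current = stack.pop()
        if current == target:
            return True
        if current not in visited:
            visited.add(current)
            neighbors = graph.get(current)  # no KeyError for unknown nodes
            if neighbors is None:
                continue
            if not isinstance(neighbors, (list, set, tuple)):
                raise TypeError(
                    f"Graph value for node '{current}' is of type "
                    f"{type(neighbors).__name__}. Expected a list, set, or tuple."
                )
            for neighbor in neighbors:
                if neighbor not in visited:
                    stack.append(neighbor)
    return False


graph = {"A": ["B", "C"], "B": ["D"], "C": [], "D": []}

print(dfs_search_v1(graph, "A", "D"))   # reachable through B
print(dfs_search_v1(graph, "C", "A"))   # no path back to A
print(dfs_search_v1(None, "A", "D"))    # None graph is treated as empty
print(dfs_search_v1({"A": ["Z"]}, "A", "Q"))  # Z has no entry; no KeyError

try:
    dfs_search_v1({"A": 5}, "A", "B")
except TypeError as exc:
    print("TypeError:", exc)
```

The four calls print True, False, False, and False, and the last case raises the TypeError that the validation branch was added to produce.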

👉💻 Once you're done testing, return to your Cloud Shell Editor terminal and press Ctrl+C to stop the ADK Dev UI.

What You've Built

You now have a complete automated fix pipeline that:

✅ Generates fixes - Based on review analysis
✅ Validates iteratively - Tests after each fix attempt
✅ Retries automatically - Up to 3 attempts for success
✅ Exits intelligently - Via escalate when successful
✅ Tracks improvements - Compares before/after metrics
✅ Provides artifacts - Downloadable fix reports

Key Concepts Mastered

LoopAgent vs Sequential:

Sequential: One pass through agents
LoopAgent: Repeats until exit condition or max iterations
Exit via tool_context.actions.escalate = True

State Evolution Across Iterations:

CODE_FIXES updated each iteration
Test results show improvement over time
Validator sees cumulative changes

Multi-Pipeline Architecture:

Review pipeline: Read-only analysis (Module 5)
Fix loop: Iterative correction (Module 6 inner loop)
Fix pipeline: Loop + synthesizer (Module 6 outer)
Root agent: Orchestrates based on user intent

Tools Controlling Flow:

exit_fix_loop() sets escalate
Any tool can signal loop completion
Decouples exit logic from agent instructions

Max Iterations Safety:

Prevents infinite loops
Ensures system always responds
Presents best attempt even if not perfect

What's Next?

In the final module, you'll learn how to deploy your agent to production:

Setting up persistent storage with VertexAiSessionService
Deploying to Agent Engine on Google Cloud
Monitoring and debugging production agents
Best practices for scaling and reliability

You've built a complete multi-agent system with sequential and loop architectures. The patterns you've learned - state management, dynamic instructions, tool orchestration, and iterative refinement - are production-ready techniques used in real agentic systems.

7. Deploying to Production

Introduction

Your code review assistant is now complete with review and fix pipelines working locally. The missing piece: it only runs on your machine. In this module, you'll deploy your agent to Google Cloud, making it accessible to your team with persistent sessions and production-grade infrastructure.

What you'll learn:

Three deployment paths: Local, Cloud Run, and Agent Engine
Automated infrastructure provisioning
Session persistence strategies
Testing deployed agents

Understanding Deployment Options

The ADK supports multiple deployment targets, each with different tradeoffs:

Deployment Paths

Factor	Local (`adk web`)	Cloud Run (`adk deploy cloud_run`)	Agent Engine (`adk deploy agent_engine`)
Complexity	Minimal	Medium	Low
Session Persistence	In-memory only (lost on restart)	Cloud SQL (PostgreSQL)	Vertex AI managed (automatic)
Infrastructure	None (dev machine only)	Container + Database	Fully managed
Cold Start	N/A	100-2000ms	100-500ms
Scaling	Single instance	Automatic (to zero)	Automatic
Cost Model	Free (local compute)	Request-based + free tier	Compute-based
UI Support	Yes (via `adk web`)	Yes (via `--with_ui`)	No (API only)
Best For	Development/testing	Variable traffic, cost control	Production agents

Additional deployment option: Google Kubernetes Engine (GKE) is available for advanced users requiring Kubernetes-level control, custom networking, or multi-service orchestration. GKE deployment is not covered in this codelab but is documented in the ADK deployment guide.

What Gets Deployed

When deploying to Cloud Run or Agent Engine, the following is packaged and deployed:

Your agent code (agent.py, all sub-agents, tools)
Dependencies (requirements.txt)
ADK API server (automatically included)
Web UI (Cloud Run only, when --with_ui specified)

Important differences:

Cloud Run: Uses adk deploy cloud_run CLI (builds container automatically) or gcloud run deploy (requires custom Dockerfile)
Agent Engine: Uses adk deploy agent_engine CLI (no container building needed, directly packages Python code)

Step 1: Configure Your Environment

Configure Your `.env` File

Your .env file (created in Module 3) needs updates for cloud deployment. Open .env and verify/update these settings:

Required for all cloud deployments:

# Your actual GCP Project ID (REQUIRED)
GOOGLE_CLOUD_PROJECT=your-project-id

# GCP region for deployments (REQUIRED)
GOOGLE_CLOUD_LOCATION=us-central1

# Use Vertex AI (REQUIRED)
GOOGLE_GENAI_USE_VERTEXAI=true

# Model configuration (already set)
WORKER_MODEL=gemini-2.5-flash
CRITIC_MODEL=gemini-2.5-pro

Set bucket names (REQUIRED before running deploy.sh):

The deployment script creates buckets based on these names. Set them now:

# Staging bucket for Agent Engine code uploads (REQUIRED for agent-engine)
STAGING_BUCKET=gs://your-project-id-staging

# Artifact storage for reports and fixed code (REQUIRED for both cloud-run and agent-engine)
ARTIFACT_BUCKET=gs://your-project-id-artifacts

Replace your-project-id with your actual project ID in both bucket names. The script will create these buckets if they don't exist.
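Before running the deployment script, it can help to confirm that the settings above are filled in. The following is a minimal sketch for an Agent Engine deployment; it works on a plain dictionary (for example `dict(os.environ)` after loading .env), and `check_env` is a hypothetical helper that is not part of deploy.sh.

```python
from typing import Dict, List

REQUIRED_VARS = [
    "GOOGLE_CLOUD_PROJECT",
    "GOOGLE_CLOUD_LOCATION",
    "GOOGLE_GENAI_USE_VERTEXAI",
    "STAGING_BUCKET",
    "ARTIFACT_BUCKET",
]
BUCKET_VARS = ["STAGING_BUCKET", "ARTIFACT_BUCKET"]


def check_env(env: Dict[str, str]) -> List[str]:
    """Return a list of human-readable problems; empty means the check passed."""
    problems: List[str] = []
    for name in REQUIRED_VARS:
        if not env.get(name, "").strip():
            problems.append(f"{name} is not set")
    for name in BUCKET_VARS:
        value = env.get(name, "").strip()
        if not value:
            continue  # already reported as missing
        if not value.startswith("gs://"):
            problems.append(f"{name} should start with gs://")
        if "your-project-id" in value:
            problems.append(f"{name} still contains the placeholder project ID")
    return problems


sample_env = {
    "GOOGLE_CLOUD_PROJECT": "demo-project",
    "GOOGLE_CLOUD_LOCATION": "us-central1",
    "GOOGLE_GENAI_USE_VERTEXAI": "true",
    "STAGING_BUCKET": "gs://demo-project-staging",
    "ARTIFACT_BUCKET": "demo-project-artifacts",  # missing gs:// prefix
}
print(check_env(sample_env))
```

The sample deliberately omits the gs:// prefix on one bucket so that the check reports a single problem.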

Optional variables (created automatically if blank):

# Agent Engine ID (populated after first deployment)
AGENT_ENGINE_ID=

# Cloud Run Database credentials (created automatically if blank)
CLOUD_SQL_INSTANCE_NAME=
DB_USER=
DB_PASSWORD=
DB_NAME=

Authentication Check

If you encounter authentication errors during deployment:

gcloud auth application-default login
gcloud config set project $GOOGLE_CLOUD_PROJECT

Step 2: Understanding the Deployment Script

The deploy.sh script provides a unified interface for all deployment modes:

./deploy.sh {local|cloud-run|agent-engine}

Script Capabilities

Infrastructure provisioning:

API enablement (AI Platform, Storage, Cloud Build, Cloud Trace, Cloud SQL)
IAM permission configuration (service accounts, roles)
Resource creation (buckets, databases, instances)
Deployment with proper flags
Post-deployment verification

Key Script Sections

Configuration (lines 1-35): Project, region, service names, defaults
Helper Functions (lines 37-200): API enablement, bucket creation, IAM setup
Main Logic (lines 202-400): Mode-specific deployment orchestration

Step 3: Prepare Agent for Agent Engine

Before deploying to Agent Engine, an agent_engine_app.py file is needed that wraps your agent for the managed runtime. This has been created for you already.

View `code_review_assistant/agent_engine_app.py`

👉 Open file:

"""
Agent Engine application wrapper.
This file prepares the agent for deployment to Vertex AI Agent Engine.
"""

from vertexai import agent_engines
from .agent import root_agent

# Wrap the agent in an AdkApp object for Agent Engine deployment
app = agent_engines.AdkApp(
    agent=root_agent,
    enable_tracing=True,
)

Step 4: Deploy to Agent Engine

Agent Engine is the recommended production deployment for ADK agents because it provides:

Fully managed infrastructure (no containers to build)
Built-in session persistence via VertexAiSessionService
Automatic scaling from zero
Cloud Trace integration enabled by default

How Agent Engine Differs from Other Deployments

Under the hood, deploy.sh agent-engine uses:

adk deploy agent_engine \
  --project=$GOOGLE_CLOUD_PROJECT \
  --region=$GOOGLE_CLOUD_LOCATION \
  --staging_bucket=$STAGING_BUCKET \
  --display_name="Code Review Assistant" \
  --trace_to_cloud \
  code_review_assistant

This command:

Packages your Python code directly (no Docker build)
Uploads to the staging bucket you specified in .env
Creates a managed Agent Engine instance
Enables Cloud Trace for observability
Uses agent_engine_app.py to configure the runtime

Unlike Cloud Run which containerizes your code, Agent Engine runs your Python code directly in a managed runtime environment, similar to serverless functions.

Run the Deployment

From your project root:

./deploy.sh agent-engine

Deployment Phases

Watch the script execute these phases:

Phase 1: API Enablement
  ✓ aiplatform.googleapis.com
  ✓ storage-api.googleapis.com
  ✓ cloudbuild.googleapis.com
  ✓ cloudtrace.googleapis.com

Phase 2: IAM Setup
  ✓ Getting project number
  ✓ Granting Storage Object Admin
  ✓ Granting AI Platform User
  ✓ Granting Cloud Trace Agent

Phase 3: Staging Bucket
  ✓ Creating gs://your-project-id-staging
  ✓ Setting permissions

Phase 4: Artifact Bucket
  ✓ Creating gs://your-project-id-artifacts
  ✓ Configuring access

Phase 5: Validation
  ✓ Checking agent.py exists
  ✓ Verifying root_agent defined
  ✓ Checking agent_engine_app.py exists
  ✓ Validating requirements.txt

Phase 6: Build & Deploy
  ✓ Packaging agent code
  ✓ Uploading to staging bucket
  ✓ Creating Agent Engine instance
  ✓ Configuring session persistence
  ✓ Setting up Cloud Trace integration
  ✓ Running health checks

This process takes 5-10 minutes as it packages the agent and deploys it to Vertex AI infrastructure.

Save Your Agent Engine ID

Upon successful deployment:

✅ Deployment successful!
   Agent Engine ID: 7917477678498709504
   Resource Name: projects/123456789/locations/us-central1/reasoningEngines/7917477678498709504
   Endpoint: https://us-central1-aiplatform.googleapis.com/v1/...

⚠️  IMPORTANT: Save this in your .env file:
   AGENT_ENGINE_ID=7917477678498709504

Update your .env file immediately:

echo "AGENT_ENGINE_ID=7917477678498709504" >> .env

This ID is required for:

Testing the deployed agent
Updating the deployment later
Accessing logs and traces
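The deployment output shows both the numeric ID and the full resource name, and the two are related by a fixed path pattern. The sketch below converts between them using the example values printed above; the helper names are hypothetical.

```python
def agent_engine_resource_name(project_number: str, location: str, engine_id: str) -> str:
    """Build the resource name in the format shown in the deployment output."""
    return (
        f"projects/{project_number}/locations/{location}"
        f"/reasoningEngines/{engine_id}"
    )


def engine_id_from_resource_name(resource_name: str) -> str:
    """Extract the trailing ID segment from a resource name."""
    return resource_name.rsplit("/", 1)[-1]


name = agent_engine_resource_name("123456789", "us-central1", "7917477678498709504")
print(name)
print(engine_id_from_resource_name(name))
```

Storing only the ID in .env and deriving the resource name keeps the project and region settings in one place.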

What Was Deployed

Your Agent Engine deployment now includes:

✅ Complete review pipeline (4 agents)
✅ Complete fix pipeline (loop + synthesizer)
✅ All tools (AST analysis, style checking, artifact generation)
✅ Session persistence (automatic via VertexAiSessionService)
✅ State management (session/user/lifetime tiers)
✅ Observability (Cloud Trace enabled)
✅ Auto-scaling infrastructure

Step 5: Test Your Deployed Agent

Update Your `.env` File

After deployment, verify your .env includes:

AGENT_ENGINE_ID=7917477678498709504  # From deployment output
GOOGLE_CLOUD_PROJECT=your-project-id
GOOGLE_CLOUD_LOCATION=us-central1
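
One way to confirm these values before running the test is to parse the file and assemble the resource name from them. The sketch below is illustrative only: `parse_env`, `missing_keys`, and `resource_name` are hypothetical helpers, not part of the project. It assumes a simple KEY=VALUE file in which `#` starts a comment. It also places the project ID where the deployment output shows the project number.

```python
REQUIRED_KEYS = ("AGENT_ENGINE_ID", "GOOGLE_CLOUD_PROJECT", "GOOGLE_CLOUD_LOCATION")


def parse_env(text: str) -> dict[str, str]:
    """Parse simple KEY=VALUE lines, ignoring blanks and '#' comments."""
    values = {}
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop trailing comments
        if not line or "=" not in line:
            continue
        key, value = line.split("=", 1)
        values[key.strip()] = value.strip().strip('"')
    return values


def missing_keys(env: dict[str, str]) -> list[str]:
    """Return the required keys that are absent or empty."""
    return [key for key in REQUIRED_KEYS if not env.get(key)]


def resource_name(env: dict[str, str]) -> str:
    """Assemble the reasoningEngines resource name from the parsed values."""
    return (
        f"projects/{env['GOOGLE_CLOUD_PROJECT']}"
        f"/locations/{env['GOOGLE_CLOUD_LOCATION']}"
        f"/reasoningEngines/{env['AGENT_ENGINE_ID']}"
    )


sample = """\
AGENT_ENGINE_ID=7917477678498709504  # From deployment output
GOOGLE_CLOUD_PROJECT=your-project-id
GOOGLE_CLOUD_LOCATION=us-central1
"""
env = parse_env(sample)
print(missing_keys(env))
print(resource_name(env))
```

An empty list from `missing_keys` means all three values are present. Anything else names the keys to add before running the test script.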

Run the Test Script

The project includes tests/test_agent_engine.py specifically for testing Agent Engine deployments:

python tests/test_agent_engine.py

What the Test Does

Authenticates with your Google Cloud project
Creates a session with the deployed agent
Sends a code review request (the DFS bug example)
Streams the response back via Server-Sent Events (SSE)
Verifies session persistence and state management

Expected Output

Authenticated with project: your-project-id
Targeting Agent Engine: projects/.../reasoningEngines/7917477678498709504

Creating new session...
Created session: 4857885913439920384

Sending query to agent and streaming response:
data: {"content": {"parts": [{"text": "I'll analyze your code..."}]}}
data: {"content": {"parts": [{"text": "**Code Structure Analysis**\n..."}]}}
data: {"content": {"parts": [{"text": "**Style Check Results**\n..."}]}}
data: {"content": {"parts": [{"text": "**Test Results**\n..."}]}}
data: {"content": {"parts": [{"text": "**Final Feedback**\n..."}]}}

Stream finished.
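
To work with the streamed events programmatically rather than read them by eye, the `data:` lines can be decoded one at a time. The following is a minimal sketch, assuming each event is a single `data:` line carrying a JSON object shaped like the output above. `extract_text` is a hypothetical helper, not a function from the test script or from ADK.

```python
import json


def extract_text(sse_lines):
    """Yield the text of every part found in 'data: {...}' lines."""
    for line in sse_lines:
        if not line.startswith("data:"):
            continue  # skip blank separator lines and non-data fields
        payload = line[len("data:"):].strip()
        try:
            event = json.loads(payload)
        except json.JSONDecodeError:
            continue  # ignore payloads that are not valid JSON
        if not isinstance(event, dict):
            continue
        content = event.get("content") or {}
        for part in content.get("parts") or []:
            text = part.get("text") if isinstance(part, dict) else None
            if text:
                yield text


stream = [
    'data: {"content": {"parts": [{"text": "I\'ll analyze your code..."}]}}',
    "",
    'data: {"content": {"parts": [{"text": "**Final Feedback**\\n..."}]}}',
]
chunks = list(extract_text(stream))
print(chunks)
```

Joining the chunks in order reconstructs the agent's full reply as it was streamed.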

Verification Checklist

✅ Full review pipeline executes (all 4 agents)
✅ Streaming response shows progressive output
✅ Session state persists across requests
✅ No authentication or connection errors
✅ Tool calls execute successfully (AST analysis, style checking)
✅ Artifacts are saved (grading report accessible)

Alternative: Deploy to Cloud Run

While Agent Engine is recommended for streamlined production deployment, Cloud Run offers more control and supports the ADK web UI. This section provides an overview.

When to Use Cloud Run

Choose Cloud Run if you need:

The ADK web UI for user interaction
Full control over the container environment
Custom database configurations
Integration with existing Cloud Run services

How Cloud Run Deployment Works

Under the hood, deploy.sh cloud-run uses:

adk deploy cloud_run \
  --project=$GOOGLE_CLOUD_PROJECT \
  --region=$GOOGLE_CLOUD_LOCATION \
  --service_name="code-review-assistant" \
  --app_name="code_review_assistant" \
  --port=8080 \
  --with_ui \
  --artifact_service_uri="gs://$ARTIFACT_BUCKET" \
  --trace_to_cloud \
  code_review_assistant

This command:

Builds a Docker container with your agent code
Pushes to Google Artifact Registry
Deploys as a Cloud Run service
Includes the ADK web UI (--with_ui)
Configures Cloud SQL connection (added by script after initial deployment)

The key difference from Agent Engine: Cloud Run containerizes your code and requires a database for session persistence, while Agent Engine handles both automatically.

Cloud Run Deployment Command

./deploy.sh cloud-run

What's Different

Infrastructure:

Containerized deployment (Docker built automatically by ADK)
Cloud SQL (PostgreSQL) for session persistence
Database auto-created by script or uses existing instance

Session Management:

Uses DatabaseSessionService instead of VertexAiSessionService
Requires database credentials in .env (or auto-generated)
State persists in PostgreSQL database

UI Support:

Web UI available via --with_ui flag (handled by script)
Access at: https://code-review-assistant-xyz.a.run.app

What You've Accomplished

Your production deployment includes:

✅ Automated provisioning via deploy.sh script
✅ Managed infrastructure (Agent Engine handles scaling, persistence, monitoring)
✅ Persistent state across all memory tiers (session/user/lifetime)
✅ Secure credential management (automatic generation and IAM setup)
✅ Scalable architecture (zero to thousands of concurrent users)
✅ Built-in observability (Cloud Trace integration enabled)
✅ Production-grade error handling and recovery

Key Concepts Mastered

Deployment Preparation:

agent_engine_app.py: Wraps agent with AdkApp for Agent Engine
AdkApp automatically configures VertexAiSessionService for persistence
Tracing enabled via enable_tracing=True

Deployment Commands:

adk deploy agent_engine: Packages Python code, no containers
adk deploy cloud_run: Builds Docker container automatically
gcloud run deploy: Alternative with custom Dockerfile

Deployment Options:

Agent Engine: Fully managed, fastest to production
Cloud Run: More control, supports web UI
GKE: Advanced Kubernetes control (see GKE deployment guide)

Managed Services:

Agent Engine handles session persistence automatically
Cloud Run requires database setup (or auto-created)
Both support artifact storage via GCS

Session Management:

Agent Engine: VertexAiSessionService (automatic)
Cloud Run: DatabaseSessionService (Cloud SQL)
Local: InMemorySessionService (ephemeral)

Your Agent Is Live

Your code review assistant is now:

Accessible via HTTPS API endpoints
Persistent with state surviving restarts
Scalable to handle team growth automatically
Observable with complete request traces
Maintainable through scripted deployments

What's Next? In Module 8, you'll learn to use Cloud Trace to understand your agent's performance, identify bottlenecks in the review and fix pipelines, and optimize execution times.

8. Production Observability

Introduction

Your code review assistant is now deployed and running in production on Agent Engine. But how do you know it's working well? Can you answer these critical questions:

Is the agent responding quickly enough?
Which operations are slowest?
Are the fix loops completing efficiently?
Where are performance bottlenecks?

Without observability, you're operating blind. The --trace_to_cloud flag used during deployment automatically enabled Cloud Trace, giving you visibility into every request your agent processes. This transforms debugging from guesswork into forensic analysis.

In this module, you'll learn to read traces, understand your agent's performance characteristics, and identify areas for optimization based on hard evidence.

Understanding Traces and Spans

What is a Trace?

A trace is the complete timeline of your agent handling a single request. It captures everything from when a user sends a query until the final response is delivered. Each trace shows:

Total duration of the request
All operations that executed
How operations relate to each other (parent-child relationships)
When each operation started and ended

What is a Span?

A span represents a single unit of work within a trace. Common span types in your code review assistant:

agent_run: Execution of an agent (root agent or sub-agent)
call_llm: Request to a language model
execute_tool: Tool function execution
state_read / state_write: State management operations
code_executor: Running code with tests

Spans have:

Name: What operation this represents
Duration: How long it took
Attributes: Metadata like model name, token counts, inputs/outputs
Status: Success or failure
Parent/child relationships: Which operations triggered which
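
The span properties listed above map naturally onto a small tree structure. The sketch below is a toy model for building intuition, not Cloud Trace's or ADK's actual data model. The `Span` class, the `render` helper, and every duration and attribute value are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Span:
    name: str                      # what operation this represents
    duration_s: float              # how long it took, in seconds
    status: str = "OK"             # success or failure
    attributes: dict = field(default_factory=dict)        # metadata
    children: list["Span"] = field(default_factory=list)  # nested operations


def render(span: Span, depth: int = 0) -> list[str]:
    """Return an indented outline of a span and all of its descendants."""
    lines = [f"{'  ' * depth}{span.name} ({span.duration_s:.1f}s, {span.status})"]
    for child in span.children:
        lines.extend(render(child, depth + 1))
    return lines


# Hypothetical trace: one agent run that makes an LLM call and a tool call.
trace = Span(
    "agent_run",
    6.0,
    children=[
        Span("call_llm", 4.5, attributes={"model": "gemini-2.5-flash"}),
        Span("execute_tool", 0.1),
    ],
)
outline = render(trace)
print("\n".join(outline))
```

The indentation in the outline mirrors the nesting you see in the Cloud Trace waterfall view: each child sits beneath the operation that triggered it.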

Automatic Instrumentation

When you deployed with --trace_to_cloud, ADK automatically instruments:

Every agent invocation and sub-agent call
All LLM requests with token counts
Tool executions with inputs/outputs
State operations (read/write)
Loop iterations in your fix pipeline
Error conditions and retries

No code changes required - tracing is built into ADK's runtime.

Step 1: Access Cloud Trace Explorer

Open Cloud Trace in your Google Cloud Console:

Navigate to Cloud Trace Explorer
Select your project from the dropdown (should be pre-selected)
You should see traces from your test in Module 7

If you don't see traces yet:

The test you ran in Module 7 should have generated traces. If the list is empty, generate some trace data:

python tests/test_agent_engine.py

Wait 1-2 minutes for traces to appear in the console.

What You're Looking At

The Trace Explorer shows:

List of traces: Each row represents one complete request
Timeline: When requests occurred
Duration: How long each request took
Request details: Timestamp, latency, span count

This is your production traffic log - every interaction with your agent creates a trace.

Step 2: Examine a Review Pipeline Trace

Click on any trace in the list to open the waterfall view.

You'll see a Gantt chart showing the complete execution timeline. The root invocation span represents the entire request. Nested under it are spans for each sub-agent, tool, and LLM call.

Reading the Waterfall: Identifying Bottlenecks

Each bar represents a span. Its horizontal position shows when it started, and its length shows how long it took. This immediately reveals where your agent is spending its time.

Key insights from an example review trace:

Total latency: The entire request took 2 minutes and 28 seconds.
Sub-agent breakdown:
- Code Analyzer: 4.7 seconds
- Style Checker: 5.3 seconds
- Test Runner: 1 minute and 28 seconds
- Feedback Synthesizer: 47.9 seconds
Critical Path Analysis: The Test Runner agent is the clear performance bottleneck, accounting for approximately 59% of the total request time (88 of 148 seconds).

This visibility is powerful. Rather than guessing where time is spent, you have concrete evidence that if you need to optimize for latency, the Test Runner is the obvious target.
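
The same bottleneck arithmetic can be reproduced from the durations quoted above. This is a minimal sketch: the numbers come from the example trace, while `to_seconds` and `latency_shares` are hypothetical helper names introduced for illustration.

```python
def to_seconds(minutes: int = 0, seconds: float = 0.0) -> float:
    """Convert a minutes/seconds pair into seconds."""
    return minutes * 60 + seconds


def latency_shares(durations: dict[str, float], total: float) -> dict[str, float]:
    """Return each agent's share of total latency as a percentage."""
    return {name: round(100 * d / total, 1) for name, d in durations.items()}


total = to_seconds(2, 28)  # whole request: 148 seconds
durations = {
    "Code Analyzer": 4.7,
    "Style Checker": 5.3,
    "Test Runner": to_seconds(1, 28),  # 88 seconds
    "Feedback Synthesizer": 47.9,
}

shares = latency_shares(durations, total)
bottleneck = max(shares, key=shares.get)
print(bottleneck, shares[bottleneck])
```

The four sub-agents add up to slightly less than the total. The remaining couple of seconds is time spent outside the sub-agent spans.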

Inspecting Token Usage for Cost Optimization

Cloud Trace doesn't just show time; it also reveals costs by capturing token usage for every LLM call.

Click on a call_llm span within the trace. In the details pane, you will find attributes for llm.usage.prompt_tokens and llm.usage.completion_tokens.

This allows you to:

Track costs at a granular level: See exactly how many tokens each agent and tool is consuming.
Identify optimization opportunities: If an agent is using a surprisingly high number of tokens, it may be an opportunity to refine its prompt or switch to a smaller, more cost-effective model for that specific task.
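
If you copy span attributes out of the console, the per-agent totals can be tallied with a few lines of code. The sketch below assumes spans are represented as plain dictionaries with an `agent` label and an `attributes` map. That shape, the `tokens_by_agent` helper, and all token counts are hypothetical. Only the two attribute keys come from the trace details described above.

```python
from collections import defaultdict

PROMPT_KEY = "llm.usage.prompt_tokens"
COMPLETION_KEY = "llm.usage.completion_tokens"


def tokens_by_agent(spans):
    """Sum prompt and completion tokens per agent, skipping non-LLM spans."""
    totals = defaultdict(lambda: {"prompt": 0, "completion": 0})
    for span in spans:
        attrs = span.get("attributes", {})
        if PROMPT_KEY not in attrs and COMPLETION_KEY not in attrs:
            continue  # tool and state spans carry no token usage
        bucket = totals[span["agent"]]
        bucket["prompt"] += int(attrs.get(PROMPT_KEY, 0))
        bucket["completion"] += int(attrs.get(COMPLETION_KEY, 0))
    return dict(totals)


# Hypothetical spans for illustration only.
spans = [
    {"agent": "Code Analyzer", "name": "call_llm",
     "attributes": {PROMPT_KEY: 320, COMPLETION_KEY: 80}},
    {"agent": "Feedback Synthesizer", "name": "call_llm",
     "attributes": {PROMPT_KEY: 1500, COMPLETION_KEY: 400}},
    {"agent": "Feedback Synthesizer", "name": "call_llm",
     "attributes": {PROMPT_KEY: 200, COMPLETION_KEY: 50}},
    {"agent": "Code Analyzer", "name": "execute_tool", "attributes": {}},
]

totals = tokens_by_agent(spans)
heaviest = max(totals, key=lambda agent: sum(totals[agent].values()))
print(heaviest, totals[heaviest])
```

Sorting agents by total tokens gives a quick shortlist of prompts worth trimming first.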

Step 3: Analyze a Fix Pipeline Trace

The fix pipeline is more complex because it includes a LoopAgent . Cloud Trace makes it easy to understand this iterative behavior.

Find a trace that includes "FixAttemptLoop" in the span names.

If you don't have one, run the test script and respond affirmatively when asked if you want to fix the code.

Examining Loop Structure

The trace view clearly visualizes the loop's execution. If the fix loop ran two times before succeeding, you'll see two loop_iteration spans nested under the FixAttemptLoop span, each containing a full cycle of the CodeFixer, FixTestRunner, and FixValidator agents.

Key Observations from the Loop Trace:

Iterative Refinement is Visible: You can see the system attempt a fix in loop_iteration: 1, validate it, and then, because it wasn't perfect, try again in loop_iteration: 2.
Convergence is Measurable: You can compare the duration and results of each iteration to understand how the system converged to a correct solution.
Debugging is Simplified: If a loop runs for the maximum number of iterations and still fails, you can inspect the state and agent behavior within each iteration's span to diagnose why the fixes weren't converging.

This level of detail is invaluable for understanding and debugging the behavior of complex, stateful loops in production.
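
The convergence observations above can be condensed into a small summary per loop run. The following is a rough sketch, assuming each iteration has been reduced by hand to a duration and a pass/fail result from the validator. The dictionary shape, the `summarize_loop` helper, the sample durations, and the iteration limit of 3 are all hypothetical.

```python
def summarize_loop(iterations, max_iterations):
    """Summarize one fix-loop run from its per-iteration records."""
    attempts = len(iterations)
    converged = attempts > 0 and bool(iterations[-1]["validated"])
    return {
        "attempts": attempts,
        "converged": converged,
        # True when the loop exhausted its budget without a valid fix.
        "hit_limit": attempts >= max_iterations and not converged,
        "durations_s": [it["duration_s"] for it in iterations],
    }


# Hypothetical run: first attempt fails validation, second one passes.
run = [
    {"index": 1, "duration_s": 41.0, "validated": False},
    {"index": 2, "duration_s": 48.5, "validated": True},
]
summary = summarize_loop(run, max_iterations=3)
print(summary)
```

A run with `hit_limit` set to true is the case worth opening in the trace view, since every iteration's span then shows a failed attempt to inspect.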

Step 4: What You've Discovered

Performance Patterns

From examining traces, you now have data-driven insights:

Review pipeline:

Primary Bottleneck: The Test Runner agent, specifically its code execution and LLM-based test generation, is the most time-consuming part of the review.
Fast Operations: Deterministic tools (analyze_code_structure) and state management operations are extremely fast and not a performance concern.

Fix pipeline:

Convergence Rate : You can see that most fixes complete in 1-2 iterations, confirming the loop architecture is effective.
Progressive Cost : Later iterations may take longer as the LLM context grows with information from previous failed attempts.

Cost Drivers:

Token Consumption : You can pinpoint which agents (like the synthesizers) require the most tokens and decide if using a more powerful but expensive model is justified for that task.

Where to Look for Issues

When reviewing traces in production, watch for:

Unusually long traces: A sign of a performance regression or an unexpected loop behavior.
Failed spans (marked in red): Pinpoints the exact operation that failed.
Excessive loop iterations (>2): May indicate a problem with the fix generation logic.
High token counts: Highlights opportunities for prompt optimization or model selection changes.
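
The checklist above can be turned into a simple triage routine for exported or hand-copied trace summaries. This is only a sketch. The dictionary shape, the `triage` helper, and the duration and token thresholds are hypothetical and should be tuned to your own baseline. The iteration limit of 2 follows the guidance above.

```python
def triage(trace, max_duration_s=180.0, max_iterations=2, max_tokens=5000):
    """Return a list of human-readable warnings for one trace summary."""
    findings = []
    if trace["duration_s"] > max_duration_s:
        findings.append(f"unusually long trace ({trace['duration_s']:.0f}s)")

    failed = [s["name"] for s in trace["spans"] if s.get("status") == "ERROR"]
    if failed:
        findings.append(f"failed spans: {', '.join(failed)}")

    iterations = sum(1 for s in trace["spans"] if s["name"] == "loop_iteration")
    if iterations > max_iterations:
        findings.append(f"excessive loop iterations ({iterations})")

    tokens = sum(s.get("tokens", 0) for s in trace["spans"])
    if tokens > max_tokens:
        findings.append(f"high token count ({tokens})")
    return findings


# Hypothetical trace summary for illustration only.
trace = {
    "duration_s": 212.0,
    "spans": [
        {"name": "loop_iteration", "status": "OK", "tokens": 900},
        {"name": "loop_iteration", "status": "OK", "tokens": 1100},
        {"name": "loop_iteration", "status": "ERROR", "tokens": 1300},
    ],
}
findings = triage(trace)
print(findings)
```

An empty list means the trace passed every check. Each entry otherwise points to one of the four warning signs to investigate in the waterfall view.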

What You've Learned

Through Cloud Trace, you now understand how to:

✅ Visualize request flow : See the complete execution path through your sequential and loop-based pipelines.
✅ Identify performance bottlenecks : Use the waterfall chart to find the slowest operations with hard data.
✅ Analyze loop behavior : Observe how iterative agents converge on a solution over multiple attempts.
✅ Track token costs : Inspect LLM spans to monitor and optimize token consumption at a granular level.

Key Concepts Mastered

Traces and Spans: The fundamental units of observability, representing requests and the operations within them.
Waterfall Analysis: Reading Gantt charts to understand execution time and dependencies.
Critical Path Identification: Finding the sequence of operations that determines the overall latency.
Granular Observability: Having visibility into not just time but also metadata like token counts for every operation, automatically instrumented by the ADK.

What's Next?

Continue exploring Cloud Trace:

Monitor traces regularly to catch issues early
Compare traces to identify performance regressions
Use trace data to inform optimization decisions
Filter by duration to find slow requests

Advanced observability (optional):

Export traces to BigQuery for complex analysis
Create custom dashboards in Cloud Monitoring
Set up alerts for performance degradation
Correlate traces with application logs

9. Conclusion: From Prototype to Production

What You've Built

You started with just seven lines of code and built a production-grade AI agent system:

# Where we started (7 lines)
agent = Agent(
    model="gemini-2.5-flash",
    instruction="Review Python code for issues"
)

# Where we ended (production system)
- Two distinct multi-agent pipelines (review and fix) built from 8 specialized agents.
- An iterative fix loop architecture for automated validation and retries.
- Real AST-based code analysis tools for deterministic, accurate feedback.
- Robust state management using the "constants pattern" for type-safe communication.
- Fully automated deployment to a managed, scalable cloud infrastructure.
- Complete, built-in observability with Cloud Trace for production monitoring.

Key Architectural Patterns Mastered

Pattern	Implementation	Production Impact
Tool Integration	AST analysis, style checking	Real validation, not just LLM opinions
Sequential Pipelines	Review → Fix workflows	Predictable, debuggable execution
Loop Architecture	Iterative fixing with exit conditions	Self-improving until success
State Management	Constants pattern, three-tier memory	Type-safe, maintainable state handling
Production Deployment	Agent Engine via deploy.sh	Managed, scalable infrastructure
Observability	Cloud Trace integration	Full visibility into production behavior

Production Insights from Traces

Your Cloud Trace data revealed critical insights:
✅ Bottleneck identified : TestRunner's LLM calls dominate latency
✅ Tool performance : AST analysis executes in 100ms (excellent)
✅ Success rate : Fix loops typically converge within 1-2 iterations
✅ Token usage : ~600 tokens per review, ~1800 for fixes

These insights drive continuous improvement.

Clean Up Resources (Optional)

If you're done experimenting and want to avoid charges:

Delete Agent Engine deployment:

import vertexai

client = vertexai.Client(  # For service interactions via client.agent_engines
    project="PROJECT_ID",
    location="LOCATION",
)

RESOURCE_NAME = "projects/{PROJECT_ID}/locations/{LOCATION}/reasoningEngines/{RESOURCE_ID}"

client.agent_engines.delete(
    name=RESOURCE_NAME,
    force=True, # Optional, if the agent has resources (e.g. sessions, memory)
)

Delete Cloud Run service (if created):

gcloud run services delete code-review-assistant \
    --region=$GOOGLE_CLOUD_LOCATION \
    --quiet

Delete Cloud SQL instance (if created):

gcloud sql instances delete your-project-db \
    --quiet

Clean up storage buckets:

gsutil -m rm -r gs://your-project-id-staging
gsutil -m rm -r gs://your-project-id-artifacts

Next Steps

With your foundation complete, consider these enhancements:

Add more languages: Extend tools to support JavaScript, Go, Java
Integrate with GitHub: Automatic PR reviews
Implement caching: Reduce latency for common patterns
Add specialized agents: Security scanning, performance analysis
Enable A/B testing: Compare different models and prompts
Export metrics: Send traces to specialized observability platforms

Key Takeaways

Start simple, iterate fast: Seven lines to production in manageable steps
Tools over prompts: Real AST analysis beats "please check for bugs"
State management matters: Constants pattern prevents typo bugs
Loops need exit conditions: Always set max iterations and escalation
Deploy with automation: deploy.sh handles all the complexity
Observability is non-negotiable: You can't improve what you can't measure

Your Journey Continues

You've built more than a code review assistant—you've mastered the patterns for building any production AI agent:
✅ Complex workflows with multiple specialized agents
✅ Real tool integration for genuine capabilities
✅ Production deployment with proper observability
✅ State management for maintainable systems

These patterns scale from simple assistants to complex autonomous systems. The foundation you've built here will serve you well as you tackle increasingly sophisticated agent architectures.

Welcome to production AI agent development. Your code review assistant is just the beginning.