
IGNY8 VPS (Salman) ea9125b805 Automation revamp part 1

2025-12-28 01:46:27 +00:00

User: Properly read the docs and the codebase related to the automation page.

Automation (manual run for the automation wizard on the automation page)

Wrong queue items, missing queue items, progress bar not progressing properly, total-in-queue and processed counts buggy for many stages, stage card metrics not real-time and not optimized. It should be more robust and user friendly.

We need a full manual-run progress bar covering all stages, which should always keep showing unless it is 100% complete, and should show below the current per-stage progress bar.

So it has to be added along with all the other issues.

Make a master plan: a no-brainer, extremely user-friendly, magical workflow with tracking and the most logical statuses, and counts dynamically and correctly updating. Currently it's all messed up.

So plan it all and prove there is no better AI model than you to use as of today. Don't include the code, but make it clear enough that even if I give you the plan in any next session you can easily implement it.

🚀 AUTOMATION PAGE MASTER FIX PLAN

Executive Summary

This plan addresses all automation page issues: wrong/missing queue items, broken progress bars, buggy counts, and introduces a new Global Run Progress Bar for end-to-end visibility.

📊 IDENTIFIED ROOT CAUSES

1. Status Mismatches (Critical Bug)

Stage	Backend View (`pipeline_overview`)	Backend Service (`_get_stage_X_state`)	Service (`run_stage_X`)
Stage 3	`status='new'`	`status='approved'`	`status='new'`
Stage 4	`status='queued'`	`status='ready'`	`status='queued'`

Result: Queue items don't match between real-time processing card and stage cards.
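One way to keep these filters from drifting apart again is to define each stage's queue status once and import it in both the view and the service. A minimal sketch, assuming a hypothetical module-level mapping `STAGE_QUEUE_STATUS` and helper `queue_status_for` (neither is part of the codebase described here); the values for stages 3 and 4 are the ones `pipeline_overview` and `run_stage_X` already agree on in the table above:

```python
# Hypothetical single source of truth for queue status filters.
# Only stages 3 and 4 are listed because those are the mismatches named above.
STAGE_QUEUE_STATUS: dict[int, str] = {
    3: "new",     # ContentIdeas waiting to be turned into tasks
    4: "queued",  # Tasks waiting for content generation
}


def queue_status_for(stage: int) -> str:
    """Return the status value that marks an item as queued for a stage."""
    try:
        return STAGE_QUEUE_STATUS[stage]
    except KeyError:
        # Fail loudly instead of silently filtering on a wrong status.
        raise ValueError(f"No queue status registered for stage {stage}") from None


print(queue_status_for(3))  # new
print(queue_status_for(4))  # queued
```

With a lookup like this, `pipeline_overview`, `_get_stage_X_state`, and `run_stage_X` would all filter on the same value, so a status rename only has to happen in one place.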

2. Progress Calculation Flaws

Frontend (CurrentProcessingCard.tsx):

// WRONG: Sums ALL numeric values in stageResult (including credits_used, batches_run, etc.)
const processed = stageResult ? Object.values(stageResult).reduce((s: number, v: any) => 
  typeof v === 'number' ? s + v : s, 0) : 0;

Should use specific fields: keywords_processed, clusters_processed, tasks_processed, etc.
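The size of the overcount is easy to see with a worked example. The sketch below reproduces the generic sum in Python against a made-up stage 1 result; `clusters_created` and all of the numbers are assumptions for illustration, while `credits_used` and `batches_run` are the kinds of fields named in the comment above:

```python
# Hypothetical stage 1 result; the values are invented for illustration.
stage_1_result = {
    "keywords_processed": 50,
    "clusters_created": 12,
    "credits_used": 24,
    "batches_run": 3,
}


def generic_sum(result: dict) -> int:
    """Mimic the buggy logic: add up every numeric value in the result."""
    return sum(
        v for v in result.values()
        if isinstance(v, (int, float)) and not isinstance(v, bool)
    )


def specific_count(result: dict, key: str) -> int:
    """Read only the field that actually counts processed items."""
    return int(result.get(key, 0))


print(generic_sum(stage_1_result))                           # 89
print(specific_count(stage_1_result, "keywords_processed"))  # 50
```

Here the card would report 89 processed items for a stage that only handled 50 keywords.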

3. "Pending" vs "Processed" Count Confusion

Stage cards show Total Queue: X which is pending count
Stage cards show Processed: Y which sums all numeric result values
Stage cards show Remaining: X which equals pending again (incorrect)
Correct formula: Total = Initial Pending (queue size captured at run start), Remaining = Total - Processed
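The relationship between the three numbers can be sketched as a small helper. This assumes "initial pending" means the queue size captured when the run starts (as in section 2.2), so the total stays fixed while the run progresses; the `StageCounts` type and the clamping of `processed` are illustrative choices, not existing code:

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class StageCounts:
    total: int
    processed: int
    remaining: int
    percentage: int


def stage_counts(initial_pending: int, processed: int) -> StageCounts:
    """Derive Total / Processed / Remaining from a fixed initial queue size."""
    total = max(0, initial_pending)
    # Assumption: never report more processed items than the stage started with.
    done = min(max(0, processed), total)
    remaining = total - done
    # Integer floor, so 100% only appears when every item is done.
    percentage = done * 100 // total if total else 0
    return StageCounts(total, done, remaining, percentage)


print(stage_counts(50, 32))
# StageCounts(total=50, processed=32, remaining=18, percentage=64)
```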

4. No Global Progress Visibility

Currently: Only current stage progress is shown during run.

Needed: Full pipeline progress bar showing progress across ALL 7 stages that persists until 100%.

5. API Inefficiency

17 separate API calls to fetch metrics on page load, plus duplicate calls in loadMetrics().

🏗️ ARCHITECTURE REDESIGN

New Data Model: Run Progress Snapshot

Add these fields to AutomationRun for accurate global tracking:

# AutomationRun Model Additions
class AutomationRun(models.Model):
    # ... existing fields ...
    
    # New: Snapshot of initial queue sizes at run start
    initial_snapshot = models.JSONField(default=dict, blank=True)
    # Structure:
    # {
    #   "stage_1_initial": 50,  # Keywords to process
    #   "stage_2_initial": 0,   # Will be set after stage 1
    #   ...
    #   "stage_7_initial": 0,
    #   "total_initial_items": 50
    # }

Unified Progress Response Schema

New endpoint response for consistent data:

{
  "run": {
    "run_id": "abc123",
    "status": "running",
    "current_stage": 4,
    "started_at": "2025-12-28T10:00:00Z"
  },
  "global_progress": {
    "total_items": 127,           // Sum of all stages' input items
    "completed_items": 84,        // Sum of all completed across stages
    "percentage": 66,
    "estimated_remaining_time": "~15 min"
  },
  "stages": [
    {
      "number": 1,
      "name": "Keywords → Clusters",
      "status": "completed",       // "pending" | "active" | "completed" | "skipped"
      "input_count": 50,           // Items that entered this stage
      "output_count": 12,          // Items produced (clusters)
      "processed_count": 50,       // Items processed
      "progress_percentage": 100
    },
    {
      "number": 2,
      "name": "Clusters → Ideas",
      "status": "completed",
      "input_count": 12,
      "output_count": 36,
      "processed_count": 12,
      "progress_percentage": 100
    },
    {
      "number": 4,
      "name": "Tasks → Content",
      "status": "active",
      "input_count": 36,
      "output_count": 22,
      "processed_count": 22,
      "progress_percentage": 61,
      "currently_processing": [
        { "id": 123, "title": "How to build React apps" }
      ],
      "up_next": [
        { "id": 124, "title": "Vue vs React comparison" }
      ]
    }
    // ... etc
  ],
  "metrics": {
    "credits_used": 156,
    "duration_seconds": 1823,
    "errors": []
  }
}
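The `global_progress` block can be derived from the `stages` array rather than stored separately. A minimal sketch of that aggregation, assuming a hypothetical helper `build_global_progress` and using only the three stages spelled out in the sample above (which is why the total here is 98 rather than 127):

```python
from typing import TypedDict


class StageProgress(TypedDict):
    number: int
    input_count: int
    processed_count: int


def build_global_progress(stages: list[StageProgress]) -> dict[str, int]:
    """Aggregate per-stage counts into the global_progress fields."""
    total = sum(s["input_count"] for s in stages)
    # Cap each stage at its own input so one stage cannot push the sum past 100%.
    completed = sum(min(s["processed_count"], s["input_count"]) for s in stages)
    percentage = completed * 100 // total if total else 0
    return {
        "total_items": total,
        "completed_items": completed,
        "percentage": percentage,
    }


sample_stages: list[StageProgress] = [
    {"number": 1, "input_count": 50, "processed_count": 50},
    {"number": 2, "input_count": 12, "processed_count": 12},
    {"number": 4, "input_count": 36, "processed_count": 22},
]
print(build_global_progress(sample_stages))
# {'total_items': 98, 'completed_items': 84, 'percentage': 85}
```

The percentage is floored rather than rounded so the bar cannot show 100% while items are still outstanding; that is a design choice of this sketch, not something the plan specifies.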

📝 IMPLEMENTATION PLAN

Phase 1: Backend Fixes (Critical)

1.1 Fix Status Mismatches

File: automation_service.py

# FIX _get_stage_3_state - use 'new' to match pipeline_overview
def _get_stage_3_state(self) -> dict:
    queue = ContentIdeas.objects.filter(
        site=self.site, status='new'  # Changed from 'approved'
    ).order_by('id')
    ...

# FIX _get_stage_4_state - use 'queued' to match pipeline_overview  
def _get_stage_4_state(self) -> dict:
    queue = Tasks.objects.filter(
        site=self.site, status='queued'  # Changed from 'ready'
    ).order_by('id')
    ...

1.2 Fix `_get_processed_count()` Method

Current code sums wrong fields. Create stage-specific processed count extraction:

def _get_processed_count(self, stage: int) -> int:
    """Get accurate processed count from stage result"""
    result = getattr(self.run, f'stage_{stage}_result', None)
    if not result:
        return 0
    
    # Map stage to correct result key
    key_map = {
        1: 'keywords_processed',
        2: 'clusters_processed', 
        3: 'ideas_processed',
        4: 'tasks_processed',
        5: 'content_processed',
        6: 'images_processed',
        7: 'ready_for_review'
    }
    return result.get(key_map.get(stage, ''), 0)

1.3 New Unified Progress Endpoint

File: views.py

Add new run_progress endpoint:

@action(detail=False, methods=['get'], url_path='run_progress')
def run_progress(self, request):
    """
    GET /api/v1/automation/run_progress/?site_id=123&run_id=abc
    Single endpoint for ALL run progress data - global + per-stage
    """
    # Returns unified progress response schema

1.4 Capture Initial Snapshot on Run Start

File: automation_service.py

In start_automation():

from django.db.models import Count  # needed for the stage 5 annotate() below

def start_automation(self, trigger_type: str = 'manual') -> str:
    # ... existing code ...
    
    # Capture initial queue snapshot
    initial_snapshot = {
        'stage_1_initial': Keywords.objects.filter(site=self.site, status='new', cluster__isnull=True, disabled=False).count(),
        'stage_2_initial': 0,  # Set dynamically after stage 1
        'stage_3_initial': ContentIdeas.objects.filter(site=self.site, status='new').count(),
        'stage_4_initial': Tasks.objects.filter(site=self.site, status='queued').count(),
        'stage_5_initial': Content.objects.filter(site=self.site, status='draft').annotate(images_count=Count('images')).filter(images_count=0).count(),
        'stage_6_initial': Images.objects.filter(site=self.site, status='pending').count(),
        'stage_7_initial': Content.objects.filter(site=self.site, status='review').count(),
    }
    initial_snapshot['total_initial_items'] = sum(initial_snapshot.values())
    
    self.run = AutomationRun.objects.create(
        # ... existing fields ...
        initial_snapshot=initial_snapshot
    )
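Because `stage_2_initial` starts at 0, the snapshot has to be refreshed as each stage finishes (the "cascading stages" item in the checklist). A possible shape for that update, written as a pure function on the snapshot dict. The name `update_snapshot_after_stage` is hypothetical, and so is the assumption that a stage's output is added on top of whatever was already queued for the next stage:

```python
def update_snapshot_after_stage(
    snapshot: dict[str, int], completed_stage: int, output_count: int
) -> dict[str, int]:
    """Return a new snapshot with the next stage's initial count raised by output_count."""
    updated = dict(snapshot)  # leave the caller's dict untouched
    next_stage = completed_stage + 1
    if next_stage <= 7:
        key = f"stage_{next_stage}_initial"
        updated[key] = updated.get(key, 0) + output_count
    # Recompute the total, excluding the total key itself to avoid double counting.
    updated["total_initial_items"] = sum(
        value for name, value in updated.items() if name != "total_initial_items"
    )
    return updated


before = {"stage_1_initial": 50, "stage_2_initial": 0, "total_initial_items": 50}
after = update_snapshot_after_stage(before, completed_stage=1, output_count=12)
print(after)
# {'stage_1_initial': 50, 'stage_2_initial': 12, 'total_initial_items': 62}
```

Note that the total must skip its own key on recomputation; the `sum(initial_snapshot.values())` used at run start only works because the total key has not been added yet.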

Phase 2: Frontend Fixes

2.1 Fix Progress Calculation in CurrentProcessingCard

File: CurrentProcessingCard.tsx

// Replace generic sum with stage-specific extraction
const getProcessedFromResult = (result: any, stageNumber: number): number => {
  if (!result) return 0;
  
  const keyMap: Record<number, string> = {
    1: 'keywords_processed',
    2: 'clusters_processed',
    3: 'ideas_processed',
    4: 'tasks_processed',
    5: 'content_processed',
    6: 'images_processed',
    7: 'ready_for_review'
  };
  
  return result[keyMap[stageNumber]] ?? 0;
};

2.2 Fix Stage Card Metrics

File: AutomationPage.tsx

// Current (WRONG):
const processed = result ? Object.values(result).reduce((sum, val) => typeof val === 'number' ? sum + val : sum, 0) : 0;
const total = (stage.pending ?? 0) + processed;  // Wrong: pending is current, not initial

// Fixed:
const processed = getProcessedFromResult(result, stage.number);
const initialPending = currentRun?.initial_snapshot?.[`stage_${stage.number}_initial`] ?? stage.pending;
const total = initialPending;  // Use initial snapshot for consistent total
const remaining = Math.max(0, total - processed);

2.3 New Global Progress Bar Component

New File: frontend/src/components/Automation/GlobalProgressBar.tsx

interface GlobalProgressBarProps {
  currentRun: AutomationRun;
  pipelineOverview: PipelineStage[];
}

const GlobalProgressBar: React.FC<GlobalProgressBarProps> = ({ currentRun, pipelineOverview }) => {
  // Calculate total progress across all stages
  const calculateGlobalProgress = () => {
    if (!currentRun?.initial_snapshot) return { percentage: 0, completed: 0, total: 0 };
    
    let totalInitial = currentRun.initial_snapshot.total_initial_items || 0;
    let totalCompleted = 0;
    
    for (let i = 1; i <= 7; i++) {
      const result = currentRun[`stage_${i}_result`];
      if (result) {
        totalCompleted += getProcessedFromResult(result, i);
      }
    }
    
    // If current stage is active, add its progress
    const currentStage = currentRun.current_stage;
    // ... calculate current stage partial progress
    
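    return {
      // Clamp to 100: later stages can process items that were not in the initial snapshot
      percentage: totalInitial > 0 ? Math.min(100, Math.round((totalCompleted / totalInitial) * 100)) : 0,
      completed: totalCompleted,
      total: totalInitial
    };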
  };
  
  const { percentage, completed, total } = calculateGlobalProgress();
  
  // Hide only once the run is completed AND progress has reached 100%
  if (currentRun.status === 'completed' && percentage === 100) {
    return null;
  }
  
  return (
    <div className="bg-gradient-to-r from-brand-50 to-brand-100 border-2 border-brand-300 rounded-xl p-4 mb-6">
      <div className="flex justify-between items-center mb-2">
        <div className="flex items-center gap-2">
          <BoltIcon className="w-5 h-5 text-brand-600 animate-pulse" />
          <span className="font-bold text-brand-800">Full Pipeline Progress</span>
        </div>
        <span className="text-2xl font-bold text-brand-600">{percentage}%</span>
      </div>
      
      {/* Segmented progress bar showing all 7 stages */}
      <div className="flex h-4 rounded-full overflow-hidden bg-gray-200">
        {[1, 2, 3, 4, 5, 6, 7].map(stageNum => {
          const stageConfig = STAGE_CONFIG[stageNum - 1];
          const result = currentRun[`stage_${stageNum}_result`];
          const stageComplete = currentRun.current_stage > stageNum;
          const isActive = currentRun.current_stage === stageNum;
          
          return (
            <div
              key={stageNum}
              className={`flex-1 transition-all duration-500 ${
                stageComplete ? `bg-gradient-to-r ${stageConfig.color}` :
                isActive ? `bg-gradient-to-r ${stageConfig.color} opacity-60 animate-pulse` :
                'bg-gray-300'
              }`}
              title={`Stage ${stageNum}: ${stageConfig.name}`}
            />
          );
        })}
      </div>
      
      <div className="flex justify-between text-xs text-gray-600 mt-2">
        <span>{completed} / {total} items processed</span>
        <span>Stage {currentRun.current_stage} of 7</span>
      </div>
    </div>
  );
};

2.4 Consolidate API Calls

File: AutomationPage.tsx

Replace 17 separate API calls with single unified endpoint:

// Current (17 calls):
const [keywordsTotalRes, keywordsNewRes, keywordsMappedRes, ...14 more] = await Promise.all([...]);

// New (1 call):
const progressData = await automationService.getRunProgress(activeSite.id, currentRun?.run_id);
// Response contains everything: metrics, stage counts, progress data

Phase 3: Stage Card Redesign

3.1 New Stage Card Layout

Each stage card shows:

┌────────────────────────────────────────────┐
│  Stage 1    [ICON]    ● Active             │
│  Keywords → Clusters                        │
├────────────────────────────────────────────┤
│  Total Items:      50                       │
│  Processed:        32     ████████░░ 64%   │
│  Remaining:        18                       │
├────────────────────────────────────────────┤
│  Output Created:   8 clusters               │
│  Credits Used:     24                       │
│  Duration:         4m 32s                   │
└────────────────────────────────────────────┘
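The card shows a duration such as "4m 32s", while the unified response carries `duration_seconds`. A small formatting helper covers the gap; this is a sketch with a hypothetical name `format_duration`, and the hour format is an assumption since the mockup only shows minutes and seconds:

```python
def format_duration(seconds: int) -> str:
    """Format a duration in seconds as e.g. '4m 32s' for the stage card."""
    minutes, secs = divmod(max(0, seconds), 60)
    hours, minutes = divmod(minutes, 60)
    if hours:
        return f"{hours}h {minutes}m {secs}s"  # assumed format for long runs
    if minutes:
        return f"{minutes}m {secs}s"
    return f"{secs}s"


print(format_duration(272))   # 4m 32s
print(format_duration(1823))  # 30m 23s
```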

3.2 Status Badge Logic

const getStageStatus = (stageNum: number, currentRun: AutomationRun | null) => {
  if (!currentRun) {
    // No run - show if items pending
    return pipelineOverview[stageNum - 1]?.pending > 0 ? 'ready' : 'empty';
  }
  
  if (currentRun.current_stage > stageNum) return 'completed';
  if (currentRun.current_stage === stageNum) return 'active';
  if (currentRun.current_stage < stageNum) {
    // Check if previous stage produced items for this stage
    const prevResult = currentRun[`stage_${stageNum - 1}_result`];
    if (prevResult?.output_count > 0) return 'ready';
    return 'pending';
  }
  return 'pending';
};
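The badge rules above form a small decision table, which is easiest to verify as a pure function. The sketch below restates the same logic in Python for testing purposes; the parameter names are illustrative stand-ins for the values the TypeScript version reads from `currentRun` and `pipelineOverview`:

```python
from typing import Optional


def get_stage_status(
    stage_num: int,
    current_stage: Optional[int],  # None when there is no run
    pending: int,                  # current pending count for this stage
    prev_output_count: int,        # output_count of the previous stage's result
) -> str:
    """Return the badge for a stage: ready, empty, completed, active, or pending."""
    if current_stage is None:
        # No run: the stage is ready only if it has items waiting.
        return "ready" if pending > 0 else "empty"
    if current_stage > stage_num:
        return "completed"
    if current_stage == stage_num:
        return "active"
    # Upcoming stage: ready once the previous stage has produced input for it.
    return "ready" if prev_output_count > 0 else "pending"


print(get_stage_status(4, 4, pending=14, prev_output_count=36))  # active
print(get_stage_status(5, 4, pending=0, prev_output_count=22))   # ready
```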

Phase 4: Real-time Updates Optimization

4.1 Smart Polling with Adaptive Intervals

// Current: Fixed 5s interval
const interval = setInterval(loadData, 5000);

// New: Adaptive polling
// progressPercent is the current stage's progress (0-100), supplied by the caller
const useSmartPolling = (isRunning: boolean, progressPercent: number) => {
  const [pollInterval, setPollInterval] = useState(2000);
  
  useEffect(() => {
    if (!isRunning) {
      setPollInterval(30000); // Slow poll when idle
      return;
    }
    
    // Poll every 2-3s for most of the stage, then speed up near completion
    if (progressPercent < 50) {
      setPollInterval(2000);  // 2s when lots happening
    } else if (progressPercent < 90) {
      setPollInterval(3000);  // 3s mid-stage
    } else {
      setPollInterval(1000);  // 1s near completion for responsive transition
    }
  }, [isRunning, progressPercent]);
  
  return pollInterval;
};
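The interval choice is independent of React, so it can be pulled out and tested on its own. A sketch of the thresholds above as a pure function, written in Python for easy testing (the name `poll_interval_ms` is hypothetical; the logic should port line for line into the hook):

```python
def poll_interval_ms(is_running: bool, progress_percent: float) -> int:
    """Pick a polling interval in milliseconds from run state and stage progress."""
    if not is_running:
        return 30_000  # slow poll when idle
    if progress_percent < 50:
        return 2_000   # early in the stage
    if progress_percent < 90:
        return 3_000   # mid-stage
    return 1_000       # near completion, for a responsive stage transition


print(poll_interval_ms(False, 0))  # 30000
print(poll_interval_ms(True, 10))  # 2000
print(poll_interval_ms(True, 75))  # 3000
print(poll_interval_ms(True, 95))  # 1000
```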

4.2 Optimistic UI Updates

When user clicks "Run Now":

Immediately show GlobalProgressBar at 0%
Immediately set Stage 1 to "Active"
Don't wait for API confirmation

📋 DETAILED CHECKLIST

Backend Tasks

Fix _get_stage_3_state() status filter: 'approved' → 'new' ✅ DONE
Fix _get_stage_4_state() status filter: 'ready' → 'queued' ✅ DONE
Create _get_processed_for_stage(stage_num) helper ✅ DONE (renamed to _get_processed_count)
Add initial_snapshot JSON field to AutomationRun model ✅ DONE
Capture initial snapshot in start_automation() ✅ DONE
Update snapshot after each stage completes (for cascading stages)
Create new run_progress endpoint with unified schema ✅ DONE
Add migration for new model field ✅ DONE (0006_automationrun_initial_snapshot.py)

Frontend Tasks

Create GlobalProgressBar component ✅ DONE
Add GlobalProgressBar to AutomationPage (below metrics, above CurrentProcessingCard) ✅ DONE
Fix getProcessedFromResult() helper to extract stage-specific counts ✅ DONE
Update stage card progress calculations ✅ DONE
Update CurrentProcessingCard progress calculations ✅ DONE
Add getRunProgress method to automationService.ts ✅ DONE
Consolidate metrics API calls to single endpoint
Implement smart polling with adaptive intervals
Add optimistic UI updates for "Run Now" action
Fix "Remaining" count to be Total - Processed not Pending ✅ DONE

Testing

Test all 7 stages complete correctly
Verify counts match between stage cards and processing card
Test pause/resume preserves progress correctly
Test page refresh during run shows correct state
Test global progress bar persists until 100%
Load test: Verify API efficiency improvement

🎯 SUCCESS CRITERIA

Accurate Counts: All stage cards show correct Total/Processed/Remaining
Consistent Data: CurrentProcessingCard and Stage Cards show same numbers
Global Visibility: Users see full pipeline progress at all times during run
Persistent Progress: Progress bar stays visible until 100% complete
Real-time Feel: Updates appear within 2-3 seconds of actual progress
API Efficiency: Reduce API calls from 17+ to 1-2 per refresh cycle

🔄 MIGRATION PATH

Phase 1 (Day 1): Backend status fixes + new processed count logic
Phase 2 (Day 2): Frontend progress calculation fixes
Phase 3 (Day 3): Global Progress Bar + API consolidation
Phase 4 (Day 4): Smart polling + optimistic updates
Phase 5 (Day 5): Testing + bug fixes

This plan provides a clear, implementable path to fix all automation page issues. Each phase can be implemented independently, and the plan contains enough detail that any AI model or developer can execute it in a future session.
