Step 3: Runbook

This runbook defines how to operate the pipeline in production conditions. Step 2 described execution flow; Step 3 describes operator procedure and recovery discipline.

3.1 Operating objective

The objective of a run is to move selected sets from local source folders into reviewed and safely published assets, while preserving deterministic naming, auditable queue state, and clean retry behavior on failure.

3.2 Preflight checklist (before each run)

  1. Open a shell at repository root and activate the environment used for production runs.
  2. If running EXE with external Python, set AMIR_PYTHON to a Python 3.13 runtime.
  3. Confirm Ollama is reachable: ollama list.
  4. Confirm required caption models are available (primary + optional fallback based on config).
  5. Confirm startup runtime line in log shows expected processor mode: processor=GPU/CPU.
  6. Validate publish configuration values (FTP/MySQL host, credentials, base paths) for the target environment.
  7. Confirm writable local paths: data/, logs/, and data/ollama_tmp/.
  8. Ensure no stale external tool is holding locks on data/review.db.

Example preflight commands (PowerShell):

Set-Location "\path\to\amir2000_image_automation"
.\.venv313\Scripts\Activate.ps1
ollama list
python .\main_set.py
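
Checklist items 7 and 8 can be partially automated before each run. A minimal Python sketch, assuming only the path conventions named above (data/, logs/, data/ollama_tmp/, data/review.db); the function name is hypothetical:

```python
import os
import sqlite3


def preflight_checks(root: str) -> list[str]:
    """Return a list of preflight problems; an empty list means ready to run."""
    problems = []
    # Item 7: required local paths must exist and be writable.
    for rel in ("data", "logs", os.path.join("data", "ollama_tmp")):
        path = os.path.join(root, rel)
        if not os.path.isdir(path):
            problems.append(f"missing directory: {rel}")
        elif not os.access(path, os.W_OK):
            problems.append(f"not writable: {rel}")
    # Item 8: the review DB must not be locked by a stale external tool.
    db = os.path.join(root, "data", "review.db")
    if os.path.exists(db):
        try:
            con = sqlite3.connect(db, timeout=1)
            con.execute("BEGIN IMMEDIATE")  # fails fast if another writer holds a lock
            con.rollback()
            con.close()
        except sqlite3.OperationalError as exc:
            problems.append(f"review.db locked: {exc}")
    return problems
```

Abort the run whenever the returned list is non-empty; Ollama, model, and publish-config checks (items 3 to 6) still need their own probes.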

3.3 Standard run procedure

  1. Start main_set.py and import one or more sets.
  2. If metadata logic/build changed since the last run, execute a small smoke batch first (for example 2 to 10 images across different folders) before large production queues.
  3. Run the batch and monitor stage progress in console/UI.
  4. Allow stages 1 to 7 to complete; review queue rows are prepared automatically.
  5. In the review editor, validate/edit filename, caption, alt text, keywords, and quality context; use Generate for row-level metadata retry when needed.
  6. Set row decisions explicitly to approved or rejected.
  7. Publish approved rows only and wait for uploader completion.
  8. On publish completion, one final dialog is shown; click OK to close the review window.
  9. Perform post-run validation before starting a new batch.
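
The approve-then-publish gate in steps 5 to 7 can be sketched as a filter. This is an illustration only; the field names (decision, filename, caption, alt_text, keywords) are assumptions, not the app's actual queue schema:

```python
REQUIRED_FIELDS = ("filename", "caption", "alt_text", "keywords")


def publishable_rows(rows):
    """Split review-queue rows into (publish, blocked).

    A row publishes only when its decision is explicitly 'approved' AND all
    reviewed metadata fields are non-empty; everything else is blocked with a
    reason, matching the 'approved rows only' rule in step 7.
    """
    publish, blocked = [], []
    for row in rows:
        if row.get("decision") != "approved":
            blocked.append((row, "decision is not 'approved'"))
            continue
        missing = [f for f in REQUIRED_FIELDS if not row.get(f)]
        if missing:
            blocked.append((row, f"missing fields: {', '.join(missing)}"))
        else:
            publish.append(row)
    return publish, blocked
```

Treating anything other than an explicit "approved" as blocked keeps pending rows from slipping into a publish run by default.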

3.4 Publish gate controls

3.5 Post-run validation checklist

  1. Review logs/latest_run.log for stage failures or warnings requiring action.
  2. Verify startup line reports expected Ollama processor mode and VRAM.
  3. Review data/prefill_qc_last.json for duplicate/suspicious prefill rows before final publish decisions.
  4. Review logs/db_uploader.log for upload/upsert failures by row.
  5. Verify expected rows exist in MySQL photos_info_revamp by File_Name.
  6. Verify website image and thumbnail URLs resolve as expected.
  7. Confirm local mirror and queue statuses align with final decisions.
  8. Confirm temporary staging does not retain unintended stale artifacts.
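
For item 3, a duplicate scan over data/prefill_qc_last.json might look like the sketch below. The JSON layout (a list of records with a caption field) and the helper name are assumptions; adjust to the real file structure:

```python
import json
from collections import Counter


def duplicate_prefills(qc_path: str) -> list[str]:
    """Report caption texts that appear on more than one prefill row.

    Captions are normalized (trimmed, lowercased) so trivially different
    copies of the same generic phrasing are still flagged for review.
    """
    with open(qc_path, encoding="utf-8") as fh:
        rows = json.load(fh)
    counts = Counter(r["caption"].strip().lower() for r in rows)
    return [caption for caption, n in counts.items() if n > 1]
```

Any caption the scan returns deserves a manual look in the review editor before the final publish decision.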

3.6 Incident playbooks

A. Quality scoring stage fails

  1. Retry once in-app.
  2. If it fails again, inspect logs/latest_run.log for dependency/model/runtime errors.
  3. If the error mentions a python312.dll conflict with this version of Python, relaunch the EXE with AMIR_PYTHON pointing to a Python 3.13 runtime and verify the runtime line in the log.
  4. Fix dependency/model/runtime mismatch, then rerun the batch from start.
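
The runtime verification in step 3 can also be asserted from inside the embedded interpreter. A minimal sketch, assuming Python 3.13 is the expected runtime; the function name is hypothetical:

```python
import sys


def runtime_matches(expected=(3, 13)) -> bool:
    """Check which interpreter the EXE actually loaded.

    Returns True only when the running major/minor version equals the
    expected (3, 13); a python312.dll conflict shows up here as (3, 12).
    """
    return sys.version_info[:2] == expected
```

Running this once at startup and logging the result gives the "runtime line" a machine-checkable counterpart.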

B. Caption prefill fails or stalls

  1. Check Ollama service and model list using ollama list.
  2. If the error states the model is missing, pull the configured primary caption model. Pull the fallback model only when your config explicitly enables fallback.
  3. Confirm retry/continuation behavior in logs: [RETRY]/[RETRY-OK] (when fallback is enabled), timeout warnings, or quarantine lines for repeated native crash rows.
  4. If Stage 6 appears to loop on the same row, stop the run, inspect the first failing row reason in latest_run.log, and rerun after fixing model/config issues.
  5. If the issue is weak/generic phrasing after a code fix, confirm the EXE was rebuilt after updating caption_review_local.py and rerun a small sample before restarting the full queue.
  6. Re-run after confirming model availability and stable service response.
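
The markers named in step 3 can be pulled out of latest_run.log with a small scanner. The marker set below is taken from step 3; the helper name is hypothetical:

```python
import re

# [RETRY]/[RETRY-OK] markers, timeout warnings, and quarantine lines.
MARKERS = re.compile(r"\[(RETRY|RETRY-OK)\]|timeout|quarantine", re.IGNORECASE)


def prefill_incident_lines(log_text: str) -> list[str]:
    """Return only the prefill-relevant lines from a run log's text."""
    return [line for line in log_text.splitlines() if MARKERS.search(line)]
```

Scanning the filtered lines top to bottom surfaces the first failing row quickly, which is what step 4 asks for before a rerun.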

C. Publish fails (FTP/MySQL)

  1. Inspect logs/db_uploader.log and identify the first failing row.
  2. Validate credentials, host reachability, and target path/table configuration.
  3. Re-run publish after connectivity/authentication is confirmed fixed.
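
Host reachability in step 2 can be probed at the TCP level before retrying credentials. Ports 21 (FTP) and 3306 (MySQL) are the conventional defaults, not values from this pipeline's config; substitute your configured ones:

```python
import socket


def reachable(host: str, port: int, timeout: float = 3.0) -> bool:
    """TCP-level reachability probe for a publish target.

    Confirms only that the port answers a connection; credentials, target
    paths, and table configuration still need separate validation.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False
```

A False result distinguishes network/firewall problems from authentication failures before you touch credentials.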

D. Crash or forced stop

  1. Restart app and use Recover crash session first (before rebuilding sets manually).
  2. Inspect logs/latest_run.log and crash_startup.log when present.
  3. Inspect data/crash_runtime.log when add-set callbacks or UI runtime handlers failed.
  4. Inspect latest queue rows for partial state before taking cleanup actions.
  5. Recovery validates saved paths against both the incoming and staged locations (including the recorded origin) before restoring rows.
  6. Release reserved filenames only when reuse safety is certain.

If recovery reports that no valid files were found on disk, verify the files still exist in either the incoming or staged location and confirm the session file was not deleted.
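
The path validation in step 5 can be sketched as an ordered lookup. This illustrates the lookup order described above, not the app's actual recovery code; names are hypothetical:

```python
import os


def resolve_recovered_path(filename, incoming_dir, staged_dir, recorded_origin=None):
    """Locate a saved row's file before restoring it.

    Checks the recorded origin first, then the incoming and staged locations;
    returns the first existing path, or None when the file is gone from disk
    (the 'no valid files were found' case).
    """
    candidates = []
    if recorded_origin:
        candidates.append(recorded_origin)
    candidates.append(os.path.join(incoming_dir, filename))
    candidates.append(os.path.join(staged_dir, filename))
    for path in candidates:
        if os.path.isfile(path):
            return path
    return None
```

Rows that resolve to None should stay unrestored until the underlying files are located.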

E. SQLite lock/inconsistency

  1. Close any process holding the DB file.
  2. Back up current data/ state.
  3. Re-initialize DB with python .\init_db.py only if reset is required.
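
The backup in step 2 can be a timestamped copy so earlier backups are never overwritten. A minimal sketch; the backup destination naming is an assumption:

```python
import os
import shutil
import time


def backup_data_dir(data_dir: str, backup_root: str) -> str:
    """Copy data/ aside before any destructive DB reset.

    Uses a timestamped destination so repeated backups never clobber each
    other; run init_db.py only after this returns successfully.
    """
    stamp = time.strftime("%Y%m%d-%H%M%S")
    dest = os.path.join(backup_root, f"data-backup-{stamp}")
    shutil.copytree(data_dir, dest)
    return dest
```

shutil.copytree refuses to overwrite an existing destination, which is exactly the safety behavior wanted here.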

3.7 Safe rerun procedure

  1. Fix the root cause first (model, credentials, path permission, dependency).
  2. Confirm rollback completed or manually validate that staging state is clean.
  3. Re-run the same set through normal pipeline entry, not partial manual edits.
  4. Verify that newly generated filenames remain collision-free.
  5. Re-check publish output and queue status after completion.
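
The collision check in step 4 reduces to counting normalized names. Case-insensitive comparison is an assumption about the publish targets (FTP paths, the MySQL File_Name column); drop the lower() call if your targets are case-sensitive:

```python
from collections import Counter


def filename_collisions(filenames):
    """Report any generated filename that is used more than once.

    Normalizes to lowercase so 'A.jpg' and 'a.jpg' count as a collision on
    case-insensitive publish targets.
    """
    counts = Counter(name.lower() for name in filenames)
    return sorted(name for name, n in counts.items() if n > 1)
```

An empty result is the precondition for continuing to step 5; any reported name means the rerun produced a non-deterministic or duplicate filename.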

3.8 Controlled taxonomy/config updates

3.9 Continuation path

Step 3 defines operations and incident handling. Step 4 documents the database model that supports these controls.

© 2026 Amir Darzi