Add saliency: spectral-residual visual saliency (where to look) by JE-Chen · Pull Request #408 · Integration-Automation/AutoControlGUI

JE-Chen · 2026-06-24T06:49:52Z

Why

When there's no template, no known colour and no text to OCR, an agent still needs a cue for where to look — the region that stands out (a popup, a badge, a highlighted row). saliency computes the spectral-residual saliency map (Hou & Zhang 2007 — log amplitude minus its local average, reconstructed through the phase) and turns it into ranked salient boxes.

saliency_map — the normalised (0–1) saliency map as an ndarray
salient_regions — ranked salient boxes {x, y, width, height, center, score} in source pixel coordinates
most_salient — the single most salient region (the first place to look)

Design

The transform is a pure numpy FFT — cv2.saliency lives in the forbidden opencv-contrib package, so it's re-implemented over base opencv only.
Reuses visual_match._haystack_gray (any ndarray / path / PIL image, or the live screen) and cv2_utils.blobs.connected_boxes for region extraction. cv2/numpy lazily imported.
Regions threshold at mean + 2·std of the saliency map by default (scale-invariant; pass threshold to override), then scale back to source pixel coordinates. Saliency is a coarse attention cue, documented as such — it narrows where a template / OCR pass then looks.
5 layers wired: core → facade __all__ → AC_salient_regions / AC_most_salient → read-only ac_* MCP tools → Script Builder (Image). Qt-free verified.

Tests

test/unit_test/headless/test_saliency_batch.py (cv2 via importorskip) — map shape/dtype/range, size param, salient regions in-bounds + ranked + scores in [0,1] on a 3-block frame, most_salient matches the top region, the high-threshold []/None path, the pure executor path, and 5-layer wiring. 23 passed with the vision siblings. This completes the vision lane HIGH items (image_quality / scale_detect / saliency).

When there's no template, colour or text to key on, an agent still needs a cue for where to look. Compute the spectral-residual saliency map (Hou & Zhang 2007) and rank salient boxes in source coordinates. Pure numpy FFT (cv2.saliency is opencv-contrib, forbidden), reusing visual_match's grayscale loader and cv2_utils.blobs.connected_boxes; regions threshold at mean+2*std by default. A coarse attention cue to narrow where a template / OCR pass then looks.

codacy-production · 2026-06-24T06:51:36Z

Up to standards ✅

🟢 Issues 0 issues

Results:
0 new issues

View in Codacy

🟢 Metrics 30 complexity · 0 duplication

Metric Results

Complexity 30

Duplication 0

View in Codacy

_{NEW Get contextual insights on your PRs based on Codacy's metrics, along with PR and Jira context, without leaving GitHub. Enable AI reviewer}
_{TIP This summary will be updated as you push new changes.}

sonarqubecloud · 2026-06-24T06:56:36Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

JE-Chen merged commit c3a4f1a into dev Jun 24, 2026
16 checks passed

JE-Chen deleted the feat/saliency-batch branch June 24, 2026 06:55

JE-Chen mentioned this pull request Jun 24, 2026

Release: vision lane — image quality + DPI detection + saliency (v188–v190) #409

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add saliency: spectral-residual visual saliency (where to look)#408

Add saliency: spectral-residual visual saliency (where to look)#408
JE-Chen merged 1 commit into
devfrom
feat/saliency-batch

JE-Chen commented Jun 24, 2026

Uh oh!

codacy-production Bot commented Jun 24, 2026

Uh oh!

Uh oh!

sonarqubecloud Bot commented Jun 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

JE-Chen commented Jun 24, 2026

Why

Design

Tests

Uh oh!

codacy-production Bot commented Jun 24, 2026

Up to standards ✅

Uh oh!

Uh oh!

sonarqubecloud Bot commented Jun 24, 2026

Quality Gate passed

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant