Release: vision lane — image quality + DPI detection + saliency (v188–v190) by JE-Chen · Pull Request #409 · Integration-Automation/AutoControlGUI

JE-Chen · 2026-06-24T06:55:19Z

Release: vision lane (v188–v190)

Bundles the completed vision lane — three image-analysis features that reuse visual_match's loaders, each merged to dev CI-green (Codacy 0 / SonarCloud OK / all matrices + Docker).

image_quality (v188, Add image_quality: sharpness/contrast/brightness gate before OCR #406) — sharpness / contrast / brightness metrics + a quality_gate (blurry / low_contrast / too_dark / too_bright) to refuse OCR on a bad frame.
scale_detect (v189, Add scale_detect: infer display scale / visual DPI from a template #407) — infer the display scale / visual DPI a template renders at, with the per-scale score profile + confidence margin match_template discards.
saliency (v190, Add saliency: spectral-residual visual saliency (where to look) #408) — spectral-residual visual saliency (pure numpy FFT) → ranked salient regions: where to look with no template / colour / text.

All three keep the metric / inference / transform logic headless-testable (cv2 via importorskip); cv2/numpy are lazily imported. EN/Zh docs (v188–v190) + WHATS_NEW entries included.

OCR and template matching quietly fail on a blurry, washed-out or too-dark capture, and the caller can't tell a missing element from an unreadable one. Measure sharpness (variance of the Laplacian), contrast (grayscale stddev) and brightness (mean), and gate on them with named issues (blurry / low_contrast / too_dark / too_bright) so a script can pre-process or re-capture before OCR. Reuses visual_match's grayscale loader; cv2/numpy lazily imported.

…y-batch Add image_quality: sharpness/contrast/brightness gate before OCR

A template cropped at 100% scale won't match on a 150%-DPI machine, and match_template returns only the single best match, discarding the per-scale scores. scale_sweep keeps the whole profile (every scale's best match) and detect_scale reports the winning scale as a DPI inference with a confidence margin (how far it beats the runner-up). Reuses visual_match._score_map per scale; cv2/numpy lazily imported.

…-batch Add scale_detect: infer display scale / visual DPI from a template

When there's no template, colour or text to key on, an agent still needs a cue for where to look. Compute the spectral-residual saliency map (Hou & Zhang 2007) and rank salient boxes in source coordinates. Pure numpy FFT (cv2.saliency is opencv-contrib, forbidden), reusing visual_match's grayscale loader and cv2_utils.blobs.connected_boxes; regions threshold at mean+2*std by default. A coarse attention cue to narrow where a template / OCR pass then looks.

Add saliency: spectral-residual visual saliency (where to look)

codacy-production · 2026-06-24T06:56:21Z

Up to standards ✅

🟢 Issues 0 issues

Results:
0 new issues

View in Codacy

🟢 Metrics 77 complexity · 0 duplication

Metric Results

Complexity 77

Duplication 0

View in Codacy

_{NEW Get contextual insights on your PRs based on Codacy's metrics, along with PR and Jira context, without leaving GitHub. Enable AI reviewer}
_{TIP This summary will be updated as you push new changes.}

sonarqubecloud · 2026-06-24T07:03:24Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

JE-Chen added 6 commits June 24, 2026 14:25

Merge pull request #406 from Integration-Automation/feat/image-qualit…

debb186

…y-batch Add image_quality: sharpness/contrast/brightness gate before OCR

Merge pull request #407 from Integration-Automation/feat/scale-detect…

538a6b4

…-batch Add scale_detect: infer display scale / visual DPI from a template

Merge pull request #408 from Integration-Automation/feat/saliency-batch

c3a4f1a

Add saliency: spectral-residual visual saliency (where to look)

JE-Chen merged commit 2ba8465 into main Jun 24, 2026
31 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Release: vision lane — image quality + DPI detection + saliency (v188–v190)#409

Release: vision lane — image quality + DPI detection + saliency (v188–v190)#409
JE-Chen merged 6 commits into
mainfrom
dev

JE-Chen commented Jun 24, 2026

Uh oh!

codacy-production Bot commented Jun 24, 2026

Uh oh!

Uh oh!

sonarqubecloud Bot commented Jun 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

JE-Chen commented Jun 24, 2026

Release: vision lane (v188–v190)

Uh oh!

codacy-production Bot commented Jun 24, 2026

Up to standards ✅

Uh oh!

Uh oh!

sonarqubecloud Bot commented Jun 24, 2026

Quality Gate passed

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant