fix commands for NPU in embedding demo #4315

Open
dtrawins wants to merge 2 commits into main from CVS-187442
@dtrawins

Collaborator

🛠 Summary

CVS-187442

🧪 Checklist

  • Unit tests added.
  • The documentation updated.
  • Change follows security best practices.

Copilot AI review requested due to automatic review settings June 22, 2026 22:07

Copilot AI left a comment

Contributor

Pull request overview

Updates the embeddings demo documentation to reflect NPU usage when pulling and running pre-converted Hugging Face models via OVMS.

Changes:

  • Adds --target_device NPU to the NPU pull example for OpenVINO/Qwen3-Embedding-0.6B-int8-ov.
  • Updates the “Tested models” table to list NPU as supported for the OpenVINO/*-int8-ov embedding models.
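To make the first change concrete, here is a minimal sketch of what an NPU pull invocation might look like once `--target_device NPU` is added. Only the `--target_device NPU` flag and the model name come from this overview; the helper `build_pull_command` is hypothetical, and the remaining flags (`--pull`, `--source_model`, `--model_repository_path`, `--task`) and the `/models` path are assumptions about the OVMS command line that should be checked against the demo itself. The command is only assembled and printed, not executed.

```python
import shlex


def build_pull_command(source_model, target_device,
                       repo_path="/models", task="embeddings"):
    """Assemble an illustrative OVMS pull command as an argument list.

    Hypothetical helper: every flag except --target_device is an
    assumption about the OVMS CLI, not taken from this pull request.
    """
    return [
        "ovms",
        "--pull",
        "--source_model", source_model,
        "--model_repository_path", repo_path,
        "--task", task,
        # The flag this PR adds to the NPU example.
        "--target_device", target_device,
    ]


cmd = build_pull_command("OpenVINO/Qwen3-Embedding-0.6B-int8-ov", "NPU")
# Print a copy-pasteable shell line without running anything.
print(shlex.join(cmd))
```

Building the command as a list keeps each flag and value separate, so the same sketch can be pointed at `CPU` or `GPU` by changing one argument.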

Comment on lines +305 to +306
|OpenVINO/Qwen3-Embedding-0.6B-int8-ov|LAST|CPU,GPU,NPU|
|OpenVINO/bge-base-en-v1.5-int8-ov|CLS|CPU,GPU,NPU|
@dtrawins dtrawins requested review from mzegla and ngrozae June 23, 2026 13:02
4 participants