Segmentation
The Segmenter task supports YOLO-seg heads (v8-seg, v11-seg, v26-seg). It does
everything the detector does and, on top of that, decodes the mask prototypes
into per-instance binary masks.
Building the segmenter
from ort_vision_sdk import Segmenter
seg = Segmenter(
"yolov8n-seg.onnx",
head="yolo-seg", # decoder family (default)
labels="coco", # default — 80-class COCO preset
input_size=(640, 640), # default
conf_threshold=0.25,
iou_threshold=0.45,
max_detections=300,
mask_threshold=0.5, # soft → binary mask cutoff
)
import { Segmenter } from "@mauriciobenjamin700/ort-vision-sdk-web";
const seg = await Segmenter.create("/models/yolov8n-seg.onnx", {
head: "yolo-seg", // default
labels: "coco", // default
inputSize: [640, 640], // default
confThreshold: 0.25,
iouThreshold: 0.45,
maskThreshold: 0.5,
});
Predicting
result = seg.predict("street.jpg")[0]
# Same Boxes view as the detector …
print(result.boxes.xyxy, result.boxes.cls, result.boxes.conf)
# … plus per-instance binary masks
for inst in result:
print(inst.name, inst.conf, inst.box.xyxy)
print(inst.mask.shape) # (h, w) uint8 ∈ {0, 255}, cropped to the bbox
print(inst.segmented_image.shape) # (h, w, 3) RGB with background zeroed
On Web:
const result = (await seg.predict("/images/street.jpg"))[0];
for (const inst of result) {
console.log(inst.className, inst.confidence, inst.bbox.asXyxy());
console.log(inst.mask.width, inst.mask.height); // cropped binary mask
// inst.segmentedImage: RGBImage with the background zeroed out
}
The Masks view
Beyond boxes, the segmentation envelope exposes the bulk masks view
(masks.data, masks.xyxy), mirroring Ultralytics' Masks interface.
Per instance, the mask is cropped to the bounding box:
- Python:
inst.maskis an(h, w)uint8 ndarray with values in{0, 255}, andinst.segmented_imageis the RGB crop with the background zeroed. - Web:
inst.maskis aMaskobject (data/width/height, row-major layout), andinst.segmentedImageis anRGBImage.
See also
- Detection guide — the same
Boxesview and the same flow. - Python API reference
- Web API reference