Object Detector
Find and label every object in a photo. Drop in an image, pick a model, and the detector locates each object, draws a labeled bounding box around it, counts what it found, and lets you download the annotated image. There are two quality tiers: Fast is YOLO11n, a roughly 10 MB model that loads quickly, and Best is D-FINE-S, an Apache-2.0 model that is noticeably more accurate for a slightly larger download of roughly 11 MB. Both models run entirely in your browser (WebGPU-accelerated when available), so nothing is uploaded. The model you pick downloads once on first use and is then cached. Drag the confidence slider to hide weak guesses; it re-filters instantly without re-running the model.
How to detect objects in a photo
- Drop an image in, or click to browse.
- Wait a few seconds while the AI scans the photo for objects.
- Adjust the confidence slider if you like, then download the annotated image.
Examples
Photo to labeled objects
A photo of two cats on a couch with two TV remotes
Boxes labeled cat, cat, remote, remote with confidence scores
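As a rough illustration of how the count for this example could be tallied from the detected labels, here is a minimal sketch. The function name `countByLabel` is hypothetical and not taken from the tool itself.

```typescript
// Hypothetical helper for illustration; not the tool's actual code.
// Tallies how many times each label appears in a list of detections.
function countByLabel(labels: readonly string[]): Record<string, number> {
  const counts: Record<string, number> = {};
  for (const label of labels) {
    // Start at 0 the first time a label is seen, then add one.
    counts[label] = (counts[label] ?? 0) + 1;
  }
  return counts;
}

// Labels from the example above: two cats and two TV remotes.
const summary = countByLabel(["cat", "cat", "remote", "remote"]);
console.log(summary);
```

For the photo above, the tally would hold 2 for cat and 2 for remote.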
Frequently asked questions
Is my image uploaded anywhere?
No. Whichever model you pick runs entirely in your browser, using your GPU via WebGPU when available and WebAssembly otherwise. The image is processed on your device and never uploaded. Only the model itself is downloaded, once, then cached for instant reuse.
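The choice between WebGPU and WebAssembly can be sketched as a small feature check. This is a minimal illustration under stated assumptions, not the tool's actual code: `pickBackend`, `Backend`, and `NavigatorLike` are hypothetical names. A production check would likely also await `navigator.gpu.requestAdapter()`, because the `gpu` property can exist on a device that has no usable adapter.

```typescript
// Hypothetical names for illustration; not taken from the tool's source.
type Backend = "webgpu" | "wasm";

// Minimal shape of the browser's `navigator` object that this check relies on.
interface NavigatorLike {
  gpu?: unknown;
}

// Prefer WebGPU when the browser exposes `navigator.gpu`;
// otherwise fall back to WebAssembly.
function pickBackend(nav: NavigatorLike | undefined): Backend {
  if (nav !== undefined && nav.gpu !== undefined && nav.gpu !== null) {
    return "webgpu";
  }
  return "wasm";
}
```

In a browser this might be called as `pickBackend(navigator)`. Passing the object in as a parameter, rather than reading the global directly, keeps the check easy to exercise outside a browser.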
What kinds of objects can it detect?
It detects 80 common object types from the COCO dataset: people, animals (cats, dogs, birds, horses), vehicles (cars, bikes, buses, planes), and everyday objects (chairs, laptops, bottles, cups, phones, TV remotes, and more).
Which model powers the detection?
You pick the tier. Fast is YOLO11n, a real-time detector that ships as a roughly 10 MB ONNX model and loads quickly. Best is D-FINE-S, an Apache-2.0 transformer-based detector that is noticeably more accurate (about 48.5 COCO AP versus 39.5 for YOLO11n) for a similar, roughly 11 MB download. Both predict a labeled box and a confidence score per object and run locally in your browser (WebGPU-accelerated when available, WebAssembly otherwise).
What does the confidence slider do?
Each detection has a score from 0 to 1 for how sure the model is. The slider hides any box below the threshold you pick, so you can trim weak or spurious guesses. It re-filters the existing results instantly without re-running the model.
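The re-filtering step can be sketched as a plain filter over results that are already in memory. This is a minimal sketch, not the tool's actual code: the `Detection` shape, the function name `filterByConfidence`, and the sample scores and boxes are all assumptions made up for illustration.

```typescript
// Hypothetical shape of one detection; the field names are assumptions.
interface Detection {
  label: string;
  score: number; // confidence from 0 to 1
  box: [number, number, number, number]; // x, y, width, height in pixels
}

// Keep only detections at or above the threshold.
// This works on stored results, so the model is not run again.
function filterByConfidence(
  all: readonly Detection[],
  threshold: number
): Detection[] {
  return all.filter((d) => d.score >= threshold);
}

// Illustrative results only; these scores and boxes are made up.
const stored: Detection[] = [
  { label: "cat", score: 0.93, box: [40, 60, 220, 180] },
  { label: "cat", score: 0.88, box: [300, 70, 210, 170] },
  { label: "remote", score: 0.61, box: [120, 260, 60, 25] },
  { label: "remote", score: 0.34, box: [350, 270, 55, 20] },
];

// Moving the slider to 0.5 hides the weakest remote and keeps the rest.
const visible = filterByConfidence(stored, 0.5);
console.log(visible.map((d) => d.label));
```

Because the full result list is kept and only the visible subset changes, lowering the slider again brings hidden boxes back without any new inference.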
Why does the first run take a moment?
The model you pick (roughly 10 MB for Fast, 11 MB for Best) downloads the first time you use it, then is cached for instant reuse. Larger images also take a little longer because more pixels are analyzed.
Related tools
AI Background Remover
Remove the background from a photo automatically with AI, right in your browser. Get a transparent PNG or a solid color. Nothing is uploaded.
AI Depth Map Generator
Turn any photo into a depth map with AI, right in your browser. Pick a colormap, invert it, and download a PNG. Nothing is uploaded.
Image to Text (OCR)
Extract text from an image with OCR, right in your browser. Supports many languages; copy or download the result. Your image is never uploaded.
Add Border to Image
Add a colored border or frame around an image, with adjustable width and color. Live preview, runs in your browser, nothing uploaded.
AI Alt Text Generator
Generate accessible alt text for any image with AI, right in your browser. Copy clean alt text or a ready HTML img tag. Nothing is uploaded.
AI Image Upscaler
Upscale and enhance images with AI in your browser. Increase resolution 2x or 4x with sharp detail, no blur. Free, private, nothing uploaded.