Microsoft Azure Custom Vision screenshot

What is Microsoft Azure Custom Vision?

Microsoft Azure Custom Vision is a machine learning service that lets you train image recognition models without needing deep expertise in data science. You upload images, label them, and the service builds a classifier or object detector tailored to your needs. It's part of Azure Cognitive Services and integrates with your existing applications through APIs or by exporting models for offline use. The tool handles the training process automatically, making it accessible to developers and businesses who need custom image analysis but lack large data science teams.

Key Features

Image classification

train models to categorise images into custom classes you define

Object detection

locate and identify specific items within images, not just classify the whole image

Quick training

upload labelled images and get a working model in minutes rather than weeks

Export options

download trained models to run locally on edge devices, mobile apps, or servers

Performance metrics

view precision, recall, and other accuracy measures to assess model quality

API integration

connect your trained model to applications via REST or Python SDKs

Pros & Cons

Advantages

  • Minimal machine learning knowledge required; the interface guides you through the process
  • Fast iteration: test and improve models quickly without writing training code
  • Flexible deployment: run models in the cloud via API or export them for offline use
  • Free tier available: start at no cost with reasonable limits on images and predictions

Limitations

  • Smaller datasets than enterprise ML platforms; best for problems with hundreds to thousands of images rather than millions
  • Limited customisation of the underlying algorithm; you have fewer tuning options than building models from scratch
  • Costs can accumulate if you need high prediction volumes; free tier includes limited monthly transactions

Use Cases

Quality control in manufacturing: detect defects in products on assembly lines

Retail: identify products from customer photos to enable visual search

Agriculture: classify crop health or identify plant diseases from field images

Document processing: recognise and sort documents by type or content

Wildlife monitoring: identify animal species in camera trap footage