Azure AI-900 Vision Tasks Guide

April 13, 2026

Study Azure AI-900 Vision Tasks: key concepts, common traps, and exam decision cues.

AI-900 computer-vision questions are easier when you identify the expected output. Is the system assigning a label to the whole image, locating objects, reading text, or working with faces? That output usually tells you the task.

Core task types by output

Task	What the output looks like	Common clue
image classification	one or more labels for the whole image	“categorize this image”
object detection	labels plus object locations	“find the cars in the image”
OCR	extracted text from a visual source	“read the sign”
facial detection	location of a face	“find whether a face is present”
facial analysis	attributes or information derived from a detected face	“analyze the detected face”
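
One way to keep these outputs apart is to sketch them as data shapes. The Python types below are hypothetical illustrations only, not the response format of any Azure service; names such as `BoundingBox`, `box`, and `attributes` are assumptions made for this example.

```python
from dataclasses import dataclass, field

# Hypothetical shapes for illustration only; not an Azure response schema.

@dataclass
class BoundingBox:
    x: int        # left edge in pixels
    y: int        # top edge in pixels
    width: int
    height: int

@dataclass
class ClassificationResult:
    labels: list[str]          # labels describe the whole image; no positions

@dataclass
class DetectedObject:
    label: str                 # what the object is
    box: BoundingBox           # where it is

@dataclass
class OcrResult:
    text: str                  # the extracted text itself

@dataclass
class DetectedFace:
    box: BoundingBox           # facial detection: location only
    # facial analysis would add derived information here (assumed field)
    attributes: dict[str, str] = field(default_factory=dict)


classification = ClassificationResult(labels=["street scene"])
detection = [DetectedObject("car", BoundingBox(40, 60, 120, 80))]
ocr = OcrResult(text="STOP")
face = DetectedFace(box=BoundingBox(10, 10, 50, 50))
```

The point of the sketch is the difference in shape: classification carries no position, object detection pairs every label with a location, and OCR returns text rather than labels.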

OCR vs document processing

OCR is about extracting text from an image or document. Document processing often goes one step further and cares about document structure such as fields, tables, or form layout. AI-900 can test this boundary even though the computer-vision objectives themselves stay centered on visual capabilities.
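
The boundary can be illustrated with a toy example. The snippet below is a minimal sketch, not how any real document-processing service works: `extract_fields` is a hypothetical helper that treats `Key: value` lines as fields, and the sample text is invented for illustration.

```python
def extract_fields(ocr_text: str) -> dict[str, str]:
    """Toy stand-in for document processing: turn 'Key: value' lines into fields."""
    fields = {}
    for line in ocr_text.splitlines():
        key, sep, value = line.partition(":")
        if sep:  # keep only lines that look like a labelled field
            fields[key.strip()] = value.strip()
    return fields


# OCR-style output: just the text that was read (sample text is made up).
ocr_text = "INVOICE\nInvoice No: 1234\nTotal: 56.00"

# Document-processing-style output: the same content organized by structure.
fields = extract_fields(ocr_text)
```

Here `ocr_text` answers "what does it say?", while `fields` answers "what is the invoice number and total?". A scenario that asks for the second kind of output is pointing past plain OCR.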

High-value scenario clues

Scenario clue	Strongest first answer
“Which category best describes this image?”	image classification
“Detect each package in the warehouse photo”	object detection
“Read the street sign text”	OCR
“Locate faces in uploaded photos”	facial detection
“Analyze information from a detected face”	facial analysis

What strong answers usually do

decide whether the output is a label, location, extracted text, or face-related result
keep OCR separate from generic tagging
remember that object detection answers both what and where
avoid confusing face-specific tasks with broad scene analysis

Decision order that usually wins

First ask whether the task is to label the whole image, locate objects, or read text from an image.
If the output is one or more labels for the whole image, think image classification.
If the output must identify objects and their positions, think object detection.
If the task is extracting printed or handwritten text, think OCR.
AI-900 usually rewards identifying the task shape before naming a service.
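
The decision order above can be sketched as a small function. This is a study aid under stated assumptions, not an official decision procedure: `pick_vision_task` and its two boolean parameters are hypothetical names, and face-related tasks are deliberately left out to mirror the three questions in the list.

```python
def pick_vision_task(extracts_text: bool, needs_positions: bool) -> str:
    """Map the expected output shape to a task name (hypothetical helper)."""
    if extracts_text:
        # Printed or handwritten text must be read from the image.
        return "OCR"
    if needs_positions:
        # The answer must say both what the objects are and where they are.
        return "object detection"
    # Only labels for the whole image are required.
    return "image classification"


# "Read the street sign text"
sign_task = pick_vision_task(extracts_text=True, needs_positions=False)

# "Detect each package in the warehouse photo"
package_task = pick_vision_task(extracts_text=False, needs_positions=True)

# "Which category best describes this image?"
category_task = pick_vision_task(extracts_text=False, needs_positions=False)
```

Note that the function never mentions a service name. It settles the task shape first, which is the step the exam usually rewards.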

Revised on Monday, June 15, 2026
