Browse Microsoft Certification Guides

Azure AI-900 Vision Tasks Guide

Study Azure AI-900 Vision Tasks: key concepts, common traps, and exam decision cues.

AI-900 computer-vision questions are easier when you identify the expected output. Is the system assigning a label to the whole image, locating objects, reading text, or working with faces? That output usually tells you the task.

Core task types by output

Task What the output looks like Common clue
image classification one or more labels for the whole image “categorize this image”
object detection labels plus object locations “find the cars in the image”
OCR extracted text from a visual source “read the sign”
facial detection location of a face “find whether a face is present”
facial analysis attributes or information derived from a detected face “analyze the detected face”

OCR vs document processing

OCR is about extracting text from an image or document. Document processing often goes one step further and cares about document structure such as fields, tables, or form layout. AI-900 can test this boundary even if the formal computer-vision service bullets stay centered on visual capabilities.

High-value scenario clues

Scenario clue Strongest first answer
“Which category best describes this image?” image classification
“Detect each package in the warehouse photo” object detection
“Read the street sign text” OCR
“Locate faces in uploaded photos” facial detection
“Analyze information from a detected face” facial analysis

What strong answers usually do

  • decide whether the output is a label, location, extracted text, or face-related result
  • keep OCR separate from generic tagging
  • remember that object detection answers both what and where
  • avoid confusing face-specific tasks with broad scene analysis

Decision order that usually wins

  1. First ask whether the task is label the whole image, locate objects, or read text from an image.
  2. If the output is one category for the whole image, think image classification.
  3. If the output must identify objects and their positions, think object detection.
  4. If the task is extracting printed or handwritten text, think OCR.
  5. AI-900 usually rewards identifying the task shape before naming a service.

Quiz

Loading quiz…
Revised on Sunday, May 10, 2026