A computer vision model is a machine learning or deep learning system designed to interpret and understand visual data such as images and video. These models power capabilities like object detection, image classification, segmentation, tracking, and visual search, enabling software to "see" and reason about the physical world. They matter because they automate and scale tasks that previously required human visual inspection, improving accuracy, speed, and safety across many industries.
Fully managed cloud API with pre-trained models for image labeling, OCR, and document AI.
Microsoft’s cloud-based vision services integrated with Azure ecosystem and Cognitive Services.
AWS service for image and video analysis with strong integration into AWS tooling.
Open-source computer vision library offering classical CV algorithms and some deep learning integrations.
Facebook AI Research’s open-source platform for object detection and segmentation based on deep learning.