Depth Scene Analyzer

AI-powered spatial scene understanding with object size estimation from single images

Overview

This project combines GroundingDINO, SAM, and Depth Anything V2 to estimate real-world object sizes from single images. Users provide a reference object with known dimensions to calibrate measurements of other objects in the scene.

Key Technologies

PythonPyTorchTransformersOpenCVStreamlit

AI Models: GroundingDINO • SAM • Depth Anything V2

Key Results

95%

Detection Success

±25-35%

Measurement Accuracy

<1s

Processing Time

Pipeline Results
Real example: Office scene with laptop detection
Depth estimation map

Depth Estimation

Depth Anything V2 depth map

SAM segmentation results

Object Segmentation

SAM precise masks

Combined pipeline results

Final Analysis

Combined visualization

Result: Laptop dimensions estimated with 30% accuracy using reference calibration.See detailed analysis →