Evaluate model predictions against ground truth using COCO, Open Images, or custom protocols. Use when computing mAP, precision, recall, confusion matrices, or analyzing TP/FP/FN examples for detection, classification, segmentation, or regression tasks.
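To make the TP/FP/FN terminology concrete, here is a minimal pure-Python sketch of how detections can be matched to ground truth at a single IoU threshold. It is an illustration under stated assumptions, not the FiftyOne implementation and not the full COCO protocol, which also handles crowd annotations and averages over several IoU thresholds to compute mAP. The names `iou` and `match_detections`, the `[x, y, width, height]` box format, and the sample data are all hypothetical.

```python
def iou(a, b):
    """Intersection over union of two boxes in [x, y, width, height] format."""
    ax2, ay2 = a[0] + a[2], a[1] + a[3]
    bx2, by2 = b[0] + b[2], b[1] + b[3]
    iw = max(0.0, min(ax2, bx2) - max(a[0], b[0]))
    ih = max(0.0, min(ay2, by2) - max(a[1], b[1]))
    inter = iw * ih
    union = a[2] * a[3] + b[2] * b[3] - inter
    return inter / union if union > 0 else 0.0


def match_detections(preds, truths, iou_thresh=0.5):
    """Greedily match predictions to ground truth and count TP/FP/FN.

    preds:  list of (label, confidence, box)
    truths: list of (label, box)
    """
    matched = set()  # indices of ground-truth boxes already claimed
    tp = fp = 0
    # Higher-confidence predictions get first pick of ground-truth boxes.
    for label, _conf, box in sorted(preds, key=lambda p: p[1], reverse=True):
        best_iou, best_idx = 0.0, None
        for idx, (gt_label, gt_box) in enumerate(truths):
            if idx in matched or gt_label != label:
                continue
            overlap = iou(box, gt_box)
            if overlap > best_iou:
                best_iou, best_idx = overlap, idx
        if best_idx is not None and best_iou >= iou_thresh:
            matched.add(best_idx)
            tp += 1
        else:
            fp += 1
    fn = len(truths) - len(matched)  # ground truth that nothing matched
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return {"tp": tp, "fp": fp, "fn": fn,
            "precision": precision, "recall": recall}


# Hypothetical sample data: two ground-truth objects, three predictions.
truths = [("cat", [10, 10, 50, 50]), ("dog", [100, 100, 40, 40])]
preds = [
    ("cat", 0.9, [12, 12, 50, 50]),    # overlaps the cat well
    ("dog", 0.8, [300, 300, 40, 40]),  # far from the real dog
    ("cat", 0.3, [11, 11, 50, 50]),    # duplicate of an already matched cat
]
result = match_detections(preds, truths)
print(result)
```

In this sample, the confident cat prediction is the only true positive, the misplaced dog and the duplicate cat are false positives, and the unmatched dog annotation is a false negative.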
/plugin marketplace add voxel51/fiftyone-skills
/plugin install fiftyone-model-evaluation@fiftyone-skills
Comprehensive feature development workflow with specialized agents for codebase exploration, architecture design, and quality review
Interactive learning mode that requests meaningful code contributions at decision points (mimics the unshipped Learning output style)
Automated code review for pull requests using multiple specialized agents with confidence-based scoring
Comprehensive PR review agents specializing in comments, tests, error handling, type design, code quality, and code simplification