Pixel-Level Pavement Distress Assessment Using Instance Segmentation
Mirrored from Hugging Face Daily Papers for archival readability. Support the source by reading on the original site.
Abstract
A vision-based pavement distress analysis system using Mask R-CNN instance segmentation is evaluated for crack detection and quantification on a custom field-collected dataset. Its best backbone reaches 84.23% precision and 90.04% recall, compared with 27.5% precision and 20.7% recall for a retrained YOLO detector used as a detection-oriented reference.
Automated pavement distress assessment requires more than image-level classification or coarse bounding box detection, demanding precise localization of thin, branching, and irregular cracks to achieve the geometric precision necessary for maintenance-relevant quantification. This paper presents a vision-based pavement distress analysis system based on Mask R-CNN instance segmentation and evaluates it on UWGB-StreetCrack, a custom field-collected roadway image dataset acquired with a vehicle-mounted smartphone and manually annotated with polygon labels for longitudinal cracks, transverse cracks, alligator cracks, and potholes. Five Detectron2-based Mask R-CNN backbone variants were considered under a consistent fine-tuning protocol. The best-performing model, Mask R-CNN with a ResNet-101 FPN backbone, achieved 84.23% precision, 90.04% recall, and an F1 score of 87.04% under the project-specific bounding-box matching protocol. The same model produced an aggregate predicted crack-area fraction of 2.164%, closely matching the 2.170% ground-truth crack-area fraction. To contextualize the segmentation system against a detector-oriented alternative, a CSPDarknet53-based YOLO detector was also adapted and retrained on the dataset, reaching 27.5% precision and 20.7% recall on the validation protocol. The results show that instance segmentation is a practical direction for field pavement imagery and aggregate crack-area estimation, while also exposing open challenges in annotation consistency, class imbalance, confounder rejection, and mask-level benchmarking.
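The reported F1 score and the agreement between predicted and ground-truth crack-area fractions can be checked with a few lines of arithmetic. The following is a minimal sketch, not the authors' evaluation code: the function names `f1_score` and `area_fraction` are hypothetical, the toy masks are made up for illustration, and pooling pixels over all images is an assumption about how the aggregate fraction is formed.

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall, both given as fractions in [0, 1]."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def area_fraction(masks):
    """Fraction of pixels marked as distress across a list of binary masks.

    Each mask is a list of rows holding 0/1 values. Pixels are pooled over
    all images (an assumption, not a detail confirmed by the abstract).
    """
    distress = sum(sum(row) for mask in masks for row in mask)
    total = sum(len(row) for mask in masks for row in mask)
    return distress / total if total else 0.0


# Values reported in the abstract for Mask R-CNN with a ResNet-101 FPN backbone.
f1_maskrcnn = f1_score(0.8423, 0.9004)

# Relative gap between predicted (2.164%) and ground-truth (2.170%) area fractions.
rel_gap = abs(2.164 - 2.170) / 2.170

# Toy example: two 2x4 masks with 4 distress pixels out of 16 in total.
toy_masks = [
    [[0, 1, 0, 0], [0, 1, 0, 0]],
    [[0, 0, 0, 0], [1, 1, 0, 0]],
]
toy_fraction = area_fraction(toy_masks)

print(f"F1 = {f1_maskrcnn:.2%}, relative area gap = {rel_gap:.2%}")
```

With the abstract's precision and recall, the harmonic mean reproduces the stated 87.04% F1, and the two aggregate area fractions differ by roughly 0.28% in relative terms. An aggregate match of this kind does not by itself show per-image mask accuracy, which fits the abstract's note that mask-level benchmarking remains an open challenge.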