In summary:
- The RoadSense3D dataset consists of labeled traffic scenarios recorded at various lighting and weather conditions such.
- We provide an in-depth comparison of state-of-the-art monocular 3D object detection methods.
- We extend the Cube R-CNN model to make it compatible with various datasets.
- We develop domain adaptation methods to improve generalization.
- We perform extensive transfer learning experiments and ablation studies on the RoadSense3D dataset, the TUM Traffic datasets, and the DAIR-V2X dataset.
- We open-source our code and dataset and provide some qualitative video results.