Hi,
I have a question regarding the use of depth information in WildDet3D. In the paper, it is mentioned that providing depth information and camera intrinsics can improve the model performance. However, in my own experiments, I observed a somewhat unexpected result.
I tested three input settings:
RGB only
RGB + camera intrinsics
RGB-D + camera intrinsics
I placed a person in front of the camera and compared the predicted height of the 3D bounding box. The person’s actual height is 1.78 m. The results were approximately:
RGB only: 1.75–1.80 m
RGB + intrinsics: 1.80–1.90 m
RGB-D + intrinsics: 1.50–1.60 m
Interestingly, the RGB-D setting produced a less accurate height estimate than RGB-only or RGB + intrinsics in my case.
I am using an Orbbec Gemini 336L RGB-D camera. I would like to ask what depth sensor or depth device was used in your experiments. Were there any specific requirements for the depth map, such as depth scale, alignment between RGB and depth, valid depth range, or preprocessing steps?
Any guidance would be greatly appreciated. Thank you!
RGB only
RGB + camera intrinsics
RGB-D + camera intrinsics
demo code
app.py
Hi,
I have a question regarding the use of depth information in WildDet3D. In the paper, it is mentioned that providing depth information and camera intrinsics can improve the model performance. However, in my own experiments, I observed a somewhat unexpected result.
I tested three input settings:
RGB only
RGB + camera intrinsics
RGB-D + camera intrinsics
I placed a person in front of the camera and compared the predicted height of the 3D bounding box. The person’s actual height is 1.78 m. The results were approximately:
RGB only: 1.75–1.80 m
RGB + intrinsics: 1.80–1.90 m
RGB-D + intrinsics: 1.50–1.60 m
Interestingly, the RGB-D setting produced a less accurate height estimate than RGB-only or RGB + intrinsics in my case.
I am using an Orbbec Gemini 336L RGB-D camera. I would like to ask what depth sensor or depth device was used in your experiments. Were there any specific requirements for the depth map, such as depth scale, alignment between RGB and depth, valid depth range, or preprocessing steps?
Any guidance would be greatly appreciated. Thank you!
RGB only
RGB + camera intrinsics
RGB-D + camera intrinsics
demo code
app.py