Abstract: Underwater object detection suffers from limited long-range dependency modeling, fine-grained feature representation, and noise suppression, resulting in blurred boundaries, frequent missed ...
[ 2024.08 ] Release training and evaluation. [ 2024.07 ] Our huggingface DEMO is here DEMO, welcome to try our model! [ 2024.07 ] Release codes of model! TODO: Clean up training and evaluation A ...
Previous research has investigated the application of Multimodal Large Language Models (MLLMs) in understanding 3D scenes by interpreting them as videos. These approaches generally depend on ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results