Abstract: Underwater object detection suffers from limited long-range dependency modeling, fine-grained feature representation, and noise suppression, resulting in blurred boundaries, frequent missed ...
[2024.08] Released training and evaluation. [2024.07] Our Hugging Face demo is here; welcome to try our model! [2024.07] Released the model code. TODO: Clean up training and evaluation A ...
Previous research has investigated the application of Multimodal Large Language Models (MLLMs) in understanding 3D scenes by interpreting them as videos. These approaches generally depend on ...