Vision-Language Models Tutorial

Revisiting Backdoor Attacks against Large Vision-Language Models from Domain Shift

Abstract: Instruction tuning enhances large vision-language models (LVLMs) but increases their vulnerability to backdoor attacks due to their open design. Unlike prior studies in static settings, this ...

The Robot Report

Vision-language-action models are the next leap in autonomous robotics

Explore how vision-language-action models like Helix, GR00T N1, and RT-1 are enabling robots to understand instructions and act autonomously.

The End of Language As We Know It? Scientists Challenge 60 Years of Linguistic Research

An international team proposes replacing Hockett’s feature checklist with a model of language as a dynamic, multimodal, and socially evolving system.

IEEE

Curiosity-Driven Zero-Shot Object Navigation With Vision-Language Models

Abstract: Zero-shot object navigation (ZSON) in unseen environments poses a significant challenge due to the absence of object-specific priors and the need for efficient exploration. Existing ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results