Abstract Text Tutorial

Separate, Locate, and Align: Determine Context Relation of Scene Text From Multiple Perspectives in TextVQA

Abstract: Text-based Visual Question Answering (TextVQA) focuses on answering questions about the scene text in images. Most works in this field use transformer-based models to model the ...

IEEE

Text-to-Image Person Re-Identification Based on Multimodal Graph Convolutional Network

Abstract: Text-to-image person re-identification (ReID) is a common subproblem in the fields of person re-identification and image-text retrieval. Recent approaches generally follow the structure of a ...
