Abstract: In this article, we introduce a new benchmark dataset for the challenging writing in the air (WiTA) task—an elaborate task bridging vision and natural language processing (NLP). WiTA ...