PiggyBack: Pretrained Visual Question Answering Environment for Backing up Non-deep Learning Professionals

Zhihao Zhang, Siwen Luo, Junyi Chen, Sijia Lai, Siqu Long, Hyunsuk Chung, Soyeon Caren Han

Research output: Chapter in Book/Conference paperConference paperpeer-review

Abstract

We propose a PiggyBack, a Visual Question Answering platform that allows users to apply the state-of-the-art visual-language pretrained models easily. We integrate visual-language models, pretrained by HuggingFace, an open-source API platform of deep learning technologies; however, it cannot be runnable without programming skills or deep learning understanding. Hence, our PiggyBack supports an easy-to-use browser-based user interface with several deep-learning visual language pretrained models for general users and domain experts. The PiggyBack includes the following benefits: Portability due to web-based and thus runs on almost any platform, A comprehensive data creation and processing technique, and ease of use on visual language pretrained models. The demo video can be found at https://youtu.be/iz44RZ1lF4s.

Original languageEnglish
Title of host publicationWSDM 2023 - Proceedings of the 16th ACM International Conference on Web Search and Data Mining
PublisherAssociation for Computing Machinery (ACM)
ISBN (Electronic)9781450394079
DOIs
Publication statusPublished - 27 Feb 2023
Event16th ACM International Conference on Web Search and Data Mining, WSDM 2023 - Singapore, Singapore
Duration: 27 Feb 20233 Mar 2023

Publication series

NameWSDM - Proceedings of the ACM International Conference on Web Search and Data Mining

Conference

Conference16th ACM International Conference on Web Search and Data Mining, WSDM 2023
Country/TerritorySingapore
CitySingapore
Period27/02/233/03/23

Fingerprint

Dive into the research topics of 'PiggyBack: Pretrained Visual Question Answering Environment for Backing up Non-deep Learning Professionals'. Together they form a unique fingerprint.

Cite this