Kevin Desai scite author profile

With the ever-increasing amount of data, the central challenge in multimodal learning is the limited availability of labelled samples. For the task of classification, techniques such as meta-learning, zero-shot learning, and few-shot learning showcase the ability to learn information about novel classes based on prior knowledge. Recent techniques try to learn a cross-modal mapping between the semantic space and the image space. However, they tend to ignore the local and global semantic knowledge. To overcome this problem, we propose a Multimodal Variational Auto-Encoder (M-VAE) which can learn the shared latent space of image features and the semantic space. In our approach, we concatenate multimodal data into a single embedding before passing it to the VAE for learning the latent space. We propose the use of a multi-modal loss during the reconstruction of the feature embedding through the decoder. Our approach is capable of correlating modalities and exploiting the local and global semantic knowledge for novel sample predictions. Our experimental results using an MLP classifier on four benchmark datasets show that our proposed model outperforms the current state-of-the-art approaches for generalized zero-shot learning.
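The pipeline described in the abstract (concatenate the modalities, encode to a latent Gaussian, decode, and score the reconstruction per modality) can be illustrated with a minimal sketch in plain Python. This is not the authors' implementation: the single linear layers, the mean-squared-error reconstruction terms, the `semantic_weight` factor, the dimensions, and every function and parameter name below are assumptions chosen only to make the idea concrete.

```python
import math
import random


def linear(weights, bias, x):
    """Apply y = W x + b, with W given as a list of rows."""
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, bias)]


def init_layer(rng, n_out, n_in, scale=0.1):
    """Small random weights and zero biases (hypothetical initialisation)."""
    weights = [[rng.gauss(0.0, scale) for _ in range(n_in)] for _ in range(n_out)]
    return weights, [0.0] * n_out


def mse(a, b):
    """Mean squared error between two equal-length vectors."""
    return sum((p - q) ** 2 for p, q in zip(a, b)) / len(a)


def mvae_loss(image_feat, semantic_vec, params, rng, semantic_weight=1.0):
    """One forward pass of a toy multimodal VAE; returns the loss terms."""
    # 1. Concatenate both modalities into a single embedding.
    x = image_feat + semantic_vec
    # 2. Encode into the mean and log-variance of the latent Gaussian.
    mu = linear(*params["enc_mu"], x)
    logvar = linear(*params["enc_logvar"], x)
    # 3. Reparameterisation trick: z = mu + sigma * eps.
    z = [m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0) for m, lv in zip(mu, logvar)]
    # 4. Decode, then split the reconstruction back into the two modalities.
    recon = linear(*params["dec"], z)
    split = len(image_feat)
    recon_img, recon_sem = recon[:split], recon[split:]
    # 5. Assumed multimodal loss: one reconstruction term per modality,
    #    plus the usual KL divergence to a standard normal prior.
    loss_img = mse(recon_img, image_feat)
    loss_sem = mse(recon_sem, semantic_vec)
    kl = -0.5 * sum(1.0 + lv - m * m - math.exp(lv) for m, lv in zip(mu, logvar))
    total = loss_img + semantic_weight * loss_sem + kl
    return {"image": loss_img, "semantic": loss_sem, "kl": kl, "total": total}


# Toy usage with made-up dimensions and random inputs.
rng = random.Random(0)
img_dim, sem_dim, latent_dim = 6, 4, 3
params = {
    "enc_mu": init_layer(rng, latent_dim, img_dim + sem_dim),
    "enc_logvar": init_layer(rng, latent_dim, img_dim + sem_dim),
    "dec": init_layer(rng, img_dim + sem_dim, latent_dim),
}
image_feat = [rng.random() for _ in range(img_dim)]
semantic_vec = [rng.random() for _ in range(sem_dim)]
losses = mvae_loss(image_feat, semantic_vec, params, rng)
print(sorted(losses))
```

Keeping a separate reconstruction term for each modality lets the two be weighted independently, which is one plausible reading of "multi-modal loss"; the paper itself should be consulted for the actual formulation and for how the learned latent space feeds the MLP classifier.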


Experiences with Multi-modal Collaborative Virtual Laboratory (MMCVL)

Desai, Belmonte, Jin, et al. 2017




Kevin Desai

Botulinum Toxin Type A for the Treatment of Postamputation Residual Limb Myokymia: A Case Report

Augmented reality-based exergames for rehabilitation

Skeleton-based continuous extrinsic calibration of multiple RGB-D kinect cameras

Using Mr. MAPP for Lower Limb Phantom Pain Management

TMVNet : Using Transformers for Multi-view Voxel-based 3D Reconstruction

Cybersickness Prediction from Integrated HMD’s Sensors: A Multimodal Deep Fusion Approach using Eye-tracking and Head-tracking Data

Generalized Zero-Shot Learning Using Multimodal Variational Auto-Encoder With Semantic Concepts

Experiences with Multi-modal Collaborative Virtual Laboratory (MMCVL)
