International Journal of Networked and Distributed Computing

Volume 8, Issue 2, March 2020, Pages 58 - 66

Multimedia Analysis and Fusion via Wasserstein Barycenter

Authors
Cong Jin1, *, Junhao Wang1, Jin Wei1, Lifeng Tan1, Shouxun Liu1, Wei Zhao1, Shan Liu1, Xin Lv2
1School of Information and Communication Engineering, Communication University of China, Beijing 100024, China
2School of Animation and Digital Arts, Communication University of China, Beijing 100024, China
*Corresponding author. Email: jincong0623@cuc.edu.cn
Corresponding Author
Cong Jin
Received 13 August 2019, Accepted 18 October 2019, Available Online 26 February 2020.
DOI
10.2991/ijndc.k.200217.001How to use a DOI?
Keywords
Multimedia analysis; Wasserstein Barycenter; fusion
Abstract

Optimal transport distance, otherwise known as Wasserstein distance, recently has attracted attention in music signal processing and machine learning as powerful discrepancy measures for probability distributions. In this paper, we propose an ensemble approach with Wasserstein distance to integrate various music transcription methods and combine different music classification models so as to achieve a more robust solution. The main idea is to model the ensemble as a problem of Wasserstein Barycenter, where our two experimental results show that our ensemble approach outperforms existing methods to a significant extent. Our proposal offers a new visual angle on the application of Wasserstein distance through music transcription and music classification in multimedia analysis and fusion tasks.

Copyright
© 2020 The Authors. Published by Atlantis Press SARL.
Open Access
This is an open access article distributed under the CC BY-NC 4.0 license (http://creativecommons.org/licenses/by-nc/4.0/).

Download article (PDF)
View full text (HTML)

Journal
International Journal of Networked and Distributed Computing
Volume-Issue
8 - 2
Pages
58 - 66
Publication Date
2020/02/26
ISSN (Online)
2211-7946
ISSN (Print)
2211-7938
DOI
10.2991/ijndc.k.200217.001How to use a DOI?
Copyright
© 2020 The Authors. Published by Atlantis Press SARL.
Open Access
This is an open access article distributed under the CC BY-NC 4.0 license (http://creativecommons.org/licenses/by-nc/4.0/).

Cite this article

TY  - JOUR
AU  - Cong Jin
AU  - Junhao Wang
AU  - Jin Wei
AU  - Lifeng Tan
AU  - Shouxun Liu
AU  - Wei Zhao
AU  - Shan Liu
AU  - Xin Lv
PY  - 2020
DA  - 2020/02/26
TI  - Multimedia Analysis and Fusion via Wasserstein Barycenter
JO  - International Journal of Networked and Distributed Computing
SP  - 58
EP  - 66
VL  - 8
IS  - 2
SN  - 2211-7946
UR  - https://doi.org/10.2991/ijndc.k.200217.001
DO  - 10.2991/ijndc.k.200217.001
ID  - Jin2020
ER  -