TY - GEN
T1 - Audio Translation with Conditional Generative Adversarial Networks
AU - Moussa, Ahmad
AU - Watanabe, Hiroshi
N1 - Publisher Copyright:
© 2020 IEEE.
PY - 2020/2
Y1 - 2020/2
N2 - This paper explores the applicability of conditional generative adversarial networks in audio-to-audio translation problems and proposes a neural network architecture capable of doing so. Recent advances have shown that causal convolutions can be effective for modeling raw audio when their kernel is dilated by many factors, in contrast to previous techniques that utilized recurrent approaches. Embedding such convolutions within a conditional GAN architecture allows the targeted generation of raw audio given a certain input. This architecture can then be used to learn and simulate certain translative operations applied to an input signal. This creates the defined problem of converting one audio signal into another, which has different characteristics. We also propose a novel discriminator structure for the evaluation of generated audio.
AB - This paper explores the applicability of conditional generative adversarial networks in audio-to-audio translation problems and proposes a neural network architecture capable of doing so. Recent advances have shown that causal convolutions can be effective for modeling raw audio when their kernel is dilated by many factors, in contrast to previous techniques that utilized recurrent approaches. Embedding such convolutions within a conditional GAN architecture allows the targeted generation of raw audio given a certain input. This architecture can then be used to learn and simulate certain translative operations applied to an input signal. This creates the defined problem of converting one audio signal into another, which has different characteristics. We also propose a novel discriminator structure for the evaluation of generated audio.
KW - audio effects
KW - causal dilated convolutions
KW - conditional generative adversarial networks
KW - signal processing
UR - http://www.scopus.com/inward/record.url?scp=85084076296&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85084076296&partnerID=8YFLogxK
U2 - 10.1109/ICAIIC48513.2020.9065067
DO - 10.1109/ICAIIC48513.2020.9065067
M3 - Conference contribution
AN - SCOPUS:85084076296
T3 - 2020 International Conference on Artificial Intelligence in Information and Communication, ICAIIC 2020
SP - 438
EP - 442
BT - 2020 International Conference on Artificial Intelligence in Information and Communication, ICAIIC 2020
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2nd International Conference on Artificial Intelligence in Information and Communication, ICAIIC 2020
Y2 - 19 February 2020 through 21 February 2020
ER -