Proceedings of the First International Conference on Combinatorial and Optimization, ICCAP 2021, December 7-8 2021, Chennai, India

Research Article

3D - Learning Representations From Audio Using Autoencoders

Download588 downloads
  • @INPROCEEDINGS{10.4108/eai.7-12-2021.2314968,
        author={Bharathi  A and Prakash  J},
        title={3D - Learning Representations From Audio Using Autoencoders},
        proceedings={Proceedings of the First International Conference on Combinatorial and Optimization, ICCAP 2021, December 7-8 2021, Chennai, India},
        publisher={EAI},
        proceedings_a={ICCAP},
        year={2021},
        month={12},
        keywords={: audio signal auto correlogram rnn encoder rnn decoder},
        doi={10.4108/eai.7-12-2021.2314968}
    }
    
  • Bharathi A
    Prakash J
    Year: 2021
    3D - Learning Representations From Audio Using Autoencoders
    ICCAP
    EAI
    DOI: 10.4108/eai.7-12-2021.2314968
Bharathi A1,*, Prakash J1
  • 1: PSG College of technology
*Contact email: abi.cse@psgtech.ac.in

Abstract

Deep learning methods permit us to tackle signal processing challenges from a dissimilar perspective, which is currently overlooked in the composition of music in cinema industry. Audio is inherently added time-sensitive than movie. Audios are encoded using other past methods, resulting in data loss or temporal anomalies. This problem is alleviated by using an auto correlogram with a 3-dimensional view, including time, power, and frequency, to improve accuracy. First, acoustic data should be competently encoded into a compressed format using RNN autoencoder by interrelating with the information. As a result of the compressed format, audio waves should be accurately represented. After that, audio waves are rebuilt into an audio structure with little data loss. The accuracy is improved by 10% by using the RNN encoder and decoder.