In this video, I show how to prepare audio data for deep learning applications using Python and Librosa, an audio analysis library. Starting from an audio file, I apply the Fourier Transform to obtain the power spectrum, then use the Short-Time Fourier Transform to compute the spectrogram. I also show how to extract Mel-frequency cepstral coefficients (MFCCs) and visualise all features.
Code:
https://github.com/musikalkemist/DeepLearningForAudioWithPython/tree/master/11-%20Preprocessing%20audio%20data%20for%20deep%20learning/code
Interested in hiring me as a consultant/freelancer?
https://valeriovelardo.com/
Join The Sound Of AI Slack community:
https://valeriovelardo.com/the-sound-of-ai-community/
Follow Valerio on Facebook:
https://www.facebook.com/TheSoundOfAI
Valerio's LinkedIn:
https://www.linkedin.com/in/valeriovelardo/
Valerio's Twitter:
https://twitter.com/musikalkemist