Skip to content
Week of 7/26/21 – 8/1/21 Direction of arrival of Sound: Was able to find this. References A New Moving Sound Source Localization Method Based on the Time Difference of Arrival (2010) – Read , provides a method to fins the distance of the sound source Time difference of arrival estimation of sound source using cross correlation and modified maximum likelihood weighting function (2017) -Read, provides a new weighing function for the Generalized Cross Correlation (GCC) that combines ML and PHAT-pi weighting functions. Direction of arrival estimation – A two microphones approach (2010)GCC-PHAT CROSS-CORRELATION AUDIO FEATURES FOR SIMULTANEOUS SOUND EVENT LOCALIZATION AND DETECTION (SELD) ON MULTIPLE ROOMS (2019)A Comparison of Generalized Cross-Correlation Methods for Time Delay Estimation (2017)A moving sound source localization method based on TDOA (2014)A TALKER TRACKING METHOD USING TWO MICROPHONES BASED ON THE SOUND SOURCE LOCALIZATION (2005)Localization of multiple acoustic sources with small arrays using a coherence test (2008) Week of 7/19/21 – 7/25/21 Surround sound /Stereo/Mono: Surround sound has more than 2 speakers and a subwoofer. Stereo sound has 2 speakers and does not always have a subwoofer.Mono audio files = 1 channel Stereo audio files = 2 channels Surround audio files = more than 2 channels Surround 2.1 uses two stereo speakers and one subwoofer Convolution vs Cross Correlation: Convolution is a measurement of effect of one signal on the other signal. Cross correlation is the measurement of similarity between the two signals. Convolution between the two continuous time signals (this convolution is also called the convolution integral). Note: The lower and upper limits can be set to 0 for the causal signals. For the discrete time signals (the convolution is called the convolution sum). Note, the lower limit can be set to 0 for the causal systems. To find this numerically, there are few methods.Graphical Procedure Sliding Tape Method CorrelationFor the continuous time signals For the discrete time signals References Surround sound2.1 vs. 5.1 vs. 7.1 Surround Sound The Difference Between Mono and Stereo with Audio Example – the best reading material in this spaceWhat is a Stereo Microphone? How is Surround Sound Different Than Stereo? Basic Differences Among Monophonic, Stereophonic and Surround Sound Monophonic, Stereophonic and Surround Sound Differences Mono vs. Stereo Sound: The Difference Explained (With Audio Examples) Time Difference of Arrival (TDOA) estimationThe generalized correlation method for estimation of time delay – (1976), Read- provides a Maximum Likelihood weighing function for Generalized Cross Correlation Convolution and Correlationhttps://www.youtube.com/watch?v=O9-HN-yzsFQ&t=0s – Visualization of convolution and correlation How to Measure a Time Delay Using Cross Correlation? – video https://www.youtube.com/watch?v=oCcUm0_rUJw – Determining signal similaritiesConvolution and Correlation explained (Math) – A comprehensive explanationTeaching the concept of convolution and correlation using Fourier transform Week of 7/12/21 – 7/18/21 Refactored the code by integrating Misty with ML (sound classification) on Ubuntu 20.04 Worked on audio categories – audioset dataset Direction of arrival of sound papers. See references below Misty architecture – From reference 2.1 below References Sound LocalizationSimulation of Human Ear Recognition Sound Direction Based on Convolutional Neural Network Sound Source Direction Estimation in Horizontal Plane Using Microphone Array (2013) – Read, provides a new algorithm for sound source localization using 4 microphones, uses ML estimator for TDOA estimation Localization of sound sources in robotics: A review – (2017) Read, a very comprehensive review paperLearning Sound Location from a Single Microphone (2009) – Read, provides a model for sound localization using a special designed microphone Localization of Sound Sources: A Systematic Review (2021) – Read, provides a summary of different techniques for sound localization Microphone Array | Beamforming | Clean Voice – Read, not very useful for my taskSound Event Localization and Detection of Overlapping Sources Using Convolutional Recurrent Neural Networks Convolutional Recurrent Neural Networks for Polyphonic Sound Event Detection Spectral Cues in Human Sound Localization Misty IIhttps://www.mistyrobotics.com/blog/what-can-the-misty-ii-platform-do/ – misty platform architectureMisty Specification https://docs.mistyrobotics.com/misty-ii/robot/misty-ii/#connecting-to-adb – connecting to the android – Misty IIhttps://developer.android.com/studio/command-line/adb – What is Android Debug Bridgehttps://docs.mistyrobotics.com/misty-ii/robot/misty-ii/#connecting-to-misty-39-s-file-system – Connecting to Misty’s filesystem ROS on Android/Misty IIros2-android-controller – An android app to control ROS2 robothttp://wiki.ros.org/android/Tutorials/kinetic – Android Tutorials KinectROS_2_ANDROID – Receiving and Publishing data from Android using ROS2ROS Mobile – ROS-Mobile is an Android application designed for dynamic control and visualization of mobile robotic system operated by the Robot Operating System (ROS ). The application uses ROS nodes initializing publisher and subscriber with standard ROS messages.Enabling ROS on our Qualcomm Snapdragon based Products ROS Support for Qualcomm® Snapdragon™ – Bringing ARM into Robotics Week of 7/5/21 – 7/11/21 PANNs inference: PANNs inference works on audio from Misty. Note: audio channel must be Mono for PANNs inference to workIt works on Ubuntu but not on Windows 10 Studied: Signals – Sampling from BP Lathi book, made videos on Sampling and A/D conversion Move to Sound audio works on Misty References https://github.com/iver56/audiomentations – audiomentations —- Python library for audio data augmentationhttps://github.com/facebookresearch/AugLy/tree/main/augly/audio – augly —– Python library for audio data augmentationhttps://github.com/keunwoochoi/kapre – kapre — Python library for audio data augmentation and much morehttps://www.youtube.com/watch?v=RMfeYitdO-c – Audio Classification using Tensorflow using caprehttps://github.com/seth814/Audio-Classification – Audio Classification using Tensorflow using capreListening for Event Messages with Simple WebSocket Client — A method to listen to event messages from Misty using a Websocket clientExpected round-trip times in REST API? Misty Coordinate System Python tips Classes in Python