Leveraging the Urban Soundscape

Abstract – Urban environments are characterised by the presence of distinctive audio signals which alert the drivers to events that require prompt action. The detection and interpretation of these signals would be highly beneficial for smart vehicle systems, as it would provide them with complementary information to navigate safely in the environment. In this paper, we present a framework that spots the presence of acoustic events, such as horns and sirens, using a two-stage approach.

We first model the urban soundscape and use anomaly detection to identify the presence of an anomalous sound, and later determine the nature of this sound. As the audio samples are affected by copious non-stationary and unstructured noise, which can degrade classification performance, we propose a noise-removal technique to obtain a clean representation of the data we can use for classification and waveform reconstruction. The method is based on the idea of analysing the spectrograms of the incoming signals as images and applying spectrogram segmentation to isolate and extract the alerting signals from the background noise.

We evaluate our framework on four hours of urban sounds collected driving around urban Oxford on different kinds of road and in different traffic conditions. When compared to traditional feature representations, such as Mel-frequency cepstrum coefficients, our framework shows an improvement of up to 31% in the classification rate.

 

  • [PDF] L. Marchegiani and I. Posner, “Leveraging the Urban Soundscape: Auditory Perception for Smart Vehicles,” in Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), Singapore, 2017.
    [Bibtex]

    @inproceedings{MarchegianiICRA2017,
    Address = {Singapore},
    Author = {Marchegiani, Letizia and Posner, Ingmar},
    Booktitle = {Proceedings of the IEEE International Conference on Robotics and Automation (ICRA)},
    Month = {June},
    Pdf = {http://www.robots.ox.ac.uk/~mobile/Papers/2017ICRA_marchegiani.pdf},
    Title = {Leveraging the Urban Soundscape: Auditory Perception for Smart Vehicles},
    Year = {2017}}