SwePub
Sök i LIBRIS databas

  Utökad sökning

onr:"swepub:oai:DiVA.org:bth-15478"
 

Sökning: onr:"swepub:oai:DiVA.org:bth-15478" > Classifying environ...

Classifying environmental sounds using image recognition networks

Boddapati, Venkatesh (författare)
Blekinge Tekniska Högskola,Institutionen för datalogi och datorsystemteknik
Petef, Andrej (författare)
Sony Mobile Communications AB, SWE
Rasmusson, Jim (författare)
Sony Mobile Communications AB, SWE
visa fler...
Lundberg, Lars (författare)
Blekinge Tekniska Högskola,Institutionen för datalogi och datorsystemteknik
visa färre...
 (creator_code:org_t)
Elsevier B.V. 2017
2017
Engelska.
Ingår i: Procedia Computer Science. - : Elsevier B.V.. - 1877-0509. ; , s. 2048-2056
  • Konferensbidrag (refereegranskat)
Abstract Ämnesord
Stäng  
  • Automatic classification of environmental sounds, such as dog barking and glass breaking, is becoming increasingly interesting, especially for mobile devices. Most mobile devices contain both cameras and microphones, and companies that develop mobile devices would like to provide functionality for classifying both videos/images and sounds. In order to reduce the development costs one would like to use the same technology for both of these classification tasks. One way of achieving this is to represent environmental sounds as images, and use an image classification neural network when classifying images as well as sounds. In this paper we consider the classification accuracy for different image representations (Spectrogram, MFCC, and CRP) of environmental sounds. We evaluate the accuracy for environmental sounds in three publicly available datasets, using two well-known convolutional deep neural networks for image recognition (AlexNet and GoogLeNet). Our experiments show that we obtain good classification accuracy for the three datasets. © 2017 The Author(s).

Ämnesord

NATURVETENSKAP  -- Data- och informationsvetenskap (hsv//swe)
NATURAL SCIENCES  -- Computer and Information Sciences (hsv//eng)

Nyckelord

Convolutional Neural Networks
Deep Learning
Environmental Sound Classification
GPU Processing
Image Classification
Classification (of information)
Convolution
Deep neural networks
Image recognition
Knowledge based systems
Neural networks
Automatic classification
Classification accuracy
Classification tasks
Convolutional neural network
Environmental sound classifications
Environmental sounds
Image representations

Publikations- och innehållstyp

ref (ämneskategori)
kon (ämneskategori)

Hitta via bibliotek

Till lärosätets databas

Sök utanför SwePub

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Stäng

Kopiera och spara länken för att återkomma till aktuell vy