KNOWLEDGE TRANSFER FROM WEAKLY LABELED AUDIO USING CONVOLUTIONAL NEURAL NETWORK FOR SOUND EVENTS AND SCENES

Authors: Anurag Kumar, Maksim Khadkevich, Christian Fügen


Figure: Average Precision (AP) of \(\mathcal{N}_S\) compared with \(\mathcal{N}_S^{slat}\). The sound events indexed 1 to 50 in the figure are listed below. The first column is the index used in the figure, the second is the event ID (magenta) as used in the AudioSet dataset, and the third (blue) is the sound event name.

Index    Event ID    Sound event name
1    /m/0h9mv   Tire squeal
2    /m/07rdhzs   Whack, thwack
3    /m/0jtg0   Sitar
4    /m/0dxrf   Frying (food)
5    /m/02fs_r   Beep, bleep
6    /m/01yg9g   Lawn mower
7    /t/dd00112   Crumpling, crinkling
8    /m/015y_n   Swing music
9    /m/02z32qm   Fusillade
10    /m/07qqyl4   Boom
11    /m/02rtxlg   Whispering
12    /m/015vgc   Carnatic music
13    /m/07pdjhy   Rub
14    /m/07pbtc8   Walk, footsteps
15    /m/03dnzn   Bathtub (filling or washing)
16    /m/07rgt08   Chuckle, chortle
17    /m/0btp2   Traffic noise, roadway noise
18    /m/01z5f   Canidae, dogs, wolves
19    /m/0brhx   Speech synthesizer
20    /m/02bk07   Chant
21    /m/04k94   Liquid
22    /m/06rqw   Ska
23    /m/09t49   Rustling leaves
24    /m/01p970   Tabla
25    /t/dd00037   Scary music
26    /m/07qjznt   Tick
27    /m/02p0sh1   Traditional music
28    /m/0195fx   Subway, metro, underground
29    /t/dd00033   Sad music
30    /m/0192l   Bagpipes
31    /t/dd00018   Oink
32    /t/dd00077   Mechanisms
33    /m/01j4z9   Chainsaw
34    /m/0dbvp   Goose
35    /m/07qnq_y   Thump, thud
36    /m/01glhc   Tapping (guitar technique)
37    /m/0chx_   White noise
38    /m/04brg2   Dishes, pots, and pans
39    /t/dd00136   Whimper (dog)
40    /m/07qwf61   Honk
41    /m/0395lw   Bell
42    /m/07s04w4   Snicker
43    /m/04zjc   Machine gun
44    /m/0llzx   Sewing machine
45    /m/01h3n   Bee, wasp, etc.
46    /m/07rwj3x   Whoop
47    /m/0z9c   A capella
48    /m/07swgks   Gurgling
49    /m/012n7d   Ambulance (siren)
50    /m/07rkbfh   Chatter
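
The per-event AP values compared in the figure can be understood through the standard non-interpolated definition of average precision: rank all recordings by the classifier score for one event, then average the precision at each rank where a positive recording appears. The sketch below is only an illustration of that definition. The function name `average_precision` and the toy labels and scores are hypothetical and do not come from the paper, and the authors' evaluation code may handle ties or interpolation differently.

```python
from typing import Sequence


def average_precision(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Non-interpolated AP for one event class (hypothetical helper).

    labels: 1 if the event is present in the recording, else 0.
    scores: classifier output for that event, higher means more confident.
    """
    # Rank recordings from highest to lowest score.
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    hits = 0
    precision_sum = 0.0
    for rank, idx in enumerate(order, start=1):
        if labels[idx] == 1:
            hits += 1
            # Precision at this rank: positives retrieved so far / rank.
            precision_sum += hits / rank
    # Convention assumed here: AP is 0 when the class has no positives.
    return precision_sum / hits if hits else 0.0


# Toy data for a single event class (made up for illustration).
toy_labels = [1, 0, 1, 0]
toy_scores = [0.9, 0.8, 0.7, 0.1]
toy_ap = average_precision(toy_labels, toy_scores)
```

In this toy case the positives sit at ranks 1 and 3, so the AP is the mean of 1/1 and 2/3. Repeating the computation for each of the 50 events listed above would give one AP value per index.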