KNOWLEDGE TRANSFER FROM WEAKLY LABELED AUDIO USING CONVOLUTIONAL NEURAL NETWORK FOR SOUND EVENTS AND SCENES. pdf
Authors: Anurag Kumar, Maksim Khadkevich, Christian Fügen
Average Precision (AP)
Figure compared AP of \(\mathcal{N}_S\) and \(\mathcal{N}_S^{slat}\). The Index of sound events 1 to 50 is shown below. The first column is Index in the above figure, second is the event id (magenta) as used in Audioset dataset and third (blue) is sound event name
1 /m/0h9mv Tire squeal
2 /m/07rdhzs Whack- thwack
3 /m/0jtg0 Sitar
4 /m/0dxrf Frying (food)
5 /m/02fs_r Beep- bleep
6 /m/01yg9g Lawn mower
7 /t/dd00112 Crumpling- crinkling
8 /m/015y_n Swing music
9 /m/02z32qm Fusillade
10 /m/07qqyl4 Boom
11 /m/02rtxlg Whispering
12 /m/015vgc Carnatic music
13 /m/07pdjhy Rub
14 /m/07pbtc8 Walk- footsteps
15 /m/03dnzn Bathtub (filling or washing)
16 /m/07rgt08 Chuckle- chortle
17 /m/0btp2 Traffic noise- roadway noise
18 /m/01z5f Canidae- dogs- wolves
19 /m/0brhx Speech synthesizer
20 /m/02bk07 Chant
21 /m/04k94 Liquid
22 /m/06rqw Ska
23 /m/09t49 Rustling leaves
24 /m/01p970 Tabla
25 /t/dd00037 Scary music
26 /m/07qjznt Tick
27 /m/02p0sh1 Traditional music
28 /m/0195fx Subway- metro- underground
29 /t/dd00033 Sad music
30 /m/0192l Bagpipes
31 /t/dd00018 Oink
32 /t/dd00077 Mechanisms
33 /m/01j4z9 Chainsaw
34 /m/0dbvp Goose
35 /m/07qnq_y Thump- thud
36 /m/01glhc Tapping (guitar technique)
37 /m/0chx_ White noise
38 /m/04brg2 Dishes- pots- and pans
39 /t/dd00136 Whimper (dog)
40 /m/07qwf61 Honk
41 /m/0395lw Bell
42 /m/07s04w4 Snicker
43 /m/04zjc Machine gun
44 /m/0llzx Sewing machine
45 /m/01h3n Bee- wasp- etc.
46 /m/07rwj3x Whoop
47 /m/0z9c A capella
48 /m/07swgks Gurgling
49 /m/012n7d Ambulance (siren)
50 /m/07rkbfh Chatter