KNOWLEDGE TRANSFER FROM WEAKLY LABELED AUDIO USING CONVOLUTIONAL NEURAL NETWORK FOR SOUND EVENTS AND SCENES. pdf

Authors: Anurag Kumar, Maksim Khadkevich, Christian Fügen


Area Under ROC Curves (AUC)

average Precision

Figure compared AP of \(\mathcal{N}_S\) and \(\mathcal{N}_S^{slat}\). The Index of sound events 1 to 50 is shown below. The first column is Index in the above figure, second is the event id (magenta) as used in Audioset dataset and third (blue) is sound event name

1    /m/05x_td   Air horn- truck horn
2    /m/04wptg   Wedding music
3    /t/dd00013   Children playing
4    /m/07ptfmf   Stir
5    /m/01xq0k1   Cattle- bovinae
6    /m/0fx9l   Microwave oven
7    /m/06hck5   Steam whistle
8    /m/02x984l   Mechanical fan
9    /m/03cczk   Chewing- mastication
10    /m/0lyf6   Breathing
11    /m/07qf0zm   Howl
12    /m/07sq110   Belly laugh
13    /m/081rb   Writing
14    /m/039jq   Glass
15    /m/07rjzl8   Slam
16    /m/04s8yn   Crow
17    /m/07szfh9   Cacophony
18    /m/09xqv   Cricket
19    /m/0261r1   Babbling
20    /t/dd00001   Baby laughter
21    /m/06q74   Ship
22    /m/02g901   Electric shaver- electric razor
23    /m/01b9nn   Reverberation
24    /m/07ryjzk   Slap- smack
25    /m/01b_21   Cough
26    /m/023pjk   Cutlery- silverware
27    /m/0gvgw0   Air brake
28    /m/0l14t7   Singing bowl
29    /m/07qyrcz   Plop
30    /m/07svc2k   Gobble
31    /m/0f8s22   Chime
32    /m/0c3f7m   Fire alarm
33    /m/02p3nc   Hiccup
34    /m/07rrlb6   Splash- splatter
35    /m/04cvmfc   Roar
36    /m/030rvx   Buzzer
37    /m/0_1c   Artillery fire
38    /m/068zj   Pig
39    /m/0l156b   Steelpan
40    /m/07r660_   Giggle
41    /m/0cj0r   Pink noise
42    /m/02l6bg   Propeller- airscrew
43    /m/02y_763   Sliding door
44    /m/01jg02   Heart sounds- heartbeat
45    /m/07qn4z3   Rattle
46    /m/0dl83   Arrow
47    /t/dd00036   Angry music
48    /t/dd00130   Engine starting
49    /m/01hnzm   Ringtone
50    /m/01rd7k   Turkey