musiccaps
ytid | url | caption | aspect_list | audioset_names | author_id | start_s | end_s | is_balanced_subset | is_audioset_eval | audioset_ids |
---|---|---|---|---|---|---|---|---|---|---|
9nVpyqfyBSE | The low quality recording features an excited crowd cheering for a flat female vocalist speaking afterwards. The recording is a bit noisy and in mono. | ["low quality", "crowd cheering", "beeping sound", "flat female vocal", "noisy", "mono", "exciting"] | ["Drum", "Drum roll", "Rimshot", "Snare drum", "Percussion"] | 4 | 0 | 10 | 0 | 0 | ["/m/026t6", "/m/02k_mr", "/m/03t3fj", "/m/06rvn", "/m/0l14md"] | |
H4rdJlSSt5Y | The low quality recording features a covered song with arpeggiated banjo melody and flat female vocal that is occasionally out of tune. The recording is noisy and in mono, as if it was recorded with a phone, but it is also emotional, regardless of the voice crack. | ["low quality", "arpeggiated banjo melody", "noisy", "mono", "flat female vocal", "emotional", "voice crack"] | ["Singing", "Banjo", "Guitar", "Music", "Mandolin", "Musical instrument", "Plucked string instrument"] | 4 | 30 | 40 | 0 | 1 | ["/m/015lz1", "/m/018j2", "/m/0342h", "/m/04rlf", "/m/04rzd", "/m/04szw", "/m/0fx80y"] | |
LAxCq-s84F8 | The low quality recording features a flat female vocal shortly talking on a very noisy microphone, after which a song fades in. The song consists of soft, reverberant female vocal singing over echoing dark rimshots, punchy kick and groovy piano melody. Overall it has a dark and mellow vibe to it. | ["low quality", "fade in", "soft reverberant female vocal", "noisy", "flat female vocal", "echoing dark rimshot", "punchy kick", "groovy piano melody", "dark", "mellow"] | ["New-age music"] | 4 | 380 | 390 | 0 | 0 | ["/m/02v2lh"] | |
WZd2nT2Afds | The low quality recording features a flat female vocal talking over loud bells, reverberant male vocal talking, water leaking sounds and choir singing. It sounds like a compilation of sounds. The recording is noisy and in mono. | ["choir", "flat female vocal", "reverberant male vocal", "loud bells", "water leaking sound", "noisy", "mono", "low quality", "choir singing", "compilation"] | ["Bell", "Speech", "Inside, large room or hall"] | 4 | 30 | 40 | 0 | 0 | ["/m/0395lw", "/m/09x0r", "/t/dd00126"] | |
Wp5AiCmbQjA | The low quality recording features an acapella cover of a R&B song that consists of flat female vocal singing over hand tapping sounds and snaps. The recording is mono and noisy, as it was probably recorded with a phone. | ["low quality", "mono", "noisy", "flat female vocal", "hand tapping sounds", "snaps", "r&b", "cover", "acapella"] | ["Finger snapping", "Vocal music"] | 4 | 70 | 80 | 0 | 0 | ["/m/025_jnm", "/m/0y4f8"] | |
X28GWrn9LlI | The low quality recording features a live performance that consists of a flat female vocal singing over electric rhythm guitar melody. It sounds passionate, reverberant and emotional. The recording is noisy and in mono, as it was probably recorded with a phone. | ["low quality", "live performance", "flat female vocal", "electric rhythm guitar melody", "passionate", "reverberant", "emotional", "noisy", "mono"] | ["Singing", "Music", "Vocal music"] | 4 | 110 | 120 | 0 | 0 | ["/m/015lz1", "/m/04rlf", "/m/0y4f8"] | |
YLlbLSNxdQ4 | The low quality recording features a children's song that consists of a clock ticking, church bells melody, plucked strings melody and flat female vocal singing over it. It sounds happy, joyful and fun. The recording is mono and noisy. | ["low quality", "clock ticking", "church bells melody", "flat female vocal", "plucked strings melody", "happy", "joyful", "fun", "children song", "noisy", "mono"] | ["Music", "Music for children", "Tick", "Glockenspiel"] | 4 | 200 | 210 | 0 | 0 | ["/m/04rlf", "/m/05fw6t", "/m/07qjznt", "/m/0dwtp"] | |
qcjzfHmQvxg | The low quality recording features a flat female vocal talking over breathy flute melody. As soon as the female starts talking, the flute melody lowers in volume. The recording is noisy and in mono. | ["breathy flute", "flat female vocal", "low quality", "noisy", "mono"] | ["Wind instrument, woodwind instrument", "Flute"] | 4 | 20 | 30 | 0 | 0 | ["/m/085jw", "/m/0l14j_"] | |
xWmcax3aX5U | The low quality recording features a boomy wooden percussion one shot, after which a flat female vocal is talking. It sounds like some sort of tutorial and the recording is noisy and in mono. | ["low quality", "boomy wooden percussion one shots", "noisy", "mono", "flat female vocal", "tutorial"] | ["Tabla", "Percussion"] | 4 | 80 | 90 | 0 | 0 | ["/m/01p970", "/m/0l14md"] |