musiccaps
2 rows where aspect_list contains "female voice" and aspect_list contains "latin percussion"
This data as json, CSV (advanced)
aspect_list (array) 12 ✖
- female voice · 2 ✖
- latin percussion · 2 ✖
- camera shutter sound 1
- crowd clapping 1
- dance music 1
- flute 1
- low quality recording 1
- male backing voices 1
- moderate tempo 1
- trombones 1
- trumpets 1
- upright bass 1
ytid ▼ | url | caption | aspect_list | audioset_names | author_id | start_s | end_s | is_balanced_subset | is_audioset_eval | audioset_ids |
---|---|---|---|---|---|---|---|---|---|---|
KrK8Giu9ZUc | The music excerpt starts off with a Latin band with instruments such as trombones, trumpets, upright bass, flute and latin percussion instruments. After a few seconds the band stops and a female voice starts to sing a voice that starts in the high register and moves to the low one. Right between these two moments one can hear claps coming from a crowd of people. Towards the ending of the excerpt the voice finishes singing the melody and the band starts again. | ["female voice", "crowd clapping", "trombones", "trumpets", "upright bass", "flute", "latin percussion"] | ["Humming", "Music"] | 2 | 180 | 190 | 0 | 1 | ["/m/02fxyj", "/m/04rlf"] | |
MEew7OQ17HY | This audio clip features a female voice singing the main melody. The quality of the audio recording is low. The voice is accompanied by Latin style percussion. Male voices sing backing vocals. This is a dance song at a moderate tempo. The sound of a camera shutter is played at the beginning and end of the clip. Other musical instruments are barely audible due to the low quality of audio recording. | ["low quality recording", "female voice", "latin percussion", "male backing voices", "moderate tempo", "dance music", "camera shutter sound"] | ["Music", "Single-lens reflex camera", "Inside, small room"] | 0 | 10 | 20 | 0 | 1 | ["/m/04rlf", "/m/07bjf", "/t/dd00125"] |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE [musiccaps] ( [ytid] TEXT PRIMARY KEY, [url] TEXT, [caption] TEXT, [aspect_list] TEXT, [audioset_names] TEXT, [author_id] TEXT, [start_s] TEXT, [end_s] TEXT, [is_balanced_subset] INTEGER, [is_audioset_eval] INTEGER, [audioset_ids] TEXT );