musiccaps
ytid | url | caption | aspect_list | audioset_names | author_id | start_s | end_s | is_balanced_subset | is_audioset_eval | audioset_ids |
---|---|---|---|---|---|---|---|---|---|---|
CN2QSmhP-HI | This salsa song features a female voice singing the main melody. This is accompanied by the congas. The beat is a dance beat. Trumpets and a saxophone play fills in between lines. A piano plays a melody at the end of the song. The song starts with the voice singing a melody at a moderate tempo. After the piano plays, the tempo of the song increases. Other instruments cannot be heard as the quality of the recording is low. This song can be played in a Latin dance sequence in a movie. | ["low quality recording", "salsa song", "congas", "saxophone", "trumpet", "piano", "female voice", "moderate tempo", "dance music", "seductive rhythm"] | ["Music", "Salsa music"] | 0 | 30 | 40 | 0 | 1 | ["/m/04rlf", "/m/0ln16"] | |
ICcASgMtIJ8 | This house music features a female voice singing the main melody. This is accompanied by programmed percussion playing a simple beat. The kick is played on every count. Hand claps are played at every alternate count. The bass plays the root notes of the chords. Synth chords are played in the background. This song can be played at a club. | ["house music", "party music", "female voice", "hand claps", "programmed percussion", "synth sounds", "bass", "moderate tempo", "dance music"] | ["Disco", "House music", "Music"] | 0 | 30 | 40 | 0 | 0 | ["/m/026z9", "/m/03mb9", "/m/04rlf"] | |
MEew7OQ17HY | This audio clip features a female voice singing the main melody. The quality of the audio recording is low. The voice is accompanied by Latin style percussion. Male voices sing backing vocals. This is a dance song at a moderate tempo. The sound of a camera shutter is played at the beginning and end of the clip. Other musical instruments are barely audible due to the low quality of audio recording. | ["low quality recording", "female voice", "latin percussion", "male backing voices", "moderate tempo", "dance music", "camera shutter sound"] | ["Music", "Single-lens reflex camera", "Inside, small room"] | 0 | 10 | 20 | 0 | 1 | ["/m/04rlf", "/m/07bjf", "/t/dd00125"] |