musiccaps
2 rows where aspect_list contains "folk", aspect_list contains "groovy", aspect_list contains "live performance" and aspect_list contains "punchy snare"
This data as json, CSV (advanced)
ytid ▼ | url | caption | aspect_list | audioset_names | author_id | start_s | end_s | is_balanced_subset | is_audioset_eval | audioset_ids |
---|---|---|---|---|---|---|---|---|---|---|
ChqJYrmQIN4 | The low quality recording features a live performance of a folk song that consists of an accordion melody playing over groovy bass, shimmering hi hats, punchy snare and electric guitar melody. There are crowd chattering noises in the background. It sounds passionate and groovy. | ["low quality", "live performance", "crowd chattering", "folk", "groovy bass", "accordion melody", "electric guitar melody", "shimmering hi hats", "punchy snare", "passionate", "groovy"] | ["Singing", "Traditional music", "Music", "Musical instrument", "Orchestra", "Speech", "Music of Latin America", "Accordion"] | 4 | 50 | 60 | 0 | 0 | ["/m/015lz1", "/m/02p0sh1", "/m/04rlf", "/m/04szw", "/m/05pd6", "/m/09x0r", "/m/0g293", "/m/0mkg"] | |
_78P-0zWJtg | The low quality recording features a live performance of fruity male vocal singing over funky piano melody and beat played on playback that consists of punchy snare and kick hits, shimmering hi hats and smooth bass. The crowd is also singing along, harmonizing with the lead vocal. It sounds emotional and groovy, even though the recording is noisy. | ["low quality", "folk", "fruity male vocal", "funky piano melody", "playback beat", "shimmering hi hats", "punchy snare", "punchy kick", "smooth bass", "noisy", "live performance", "harmonizing crowd vocals", "emotional", "groovy"] | ["Singing", "Gospel music", "Music", "Vocal music"] | 4 | 420 | 430 | 1 | 1 | ["/m/015lz1", "/m/016cjb", "/m/04rlf", "/m/0y4f8"] |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE [musiccaps] ( [ytid] TEXT PRIMARY KEY, [url] TEXT, [caption] TEXT, [aspect_list] TEXT, [audioset_names] TEXT, [author_id] TEXT, [start_s] TEXT, [end_s] TEXT, [is_balanced_subset] INTEGER, [is_audioset_eval] INTEGER, [audioset_ids] TEXT );