musiccaps
2 rows where aspect_list contains "folk", aspect_list contains "groovy bass", aspect_list contains "live performance" and aspect_list contains "noisy"
This data as json, CSV (advanced)
ytid ▼ | url | caption | aspect_list | audioset_names | author_id | start_s | end_s | is_balanced_subset | is_audioset_eval | audioset_ids |
---|---|---|---|---|---|---|---|---|---|---|
fiAcNMpd2vM | The low quality recording features a live performance of a folk song that consists of a passionate male vocal singing over groovy bass, woodwind melody, punchy snare and wooden percussion. It sounds joyful, energetic and the recording is noisy. | ["low quality", "folk", "passionate male vocal", "groovy bass", "woodwind melody", "punchy snare", "energetic", "wooden percussions", "live performance", "joyful", "noisy"] | ["Drum", "Drum kit", "Music", "Musical instrument", "Wedding music", "Music of Latin America"] | 4 | 270 | 280 | 0 | 0 | ["/m/026t6", "/m/02hnl", "/m/04rlf", "/m/04szw", "/m/04wptg", "/m/0g293"] | |
gsIB8HjsRtw | The low quality recording features a live performance of a folk song that contains an accordion melody playing over acoustic rhythm guitar, groovy bass, punchy kick and snare hits, shimmering cymbals, saxophone and trumpet melody. It sounds passionate and easygoing, even though the recording is noisy and the stereo image is unbalanced, due to the fact that the sound is leaning towards the left channel. | ["low quality", "live performance", "folk", "accordion melody", "acoustic rhythm guitar", "shimmering cymbals", "punchy kick", "punchy snare", "saxophone melody", "trumpet melody", "groovy bass", "noisy", "unbalanced stereo", "passionate", "easygoing"] | ["Swing music", "Music", "Musical instrument", "Accordion"] | 4 | 100 | 110 | 0 | 0 | ["/m/015y_n", "/m/04rlf", "/m/04szw", "/m/0mkg"] |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE [musiccaps] ( [ytid] TEXT PRIMARY KEY, [url] TEXT, [caption] TEXT, [aspect_list] TEXT, [audioset_names] TEXT, [author_id] TEXT, [start_s] TEXT, [end_s] TEXT, [is_balanced_subset] INTEGER, [is_audioset_eval] INTEGER, [audioset_ids] TEXT );