musiccaps
2 rows where aspect_list contains "live performance", aspect_list contains "mellow piano chords", aspect_list contains "passionate" and aspect_list contains "punchy kick"
This data as json, CSV (advanced)
ytid ▼ | url | caption | aspect_list | audioset_names | author_id | start_s | end_s | is_balanced_subset | is_audioset_eval | audioset_ids |
---|---|---|---|---|---|---|---|---|---|---|
He63KV_9Pwg | The low quality recording features harmonizing vocals singing over punchy kick and snare hits, shimmering hi hats, synth pads, simple bass and mellow piano chords. There are crowd cheering noises in the background as this is a live performance. It sounds passionate, emotional, exciting and the recording is noisy, in mono and slightly distorted. | ["live performance", "low quality", "noisy", "mono", "crowd cheering", "distorted", "harmonizing vocals", "punchy snare", "punchy kick", "shimmering hi hats", "synth pads", "simple bass", "mellow piano chords", "passionate", "emotional", "exciting"] | ["Singing", "Music", "Pop music", "Whoop"] | 4 | 60 | 70 | 0 | 0 | ["/m/015lz1", "/m/04rlf", "/m/064t9", "/m/07rwj3x"] | |
tuoBmxts5Xo | The low quality recording features a live performance of a jazz song that consists of a tenor sax solo melody playing over mellow piano chords, sustained strings, groovy bass, snappy rimshots, shimmering hi hats and punchy kicks. It sounds emotional, passionate and soulful. | ["low quality", "live performance", "jazz", "tenor sax solo melody", "groovy bass", "mellow piano chords", "shimmering hi hats", "punchy kick", "snappy rimshots", "sustained strings", "passionate", "emotional", "soulful"] | ["Brass instrument", "Jazz", "Music", "Musical instrument", "Wedding music", "Saxophone"] | 4 | 80 | 90 | 0 | 0 | ["/m/01kcd", "/m/03_d0", "/m/04rlf", "/m/04szw", "/m/04wptg", "/m/06ncr"] |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE [musiccaps] ( [ytid] TEXT PRIMARY KEY, [url] TEXT, [caption] TEXT, [aspect_list] TEXT, [audioset_names] TEXT, [author_id] TEXT, [start_s] TEXT, [end_s] TEXT, [is_balanced_subset] INTEGER, [is_audioset_eval] INTEGER, [audioset_ids] TEXT );