musiccaps
2 rows where aspect_list contains "distorted", aspect_list contains "emotional", aspect_list contains "live performance" and aspect_list contains "punchy snare"
This data as json, CSV (advanced)
ytid ▼ | url | caption | aspect_list | audioset_names | author_id | start_s | end_s | is_balanced_subset | is_audioset_eval | audioset_ids |
---|---|---|---|---|---|---|---|---|---|---|
He63KV_9Pwg | The low quality recording features harmonizing vocals singing over punchy kick and snare hits, shimmering hi hats, synth pads, simple bass and mellow piano chords. There are crowd cheering noises in the background as this is a live performance. It sounds passionate, emotional, exciting and the recording is noisy, in mono and slightly distorted. | ["live performance", "low quality", "noisy", "mono", "crowd cheering", "distorted", "harmonizing vocals", "punchy snare", "punchy kick", "shimmering hi hats", "synth pads", "simple bass", "mellow piano chords", "passionate", "emotional", "exciting"] | ["Singing", "Music", "Pop music", "Whoop"] | 4 | 60 | 70 | 0 | 0 | ["/m/015lz1", "/m/04rlf", "/m/064t9", "/m/07rwj3x"] | |
sTcaIARuemA | The low quality recording features a live performance of a song that is barely audible due to distorted sound, loud cheering crowd and female screams. There is definitely at least a boomy bass, punchy kick and snare hits and energetic cymbals, alongside some vocals singing over it. It sounds messy and muddy, but also emotional and energetic. | ["low quality", "crowd cheering", "female screams", "live performance", "punchy snare", "boomy bass", "energetic cymbals", "muddy", "messy", "loud", "distorted", "punchy kick", "vocals", "emotional", "energetic"] | ["Singing", "Music", "Yell"] | 4 | 150 | 160 | 1 | 1 | ["/m/015lz1", "/m/04rlf", "/m/07sr1lc"] |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE [musiccaps] ( [ytid] TEXT PRIMARY KEY, [url] TEXT, [caption] TEXT, [aspect_list] TEXT, [audioset_names] TEXT, [author_id] TEXT, [start_s] TEXT, [end_s] TEXT, [is_balanced_subset] INTEGER, [is_audioset_eval] INTEGER, [audioset_ids] TEXT );