musiccaps
3 rows where aspect_list contains "mono", aspect_list contains "uptempo" and is_audioset_eval = 0
This data as json, CSV (advanced)
ytid ▼ | url | caption | aspect_list | audioset_names | author_id | start_s | end_s | is_balanced_subset | is_audioset_eval | audioset_ids |
---|---|---|---|---|---|---|---|---|---|---|
3Yc7_n6mDsI | The low quality recording features a drum solo that consists of punchy snare hits and occasional snare roll, punchy kick and shimmering hi-hats. It sounds energetic and the recording is in mono and noisy. | ["low quality", "noisy", "mono", "uptempo", "drums solo", "punchy snare", "shimmering hi hats", "punchy kick", "snare roll", "energetic"] | ["Drum", "Drum kit", "Drum roll", "Rimshot", "Snare drum", "Bass drum", "Percussion"] | 4 | 40 | 50 | 0 | 0 | ["/m/026t6", "/m/02hnl", "/m/02k_mr", "/m/03t3fj", "/m/06rvn", "/m/0bm02", "/m/0l14md"] | |
8LYWfpPUokc | The low quality recording features a snare roll played over metronome beep. It sounds energetic and exciting, even though the recording is in mono and a bit noisy. | ["low quality", "snare roll", "metronome beep", "uptempo", "energetic", "exciting", "noisy", "mono"] | ["Drum", "Drum roll", "Percussion"] | 4 | 330 | 340 | 0 | 0 | ["/m/026t6", "/m/02k_mr", "/m/0l14md"] | |
PduP4CpaDtY | The low quality recording features a children's song played on some small device that reproduces mono sound. The song consists of a funny male vocal, alongside harmonizing vocals, singing over a shimmering bell melody and some claps. It sounds happy, fun, joyful, muffled and the recording is noisy. | ["children song", "low quality", "uptempo", "mono", "noisy", "funny male vocal", "harmonizing vocals", "shimmering bells melody", "claps", "happy", "fun", "joyful", "muffled"] | ["Christmas music", "Christian music", "Music", "Synthetic singing"] | 4 | 0 | 10 | 0 | 0 | ["/m/0140xf", "/m/02mscn", "/m/04rlf", "/t/dd00006"] |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE [musiccaps] ( [ytid] TEXT PRIMARY KEY, [url] TEXT, [caption] TEXT, [aspect_list] TEXT, [audioset_names] TEXT, [author_id] TEXT, [start_s] TEXT, [end_s] TEXT, [is_balanced_subset] INTEGER, [is_audioset_eval] INTEGER, [audioset_ids] TEXT );