musiccaps
3 rows where aspect_list contains "flat male vocal" and audioset_ids contains "/m/02k_mr"
ytid | url | caption | aspect_list | audioset_names | author_id | start_s | end_s | is_balanced_subset | is_audioset_eval | audioset_ids |
---|---|---|---|---|---|---|---|---|---|---|
CxgVq6eovRU | | The low quality recording features a drums tutorial where a flat male vocalist is talking after shimmering his hats, snappy snare hits and pumping kicks. It sounds groovy and the recording is noisy. | ["low quality", "noisy", "flat male vocal", "drums tutorial", "shimmering hi hats", "pumping kick", "snappy snare", "groovy"] | ["Drum", "Drum kit", "Drum roll", "Rimshot", "Snare drum", "Bass drum", "Percussion"] | 4 | 70 | 80 | 0 | 0 | ["/m/026t6", "/m/02hnl", "/m/02k_mr", "/m/03t3fj", "/m/06rvn", "/m/0bm02", "/m/0l14md"] |
bl8PgmZ9iOc | | The low quality recording features a flat male vocal talking, alongside some claps and snare rolls. It sounds like an interview and the recording is mono and noisy. | ["low quality", "noisy", "mono", "clapping", "flat male vocal", "snare roll", "interview"] | ["Drum", "Drum roll", "Snare drum", "Percussion"] | 4 | 30 | 40 | 0 | 0 | ["/m/026t6", "/m/02k_mr", "/m/06rvn", "/m/0l14md"] |
jdEbwMS9xqo | | The low quality recording features a live performance of a metal song that features a flat male vocal talking, repetitive cowbell percussion and manic drums. The recording is loud, messy, distorted, muffled and it sounds aggressive and uptempo. | ["low quality", "uptempo", "metal", "loud", "energetic", "messy", "distorted", "flat male vocal", "manic drums", "repetitive cowbell percussion", "aggressive", "muffled", "live performance"] | ["Cowbell", "Drum", "Drum kit", "Drum roll", "Rimshot", "Percussion"] | 4 | 0 | 10 | 0 | 0 | ["/m/0239kh", "/m/026t6", "/m/02hnl", "/m/02k_mr", "/m/03t3fj", "/m/0l14md"] |
CREATE TABLE [musiccaps] ( [ytid] TEXT PRIMARY KEY, [url] TEXT, [caption] TEXT, [aspect_list] TEXT, [audioset_names] TEXT, [author_id] TEXT, [start_s] TEXT, [end_s] TEXT, [is_balanced_subset] INTEGER, [is_audioset_eval] INTEGER, [audioset_ids] TEXT );
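The filter described at the top of this page (aspect_list contains "flat male vocal" and audioset_ids contains "/m/02k_mr") can be reproduced locally. The following is a minimal sketch, not the query the site itself runs: it assumes the two list columns hold JSON arrays stored as TEXT, as the rows above suggest, and it does the membership test in Python rather than in SQL. The helper names `build_db` and `matching_ytids` and the fourth row are hypothetical, added only to show a non-matching case; only the ytid, aspect_list, and audioset_ids columns are populated.

```python
import json
import sqlite3

# Schema copied from the page; only three columns are filled in below.
SCHEMA = """
CREATE TABLE [musiccaps] (
    [ytid] TEXT PRIMARY KEY, [url] TEXT, [caption] TEXT, [aspect_list] TEXT,
    [audioset_names] TEXT, [author_id] TEXT, [start_s] TEXT, [end_s] TEXT,
    [is_balanced_subset] INTEGER, [is_audioset_eval] INTEGER, [audioset_ids] TEXT
);
"""

# (ytid, aspect_list, audioset_ids). The first three rows come from the table
# above; the last row is made up to demonstrate a row that is filtered out.
ROWS = [
    ("CxgVq6eovRU",
     ["low quality", "noisy", "flat male vocal", "drums tutorial",
      "shimmering hi hats", "pumping kick", "snappy snare", "groovy"],
     ["/m/026t6", "/m/02hnl", "/m/02k_mr", "/m/03t3fj", "/m/06rvn",
      "/m/0bm02", "/m/0l14md"]),
    ("bl8PgmZ9iOc",
     ["low quality", "noisy", "mono", "clapping", "flat male vocal",
      "snare roll", "interview"],
     ["/m/026t6", "/m/02k_mr", "/m/06rvn", "/m/0l14md"]),
    ("jdEbwMS9xqo",
     ["low quality", "uptempo", "metal", "loud", "energetic", "messy",
      "distorted", "flat male vocal", "manic drums",
      "repetitive cowbell percussion", "aggressive", "muffled",
      "live performance"],
     ["/m/0239kh", "/m/026t6", "/m/02hnl", "/m/02k_mr", "/m/03t3fj",
      "/m/0l14md"]),
    ("hypothetical0", ["flat male vocal", "noisy"], ["/m/026t6"]),  # made up
]


def build_db():
    """Create an in-memory database and store the lists as JSON text."""
    conn = sqlite3.connect(":memory:")
    conn.execute(SCHEMA)
    conn.executemany(
        "INSERT INTO musiccaps (ytid, aspect_list, audioset_ids) VALUES (?, ?, ?)",
        [(ytid, json.dumps(aspects), json.dumps(ids)) for ytid, aspects, ids in ROWS],
    )
    return conn


def matching_ytids(conn, aspect, audioset_id):
    """Return ytids whose JSON arrays contain both values, ordered by ytid."""
    rows = conn.execute(
        "SELECT ytid, aspect_list, audioset_ids FROM musiccaps ORDER BY ytid"
    )
    return [
        ytid
        for ytid, aspects, ids in rows
        if aspect in json.loads(aspects) and audioset_id in json.loads(ids)
    ]


conn = build_db()
result = matching_ytids(conn, "flat male vocal", "/m/02k_mr")
print(result)
```

Decoding each array with `json.loads` tests for exact element membership, so a value such as "flat male vocalist" would not be counted as a match the way a plain substring search on the TEXT column would.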