musiccaps
2 rows where aspect_list contains "flat male vocal" and aspect_list contains "shimmering tambourine"
aspect_list (array) 17
- flat male vocal 2
- low quality 2
- shimmering tambourine 2
- dull 1
- emotional 1
- groovy 1
- high pitched female vocal 1
- hip hop 1
- mellow piano melody 1
- muffled 1
- noisy 1
- punchy snare 1
- smooth bass 1
- soft crash cymbal 1
- soft kick hits 1
- tutorial 1
- unbalanced stereo 1
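
The facet counts above can be reproduced from the `aspect_list` values of the two matching rows. The following is a minimal sketch, assuming each `aspect_list` cell is a JSON array stored as text (as the schema on this page suggests); the helper name `facet_counts` is made up for illustration and is not part of Datasette.

```python
import json
from collections import Counter

# aspect_list cells copied from the two matching rows (JSON arrays stored as text).
aspect_lists = [
    '["hip hop", "low quality", "smooth bass", "high pitched female vocal", '
    '"mellow piano melody", "flat male vocal", "shimmering tambourine", '
    '"soft crash cymbal", "punchy snare", "soft kick hits", "groovy", '
    '"emotional", "muffled", "dull"]',
    '["low quality", "unbalanced stereo", "flat male vocal", '
    '"shimmering tambourine", "tutorial", "noisy"]',
]


def facet_counts(json_arrays):
    """Count how many rows contain each distinct array value."""
    counts = Counter()
    for raw in json_arrays:
        # set() so a value repeated inside one row is counted once for that row.
        counts.update(set(json.loads(raw)))
    return counts


counts = facet_counts(aspect_lists)

# Highest count first, then alphabetical, similar to the listing above.
for value, n in sorted(counts.items(), key=lambda kv: (-kv[1], kv[0])):
    print(f"- {value} {n}")
```

The number of distinct keys in `counts` corresponds to the total shown next to `aspect_list (array)`.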
ytid | url | caption | aspect_list | audioset_names | author_id | start_s | end_s | is_balanced_subset | is_audioset_eval | audioset_ids |
---|---|---|---|---|---|---|---|---|---|---|
RXGDlFry3Vo | | The low quality recording features a hip hop track that consists of flat male vocal rapping over smooth bass, high pitched female vocal, mellow piano melody, shimmering tambourine layered with punchy snare, followed by soft kick and crash cymbal hits. It sounds groovy and emotional, even though the recording is muffled and kind of dull, due to the bad mixing. | ["hip hop", "low quality", "smooth bass", "high pitched female vocal", "mellow piano melody", "flat male vocal", "shimmering tambourine", "soft crash cymbal", "punchy snare", "soft kick hits", "groovy", "emotional", "muffled", "dull"] | ["Music", "Rapping"] | 4 | 30 | 40 | 0 | 1 | ["/m/04rlf", "/m/06bxc"] |
mU6cfEWw5Og | | The low quality recording features a flat male vocal talking, after which there is a shimmering tambourine being played. The recording is noisy and the stereo image is unbalanced, as the audio is slightly leaning towards the right channel of the stereo image. | ["low quality", "unbalanced stereo", "flat male vocal", "shimmering tambourine", "tutorial", "noisy"] | ["Music", "Musical instrument", "Tambourine", "Speech", "Percussion"] | 4 | 180 | 190 | 0 | 0 | ["/m/04rlf", "/m/04szw", "/m/07brj", "/m/09x0r", "/m/0l14md"] |
CREATE TABLE [musiccaps] (
  [ytid] TEXT PRIMARY KEY,
  [url] TEXT,
  [caption] TEXT,
  [aspect_list] TEXT,
  [audioset_names] TEXT,
  [author_id] TEXT,
  [start_s] TEXT,
  [end_s] TEXT,
  [is_balanced_subset] INTEGER,
  [is_audioset_eval] INTEGER,
  [audioset_ids] TEXT
);
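
To illustrate how the filter at the top of this page (rows whose `aspect_list` contains both "flat male vocal" and "shimmering tambourine") relates to this schema, here is a minimal sketch using Python's standard `sqlite3` module. It builds an in-memory table with the schema above, inserts the two rows shown on this page plus one clearly hypothetical non-matching row, and filters in Python by decoding the JSON text. The helper `rows_with_all_aspects`, the ytid `hypothetical_row`, and the omission of the remaining columns are assumptions made for the example; Datasette's own query may be implemented differently.

```python
import json
import sqlite3

SCHEMA = """
CREATE TABLE [musiccaps] (
  [ytid] TEXT PRIMARY KEY, [url] TEXT, [caption] TEXT, [aspect_list] TEXT,
  [audioset_names] TEXT, [author_id] TEXT, [start_s] TEXT, [end_s] TEXT,
  [is_balanced_subset] INTEGER, [is_audioset_eval] INTEGER, [audioset_ids] TEXT
);
"""

# (ytid, aspect_list, start_s, end_s); other columns are left NULL for brevity.
ROWS = [
    ("RXGDlFry3Vo", json.dumps([
        "hip hop", "low quality", "smooth bass", "high pitched female vocal",
        "mellow piano melody", "flat male vocal", "shimmering tambourine",
        "soft crash cymbal", "punchy snare", "soft kick hits", "groovy",
        "emotional", "muffled", "dull"]), "30", "40"),
    ("mU6cfEWw5Og", json.dumps([
        "low quality", "unbalanced stereo", "flat male vocal",
        "shimmering tambourine", "tutorial", "noisy"]), "180", "190"),
    # Hypothetical row, not from the dataset: has only one of the two aspects.
    ("hypothetical_row", json.dumps(["flat male vocal"]), "0", "10"),
]


def rows_with_all_aspects(conn, wanted):
    """Return ytids whose aspect_list JSON array contains every wanted value."""
    wanted = set(wanted)
    matches = []
    query = "SELECT ytid, aspect_list FROM musiccaps ORDER BY ytid"
    for ytid, raw in conn.execute(query):
        if wanted <= set(json.loads(raw)):
            matches.append(ytid)
    return matches


conn = sqlite3.connect(":memory:")
conn.execute(SCHEMA)
conn.executemany(
    "INSERT INTO musiccaps (ytid, aspect_list, start_s, end_s) VALUES (?, ?, ?, ?)",
    ROWS,
)

matches = rows_with_all_aspects(conn, ["flat male vocal", "shimmering tambourine"])
print(matches)
```

Filtering in Python avoids depending on SQLite's JSON functions being compiled in; where they are available, the same test could be pushed into SQL instead.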