musiccaps
2 rows where aspect_list contains "live performance" and aspect_list contains "wide"
This data as json, CSV (advanced)
aspect_list (array) 15 ✖
- live performance · 2 ✖
- low quality 2
- noisy 2
- wide · 2 ✖
- crowd clapping 1
- folk 1
- groovy double bass 1
- groovy keys chords 1
- harmonica melodies 1
- jazz 1
- passionate female vocal 1
- reverberant 1
- shimmering hi hats 1
- shoe tapping 1
- smooth bass 1
ytid ▼ | url | caption | aspect_list | audioset_names | author_id | start_s | end_s | is_balanced_subset | is_audioset_eval | audioset_ids |
---|---|---|---|---|---|---|---|---|---|---|
3Nwsd439zmU | The low quality recording features a live performance of a folk song and it consists of harmonica melodies played over groovy double bass and shimmering hi-hats. The recording is noisy and fairly wide. | ["low quality", "live performance", "folk", "harmonica melodies", "groovy double bass", "shimmering hi hats", "noisy", "wide"] | ["Harmonica", "Wind instrument, woodwind instrument"] | 4 | 260 | 270 | 0 | 0 | ["/m/03qjg", "/m/085jw"] | |
hFqZZrj0rnM | The low quality recording features a live performance of shoe tapping over a jazz song that consists of passionate female vocal singing over smooth bass and groovy key chords. There are crowd clapping sounds in the background. The actual sounds of shoe tappings is widely spread in the stereo image and it is reverberant, as it was probably performed in a huge space. | ["live performance", "crowd clapping", "noisy", "shoe tapping", "jazz", "passionate female vocal", "smooth bass", "groovy keys chords", "low quality", "wide", "reverberant"] | ["Music", "Tap"] | 4 | 30 | 40 | 1 | 1 | ["/m/04rlf", "/m/07qcpgn"] |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE [musiccaps] ( [ytid] TEXT PRIMARY KEY, [url] TEXT, [caption] TEXT, [aspect_list] TEXT, [audioset_names] TEXT, [author_id] TEXT, [start_s] TEXT, [end_s] TEXT, [is_balanced_subset] INTEGER, [is_audioset_eval] INTEGER, [audioset_ids] TEXT );