musiccaps
3 rows where aspect_list contains "folk", aspect_list contains "noisy" and aspect_list contains "soulful"
This data as json, CSV (advanced)
ytid ▼ | url | caption | aspect_list | audioset_names | author_id | start_s | end_s | is_balanced_subset | is_audioset_eval | audioset_ids |
---|---|---|---|---|---|---|---|---|---|---|
8kx5ST65Fog | The low quality recording features a live performance of a folk song and it consists of a metallic percussive melody, wooden percussions, acoustic guitar melody and flute melody. It sounds soulful, upbeat and passionate and the recording is noisy. | ["low quality", "folk", "noisy", "live performance", "metallic percussive melody", "wooden percussions", "acoustic guitar melody", "flute melody", "soulful", "upbeat", "passionate"] | ["Orchestra"] | 4 | 90 | 100 | 0 | 0 | ["/m/05pd6"] | |
exD5okdopWc | The low quality recording features a folk song that consists of passionate male vocal singing over shimmering shakers, smooth bass, sustained strings melody and groovy piano melody. It sounds emotional, soulful and passionate - like something you would hear in movies. | ["low quality", "folk", "noisy", "mono", "passionate male vocal", "sustained strings melody", "shimmering shakers", "smooth bass", "groovy piano melody", "emotional", "passionate", "soulful"] | ["Music of Asia", "Music"] | 4 | 60 | 70 | 0 | 0 | ["/m/028sqc", "/m/04rlf"] | |
vpcEBryyej4 | The low quality recording features a live performance of a folk song that consists of a saxophone solo melody, accordion melody and acoustic rhythm guitar. It sounds passionate and soulful. The recording is mono and noisy. | ["low quality", "live performance", "folk", "noisy", "mono", "saxophone solo melody", "accordion melody", "acoustic rhythm guitar", "passionate", "soulful"] | ["Accordion"] | 4 | 70 | 80 | 0 | 0 | ["/m/0mkg"] |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE [musiccaps] ( [ytid] TEXT PRIMARY KEY, [url] TEXT, [caption] TEXT, [aspect_list] TEXT, [audioset_names] TEXT, [author_id] TEXT, [start_s] TEXT, [end_s] TEXT, [is_balanced_subset] INTEGER, [is_audioset_eval] INTEGER, [audioset_ids] TEXT );