musiccaps
2 rows where aspect_list contains "street performance" and audioset_names contains "Speech"
This data as json, CSV (advanced)
ytid ▼ | url | caption | aspect_list | audioset_names | author_id | start_s | end_s | is_balanced_subset | is_audioset_eval | audioset_ids |
---|---|---|---|---|---|---|---|---|---|---|
07mS0mSTDjY | This is an amateur recording of an Afro-cuban dance music performance. There are male vocals singing joyfully. There is a percussion orchestra made up of diverse elements such as the conga, the bongo and the timbales are playing in the rhythmic background. The atmosphere is lively and vibrant. Although the audio quality is not that great, this piece can still be sampled for use in beat-making. | ["afro cuban dance", "amateur recording", "street performance", "male vocals", "percussion", "conga", "bongo", "timbale", "lively", "vibrant"] | ["Blues", "Traditional music", "Music", "Speech", "Dance music"] | 9 | 30 | 40 | 0 | 0 | ["/m/0155w", "/m/02p0sh1", "/m/04rlf", "/m/09x0r", "/m/0ggx5q"] | |
dYVy7moyQCc | This music is instrumental. The tempo is fast with a spirited didgeridoo harmony. The music is droning, rhythmic, deep and rich with the rhythmic tapping /clapping. This is street busking with ambient sounds of people talking, bicycle bells and feet scuffling. The music is droning, hypnotic, meditative, trance and engaging. | ["instrumental", "fast tempo", "didgeridoo", "tapping", "bicycle bell sound", "woman talking", "hypnotic", "trance", "trippy", "psychedelic", "repetitive", "engaging", "captivating", "feet scuffling", "keeping time", "clapping", "street performance", "busking", "ambient sounds", "people talking", "energetic", "unique", "single note", "droning sound"] | ["Didgeridoo", "Music", "Speech"] | 7 | 30 | 40 | 0 | 0 | ["/m/02bxd", "/m/04rlf", "/m/09x0r"] |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE [musiccaps] ( [ytid] TEXT PRIMARY KEY, [url] TEXT, [caption] TEXT, [aspect_list] TEXT, [audioset_names] TEXT, [author_id] TEXT, [start_s] TEXT, [end_s] TEXT, [is_balanced_subset] INTEGER, [is_audioset_eval] INTEGER, [audioset_ids] TEXT );