musiccaps
2 rows where aspect_list contains "saxophone solo melody" and is_audioset_eval = 1
ytid ▼ | url | caption | aspect_list | audioset_names | author_id | start_s | end_s | is_balanced_subset | is_audioset_eval | audioset_ids |
---|---|---|---|---|---|---|---|---|---|---|
ciJOulWFhfA | | The low quality recording features a jazz song being played in a big room and it consists of a saxophone solo melody over groovy drums. Due to frequency clashing, it sounds muddy and muffled and it has an unbalanced stereo image, as it was probably recorded with a poor quality microphone. It is still energetic and easygoing at the same time - thanks to that saxophone. | ["unbalanced stereo", "low quality", "jazz", "groovy drums", "muffled", "reverberant", "muddy", "saxophone solo melody", "energetic", "easygoing"] | ["Carnatic music", "Music"] | 4 | 30 | 40 | 0 | 1 | ["/m/015vgc", "/m/04rlf"] |
mQM3Fd3eN9E | | The low quality recording features a song that consists of harmonized children vocals singing over bouncy snare, saxophone solo melody, groovy bass, shimmering open hats and "4 on the floor" kick pattern. There are some laughing, high pitched tire screeches and car engine sound effects in the background. The recording is very noisy and the song is quiet and thin, as it lacks bass frequencies, but it still sounds fun and happy - like something you would hear in TV shows for kids. | ["low quality", "quiet", "noisy", "fun", "happy", "harmonized children vocals", "bouncy snare", "thin", "laughing", "car engine sound effects", "static tv sound effect", "high pitched tire screech", "saxophone solo melody", "groovy bass", "shimmering open hat", "4 on the floor kick"] | ["Music", "Rattle (instrument)"] | 4 | 30 | 40 | 1 | 1 | ["/m/04rlf", "/m/05r5wn"] |
CREATE TABLE [musiccaps] (
   [ytid] TEXT PRIMARY KEY,
   [url] TEXT,
   [caption] TEXT,
   [aspect_list] TEXT,
   [audioset_names] TEXT,
   [author_id] TEXT,
   [start_s] TEXT,
   [end_s] TEXT,
   [is_balanced_subset] INTEGER,
   [is_audioset_eval] INTEGER,
   [audioset_ids] TEXT
);
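
The filter in the page title ("aspect_list contains ... and is_audioset_eval = 1") can be reproduced against a local copy of the table. The following is a minimal sketch, assuming that "contains" behaves like a substring match (SQL LIKE) and that aspect_list holds a JSON array stored as TEXT, as the rows above suggest. The helper names build_demo_db and find_rows are hypothetical, the in-memory database stands in for the real file, the aspect lists are trimmed, and the third row is invented purely to show the filter excluding something.

```python
import json
import sqlite3

# Schema as shown on the page.
SCHEMA = """
CREATE TABLE [musiccaps] (
  [ytid] TEXT PRIMARY KEY, [url] TEXT, [caption] TEXT, [aspect_list] TEXT,
  [audioset_names] TEXT, [author_id] TEXT, [start_s] TEXT, [end_s] TEXT,
  [is_balanced_subset] INTEGER, [is_audioset_eval] INTEGER, [audioset_ids] TEXT
);
"""

# Demo rows: (ytid, trimmed aspect_list, is_audioset_eval).
# The first two ytids come from the table above; the last row is hypothetical.
DEMO_ROWS = [
    ("ciJOulWFhfA", ["jazz", "groovy drums", "saxophone solo melody"], 1),
    ("mQM3Fd3eN9E", ["fun", "happy", "saxophone solo melody", "groovy bass"], 1),
    ("hypothetical_id", ["jazz", "saxophone solo melody"], 0),
]


def build_demo_db():
    """Create an in-memory database with the page's schema and the demo rows."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    conn.executemany(
        "INSERT INTO musiccaps (ytid, aspect_list, is_audioset_eval) VALUES (?, ?, ?)",
        [(ytid, json.dumps(aspects), flag) for ytid, aspects, flag in DEMO_ROWS],
    )
    return conn


def find_rows(conn, aspect, audioset_eval=1):
    """Return (ytid, aspects) pairs whose aspect_list includes `aspect` exactly."""
    sql = (
        "SELECT ytid, aspect_list FROM musiccaps "
        "WHERE aspect_list LIKE ? AND is_audioset_eval = ? "
        "ORDER BY ytid"
    )
    rows = conn.execute(sql, (f"%{aspect}%", audioset_eval)).fetchall()
    matches = []
    for ytid, raw in rows:
        aspects = json.loads(raw)  # decode the JSON array stored as TEXT
        if aspect in aspects:      # drop substring-only hits from LIKE
            matches.append((ytid, aspects))
    return matches


conn = build_demo_db()
for ytid, aspects in find_rows(conn, "saxophone solo melody"):
    print(ytid, aspects)
```

The LIKE clause narrows the rows in SQL, and the JSON decode afterwards keeps only exact aspect matches, so a search for "jazz" would not also return a row whose only related aspect is, say, "jazz fusion".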