musiccaps
3 rows where aspect_list contains "male voice" and audioset_ids contains "/m/025_jnm"
This data as json, CSV (advanced)
ytid ▼ | url | caption | aspect_list | audioset_names | author_id | start_s | end_s | is_balanced_subset | is_audioset_eval | audioset_ids |
---|---|---|---|---|---|---|---|---|---|---|
63rqIYPHvlc | Someone is beatboxing while doing a bassline with his voice. Finger snipping can be heard supporting the beat. This song may be playing at a talent show. | ["beatbox", "finger snipping", "male voice", "amateur recording", "medium tempo"] | ["Finger snapping", "Beatboxing", "Music"] | 6 | 30 | 40 | 0 | 1 | ["/m/025_jnm", "/m/02cz_7", "/m/04rlf"] | |
II1oyaWPiD0 | This is the recording of a trombone lesson. The male instructor is playing a note on the trombone and then speaking in an instructive manner in the Japanese language. The click of the metronome can be heard in the background. This recording can be sampled for use in beat-making. | ["trombone lesson", "male voice", "japanese language", "instructive speaking", "metronome click"] | ["Brass instrument", "Finger snapping", "Music", "Musical instrument", "Trumpet", "Speech"] | 9 | 160 | 170 | 0 | 0 | ["/m/01kcd", "/m/025_jnm", "/m/04rlf", "/m/04szw", "/m/07gql", "/m/09x0r"] | |
Ob9iaGon5ak | The excerpt features a song sounding from a speaker and being recorded with an amateur device like a phone. After a finger snap, the same song can be heard recorded in similar conditions but lower in volume. | ["male voice", "low quality recording", "finger snap", "different recordings of the same song"] | ["Finger snapping", "Music", "Pop music"] | 2 | 60 | 70 | 0 | 1 | ["/m/025_jnm", "/m/04rlf", "/m/064t9"] |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE [musiccaps] ( [ytid] TEXT PRIMARY KEY, [url] TEXT, [caption] TEXT, [aspect_list] TEXT, [audioset_names] TEXT, [author_id] TEXT, [start_s] TEXT, [end_s] TEXT, [is_balanced_subset] INTEGER, [is_audioset_eval] INTEGER, [audioset_ids] TEXT );