musiccaps
2 rows where audioset_names contains "Child speech, kid speaking", audioset_names contains "Music for children" and audioset_names contains "Speech"
This data as json, CSV (advanced)
ytid ▼ | url | caption | aspect_list | audioset_names | author_id | start_s | end_s | is_balanced_subset | is_audioset_eval | audioset_ids |
---|---|---|---|---|---|---|---|---|---|---|
OiAJB9uydS8 | This music is instrumental. The tempo is medium with a piano harmony with the voice of a child speaking and a bee buzzing superimposed on the music. This clip is a tutorial on how to speak English. The music is subdued with the emphatic and loud voice of the child. | ["instrumental", "medium tempo", "kindergarten videos", "toddlers", "videos for toddlers", "montessori", "child speaking", "subdued music", "bee buzzing", "children videos", "watch and learn", "english tutorial for kids", "english tutorials"] | ["Music", "Music for children", "Buzz", "Speech", "Child speech, kid speaking"] | 7 | 100 | 110 | 0 | 0 | ["/m/04rlf", "/m/05fw6t", "/m/07pjwq1", "/m/09x0r", "/m/0ytgt"] | |
yhbpAGdv_d8 | These are sounds coming from a cartoon. Two boys are having a conversation in the Korean language. There is a spring sound effect that can be heard repeatedly. In the background, a generic orchestra piece is playing. The trumpet and the flute take turns in playing the main melody while other brass instruments and a snare drum accompanies them. | ["cartoon", "male child voices", "conversation", "korean language", "sound effects", "spring sound", "boing", "backing track", "generic", "brass", "trumpet", "flute", "snare drum"] | ["Music", "Music for children", "Speech", "Child speech, kid speaking", "Boing"] | 9 | 50 | 60 | 0 | 0 | ["/m/04rlf", "/m/05fw6t", "/m/09x0r", "/m/0ytgt", "/t/dd00121"] |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE [musiccaps] ( [ytid] TEXT PRIMARY KEY, [url] TEXT, [caption] TEXT, [aspect_list] TEXT, [audioset_names] TEXT, [author_id] TEXT, [start_s] TEXT, [end_s] TEXT, [is_balanced_subset] INTEGER, [is_audioset_eval] INTEGER, [audioset_ids] TEXT );