musiccaps
3 rows where aspect_list contains "male voice" and audioset_ids contains "/m/01p970"
ytid | url | caption | aspect_list | audioset_names | author_id | start_s | end_s | is_balanced_subset | is_audioset_eval | audioset_ids |
---|---|---|---|---|---|---|---|---|---|---|
E2v025Ilsqo | | This Indian folk song features a male voice singing the main melody. This is accompanied by tablas playing the percussion. A flute and another wind instrument voice are played on a synthesizer. The synth repeats the melody which is sung by the voice. A tambourine plays on every count of each bar. This folk song can be played in a village scene in an Indian movie. | ["folk song", "indian song", "tabla", "flute", "male voice", "tambourine", "synth sounds", "moderate tempo"] | ["Tabla", "Drum", "Percussion"] | 0 | 60 | 70 | 0 | 0 | ["/m/01p970", "/m/026t6", "/m/0l14md"] |
cG1dpyC8gV4 | | This clip features two male voices in conversation. The sound of a tabla is played. There is no music in this clip. There are no other instruments in this clip. | ["male voice", "conversation between two people", "tabla sound", "no other instruments", "no vocal melody"] | ["Tabla", "Drum", "Percussion"] | 0 | 10 | 20 | 0 | 0 | ["/m/01p970", "/m/026t6", "/m/0l14md"] |
vZ9IanI59gE | | This folk song features a male voice singing the main melody. This is accompanied by a tabla and dhol playing the percussion. A keyboard plays fills after the voice pauses. The other instruments are not audible due to the low audio quality. This song can be played at a Hindu religious gathering. | ["low quality audio", "indian folk song", "tabla", "dhol", "male voice", "keyboard", "moderate tempo", "religious function"] | ["Tabla", "Drum", "Percussion"] | 0 | 550 | 560 | 0 | 0 | ["/m/01p970", "/m/026t6", "/m/0l14md"] |
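
The aspect_list values above look like JSON arrays stored as text. A "contains" filter over such text is presumably a substring match, so "male voice" would also match a tag such as "female voice". The sketch below is a minimal illustration of exact tag matching, assuming the column really holds valid JSON. The `has_aspect` helper and the `hypothetical` value are illustrative and not part of the dataset.

```python
import json

# aspect_list text copied from row cG1dpyC8gV4 in the table above.
from_table = (
    '["male voice", "conversation between two people", "tabla sound", '
    '"no other instruments", "no vocal melody"]'
)
# Hypothetical value, NOT from the dataset, used to show the substring pitfall.
hypothetical = '["female voice", "tabla"]'


def has_aspect(aspect_list_text: str, aspect: str) -> bool:
    """Return True only if `aspect` is an exact element of the JSON list."""
    return aspect in json.loads(aspect_list_text)


# Substring test on the raw text: matches because "female voice" contains "male voice".
substring_hit = "male voice" in hypothetical
# Exact element test: no match for the hypothetical row, a match for the real row.
exact_hit_hypothetical = has_aspect(hypothetical, "male voice")
exact_hit_from_table = has_aspect(from_table, "male voice")

print(substring_hit, exact_hit_hypothetical, exact_hit_from_table)
```

Parsing the list costs more than a text match, so it may be more practical to filter with a substring first and then apply the exact check to the surviving rows.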
CREATE TABLE [musiccaps] (
  [ytid] TEXT PRIMARY KEY,
  [url] TEXT,
  [caption] TEXT,
  [aspect_list] TEXT,
  [audioset_names] TEXT,
  [author_id] TEXT,
  [start_s] TEXT,
  [end_s] TEXT,
  [is_balanced_subset] INTEGER,
  [is_audioset_eval] INTEGER,
  [audioset_ids] TEXT
);
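
The filter in the page summary (aspect_list contains "male voice" and audioset_ids contains "/m/01p970") can be approximated against this schema with two `LIKE` conditions. The following is a sketch under assumptions: it uses an in-memory SQLite database, the `contains_filter` function is a hypothetical helper, and the rows with ytid `hypothetical_a` and `hypothetical_b` are made up for illustration. Only the `cG1dpyC8gV4` values come from the table. The exact SQL that the hosting tool generates is not shown in the source, so treat the query as an approximation.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Schema as given in the source.
conn.execute(
    """CREATE TABLE [musiccaps] (
        [ytid] TEXT PRIMARY KEY, [url] TEXT, [caption] TEXT,
        [aspect_list] TEXT, [audioset_names] TEXT, [author_id] TEXT,
        [start_s] TEXT, [end_s] TEXT, [is_balanced_subset] INTEGER,
        [is_audioset_eval] INTEGER, [audioset_ids] TEXT
    )"""
)

rows = [
    # Real row from the table (only the columns the filter needs).
    ("cG1dpyC8gV4",
     '["male voice", "conversation between two people", "tabla sound", '
     '"no other instruments", "no vocal melody"]',
     '["/m/01p970", "/m/026t6", "/m/0l14md"]'),
    # Hypothetical rows, NOT from the dataset: each fails one condition.
    ("hypothetical_a", '["piano", "slow tempo"]', '["/m/01p970"]'),
    ("hypothetical_b", '["male voice"]', '["/m/026t6"]'),
]
conn.executemany(
    "INSERT INTO musiccaps (ytid, aspect_list, audioset_ids) VALUES (?, ?, ?)",
    rows,
)


def contains_filter(conn, aspect, audioset_id):
    """Return ytids whose text columns contain both substrings."""
    sql = (
        "SELECT ytid FROM musiccaps "
        "WHERE aspect_list LIKE ? AND audioset_ids LIKE ? "
        "ORDER BY ytid"
    )
    params = (f"%{aspect}%", f"%{audioset_id}%")
    return [row[0] for row in conn.execute(sql, params)]


matches = contains_filter(conn, "male voice", "/m/01p970")
print(matches)
```

The values are passed as bound parameters rather than interpolated into the SQL string. Note that `%` and `_` inside a search term would act as `LIKE` wildcards, which does not matter for the two terms used here.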