musiccaps
1 row where aspect_list contains "conversation between two people" and aspect_list contains "no vocal melody"
This data as json, CSV (advanced)
aspect_list (array) 5 ✖
- conversation between two people · 1 ✖
- male voice 1
- no other instruments 1
- no vocal melody · 1 ✖
- tabla sound 1
ytid ▼ | url | caption | aspect_list | audioset_names | author_id | start_s | end_s | is_balanced_subset | is_audioset_eval | audioset_ids |
---|---|---|---|---|---|---|---|---|---|---|
cG1dpyC8gV4 | This clip features two male voices in conversation. The sound of a tabla is played. There is no music in this clip. There are no other instruments in this clip. | ["male voice", "conversation between two people", "tabla sound", "no other instruments", "no vocal melody"] | ["Tabla", "Drum", "Percussion"] | 0 | 10 | 20 | 0 | 0 | ["/m/01p970", "/m/026t6", "/m/0l14md"] |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE [musiccaps] ( [ytid] TEXT PRIMARY KEY, [url] TEXT, [caption] TEXT, [aspect_list] TEXT, [audioset_names] TEXT, [author_id] TEXT, [start_s] TEXT, [end_s] TEXT, [is_balanced_subset] INTEGER, [is_audioset_eval] INTEGER, [audioset_ids] TEXT );