ytid,url,caption,aspect_list,audioset_names,author_id,start_s,end_s,is_balanced_subset,is_audioset_eval,audioset_ids
4S4NDnNTptY,https://www.youtube.com/watch?v=4S4NDnNTptY&start=30&end=40,"The low quality recording features an orchestra that contains a wide string section, aggressive triangle cymbal and woodwind section dynamically playing an addictive melody. It is very noisy, uptempo, energetic and dynamic.","[""orchestra"", ""low quality"", ""noisy"", ""energetic"", ""dynamic"", ""wide strings section"", ""aggressive triangle cymbal"", ""uptempo"", ""brass section"", ""woodwind section""]","[""Music"", ""Opera""]",4,30,40,0,1,"[""/m/04rlf"", ""/m/05lls""]"
BlsbeyimUDE,https://www.youtube.com/watch?v=BlsbeyimUDE&start=30&end=40,"The low quality recording features a jazz song playing in the background while people are shoe tapping to it. The jazz song at least consists of passionate male vocal, groovy bass and brass section, since those elements are audible. It is reverberant and in mono, as it was probably recorded with a phone.","[""low quality"", ""shoe tapping"", ""jazz"", ""brass section"", ""passionate male vocal"", ""groovy bass"", ""reverberant"", ""mono""]","[""Applause"", ""Music""]",4,30,40,0,1,"[""/m/028ght"", ""/m/04rlf""]"
HVsXJDR1_Lw,https://www.youtube.com/watch?v=HVsXJDR1_Lw&start=30&end=40,"The low quality recording features a musical played on a small device that reproduces mono sound. The song consists of energetic drums, brass section and passionate female vocal singing on top of it. It sounds noisy, muffled and thin, as it was probably recorded with a poor quality microphone.","[""low quality"", ""live performance"", ""muffled"", ""thin"", ""orchestral"", ""passionate female vocal"", ""mono"", ""musical"", ""energetic drums"", ""brass section"", ""noisy""]","[""Singing"", ""Gospel music"", ""Music""]",4,30,40,0,1,"[""/m/015lz1"", ""/m/016cjb"", ""/m/04rlf""]"
_n9boKzVRhs,https://www.youtube.com/watch?v=_n9boKzVRhs&start=30&end=40,"The low quality recording features a mono recording located only in the right channel of the stereo image. It consists of at least brass section melody, shimmering cymbals and melodic female vocals. There are some crowd chattering noises too, but it sounds fun and happy.","[""low quality"", ""mono"", ""noisy"", ""brass section"", ""melodic female vocal"", ""happy"", ""fun"", ""crowd chattering""]","[""Yodeling"", ""Music"", ""Music of Latin America""]",4,30,40,1,1,"[""/m/01swy6"", ""/m/04rlf"", ""/m/0g293""]"