about the audioset-laughter annotations #20

sonebu · 2022-09-21T18:42:13Z

Hi, thanks a lot for the contribution and the repository.

I have two questions about the audioset annotations (calling that 999-element set the "audioset-laughter" set hereon):

There are some weird annotations like start=end=0 (examples are on lines 7, 29, 80, 88, 95, 102, ... there are more). Is that a special annotation (e.g., does that mean the whole file contains a laugh etc.)? I don't understand what a zero-length laugh segment means
does "window_start" correspond to the start time instant in the youtube video for the recorded audio snippet?
"audio_length" and "window_length" seem to be equal at all times, I'm guessing that's the length (in seconds) of the recorded audio snippet I described above, is that correct?
I think this script downloads mp3 audio files for youtube videos that are specified on a csv. Some csv files can be downloaded using this script, but it seems like none of the csv files there correspond to the clips in the audioset-laughter annotations (950 of the IDs on the "unbalanced_train_segments.csv" and 38 of the IDs on the "eval_segments.csv" match with the audioset-laughter IDs, but this even is not a full list). Is there a csv that can be fed to the download script to download just the audioset-laughter audio files?

Provide feedback