Reference the original paper: Drossos, K., Lipping, S., & Virtanen, T. (2020). "Clotho: an Audio Captioning Dataset." Proc. IEEE ICASSP, pp. 736-740 .
The dataset is hosted by the and can be accessed through platforms like Zenodo . Download 736 740 zip
You can also download specific evaluation (1.2 GB) or analysis (14.4 GB) subsets. 🛠️ Producing a Write-up Reference the original paper: Drossos, K
The request to "Download 736 740 zip" most likely refers to downloading the , a prominent audio captioning collection often cited in research papers by its specific page range, 736–740 . 🎧 The Clotho Dataset Reference the original paper: Drossos
If you are writing a technical report or paper using this data, ensure you include these standard sections: