Stel je voor dat je een foto van een negentiende-eeuws marktplein bekijkt. Je ziet marktkramers, paarden met koetsen en een historische tram. Hoe zou het daar geklonken hebben? Het geklop van hoeven, de tram die davert over de sporen, het geroezemoes van stemmen… Een dergelijke scène bestaande uit verschillende geluiden, wordt ook wel een soundscape genoemd. Historici, erfgoedkundigen en akoestische ingenieurs slaan steeds vaker de handen in elkaar om historische soundscapes opnieuw tot leven te wekken, om zo het verleden beter te kunnen beleven en begrijpen. Daar gebruiken ze verschillende bronnen voor, zoals reisdagboeken en historische foto’s. Een probleem waar ze tegen opstoten, is het feit dat het verzamelen en analyseren van deze bronnen heel veel tijd kost. Daarom ben ik in mijn thesis op zoek gegaan hoe artificiële intelligentie ons kan helpen met het automatisch terugvinden van geluidsbronnen in historische foto’s.
Hoe kan een computer “zien” wat geluid maakt?
De eerste stap in het betrouwbaar reconstrueren van een historische soundscape, is het verzamelen van voldoende bronmateriaal, bijvoorbeeld historische foto’s. Daarna moet je weten welke dingen in een foto geluid maken. Mensen zien dat meteen. Wij leggen onmiddellijk de link tussen een tram en het bijhorende geluid. Maar wat als je duizenden foto’s hebt die je moet doornemen? Dat neemt behoorlijk wat tijd in beslag. Enter artificiële intelligentie.
Een computer kan ons bij deze taak helpen via objectdetectie. Dat is een tak binnen AI waarbij een computer leert om objecten in foto’s of video’s te herkennen. Je traint een model door het zelf enkele voorbeelden te geven, waarbij je een kader tekent rond de objecten die het model later zelf moet herkennen. Objectdetectie wordt al veel gebruikt in verschillende toepassingen, denk maar aan zelfrijdende auto’s die voetgangers, verkeersborden en andere auto’s in het verkeer herkennen. Verschillende voorgetrainde modellen zijn vrij beschikbaar en kunnen verder worden getraind op de eigen data om zo de gewenste objecten te herkennen. Maar hoe werken zo’n voorgetrainde modellen nu op historische foto’s? Dat is iets wat tot nu toe nauwelijks is onderzocht.
De computer les geven
In mijn onderzoek ging ik van start met drie populaire bestaande objectdetectie modellen te testen op historische foto’s, zonder deze eerst te trainen op mijn eigen data. Op die manier kon ik achterhalen welk model in de “basisversie” het best werkte op historische foto’s. Het verschil in prestatie op moderne en historische foto’s was, zoals verwacht, groot. Historische foto’s verschillen soms sterk van moderne afbeeldingen, zo zijn ze vervaagd, korrelig of hebben een ander contrast en scherpte en bovendien zien de objecten er stilistisch ook anders uit. Een historische tram verschilt immers van een moderne tram. Voor het menselijk oog blijft het meestal duidelijk wat er afgebeeld staat, maar voor een objectdetectie model dat is voorgetraind op duizenden moderne foto’s zorgen zo’n afwijkingen ervoor dat objecten veel moeilijker te herkennen zijn.
Het best presterende model werd gekozen om verder te trainen aan de hand van historische foto’s, om zo te zorgen dat het model betrouwbaarder zou worden. Daarbij verhoogde ik kunstmatig het aantal trainingsbeelden door foto’s bijvoorbeeld donkerder, lichter, of minder scherp te maken. Hoe meer en gevarieerder de foto’s waar het model van kan leren, hoe beter het presteert. Uiteindelijk trainde ik twee modellen, één dat trams herkende en één dat koetsen kon aanduiden. Voor de trams had het model al een stevige basis, omdat die categorie van objecten ook voorkomt in de moderne datasets waarop het model oorspronkelijk werd getraind. Voor koetsen was dat helemaal anders, dat was namelijk een volledig nieuwe categorie die het model van nul moest leren herkennen. Daar ontdekte ik dat er veel meer data nodig is om tot goede resultaten te komen.
Een moderne foto in een historisch jasje
Om een computer te leren wat er op foto’s staat, is dus een hele hoop data nodig. Maar laat dat nu net een probleem zijn bij historische foto’s. Er zijn minder afbeeldingen beschikbaar en het labelen van de objecten voor de training is een heus monnikenwerk. Zou AI ons daar ook kunnen helpen?
In mijn onderzoek testte ik een bestaand model dat afbeeldingen kan omzetten in een andere stijl. Zo kon het, na training op historische en moderne afbeeldingen, een moderne foto er laten uitzien alsof hij tweehonderd jaar geleden werd genomen, en omgekeerd. Die eerste richting werkte na wat training verrassend goed. Een hedendaagse stadsfoto leek plots op een vergeelde archieffoto. Dat opent de deur om in verder onderzoek moderne datasets, waarvan er heel wat al vooraf gelabeld en vrij beschikbaar zijn, een historisch jasje te geven en deze zo te gebruiken om snel meer trainingsdata te verkrijgen. De omgekeerde weg bleek moeilijker. Het model had moeite om historische foto’s “in te kleuren”. Waar simpele en consistente elementen zoals lucht, water of bomen er wel realistisch uitzagen, struikelde het model over complexe objecten zoals mensen.
En wat hebben we vandaag geleerd?
In mijn thesis ging ik op zoek naar hoe AI onderzoekers kan helpen bij het verzamelen van data uit historische foto’s om later te gebruiken in de reconstructie van soundscapes. Als pilootproject trainde ik twee objectdetectie modellen, één om trams te herkennen en één dat koetsen kan aanduiden. Daarmee zette ik een eerste stap richting een model dat ook werkt op historische foto’s, iets wat tot nu toe nog niet bestond. Het experiment leverde veelbelovende resultaten op en toonde aan dat objectdetectie een interessante hulp kan zijn bij het verzamelen van data. Toekomstig onderzoek zou kunnen inzetten op samenwerking tussen archieven en onderzoekers, zodat één model kan worden opgebouwd dat steeds beter wordt in het herkennen van een brede waaier aan objecten in historische foto’s en op die manier wetenschappers veel tijd bespaart bij het verzamelen van data.
"Add Fasterrcnn Improved Weights #5763." GitHub, accessed 10 April 2025, https://github.com/pytorch/vision/pull/5763.
"Add Retinanet Improved Weights #5756." GitHub, accessed 10 April 2025, https://github.com/pytorch/vision/pull/5756.
Ali, Momina Liaqat, and Zhou Zhang. "The Yolo Framework: A Comprehensive Review of Evolution, Applications, and Benchmarks in Object Detection." Computers 13, 12 (2024). https://doi.org/10.3390/computers13120336.
Andres, Alain, Aitor Martinez-Seras, Ibai Laña, and Javier Del Ser. "On the Black-Box Explainability of Object Detection Models for Safe and Trustworthy Industrial Applications." Results in Engineering 24 (2024). https://doi.org/10.1016/j.rineng.2024.103498.
"Artificial Intelligence and Copyright: Use of Generative Ai Tools to Develop New Content." European Commission, 2024, accessed 4 April 2025, https://intellectual-property-helpdesk.ec.europa.eu/news-events/news/ar….
"Automatically Label Image Data." Roboflow, accessed 14 November 2024, https://roboflow.com/auto-label.
Bai, Jianwei. "Ancient Chinese Painting Style Transfer Based on Cyclegan." Applied and Computational Engineering 51, no. 1 (2024): 129-36. https://doi.org/10.54254/2755-2721/51/20241192.
105
Bender, Ana, Manuela Guerreiro, Dora Agapito, Bernardete Dias Sequeira, and Júlio Mendes. "Sensory Experiences in Heritage Contexts: A Qualitative Approach.". European Journal of Tourism Research 36 (2024). https://doi.org/10.54055/ejtr.v36i.3060.
Bengamra, Siwar, Olfa Mzoughi, André Bigand, and Ezzeddine Zagrouba. "A Comprehensive Survey on Object Detection in Visual Art: Taxonomy and Challenge." Multimedia Tools and Applications 83, no. 5 (2024): 14637–70. https://doi.org/10.1007/s11042-023-15968-9.
Berardi, Umberto, Gino Iannace, and Luigi Maffei. "Virtual Reconstruction of the Historical Acoustics of the Odeon of Pompeii." Journal of Cultural Heritage 19 (2016): 555-66. https://doi.org/10.1016/j.culher.2015.12.004.
"Guide to Generative Adversarial Networks (Gans) in 2025." viso.ai, Updated 1 October 2024, 2024, accessed 24 December 2024, https://viso.ai/deep-learning/generative-adversarial-networks-gan/.
"Bretez." accessed 10 April 2025, https://sites.google.com/site/louisbretez/accueil.
Briatore, Samuele. "Immaginare I Suoni. Ricostruzione Del Paesaggio Sonoro Della Festa Barocca." Arti dello Spettacolo / Performing Arts 3, no. 3 (2017).
Capurro, Carlotta, and Gertjan Plets. "Europeana, Edm, and the Europeanisation of Cultural Heritage Institutions." Digital Culture & Society 6, no. 2 (2020): 163-90. https://doi.org/10.14361/dcs-2020-0209.
Carneiro, Tiago, Raul Victor Medeiros Da NóBrega, Thiago Nepomuceno, Gui-Bin Bian, Victor Hugo C. De Albuquerque, and Pedro Pedrosa Rebouças Filho. "Performance Analysis of Google Colaboratory as a Tool for Accelerating Deep Learning Applications." IEEE Access 6 (2018): 61677-85. https://doi.org/10.1109/ACCESS.2018.2874767.
Chen, Yen-Chia, Hiroki Shibata, Lieu-Hen Chen, and Yasufumi Takama. "Synthesis of Comic-Style Portraits Using Combination of Cyclegan and Pix2pix." Journal of Advanced Computational Intelligence and Intelligent Informatics 28, no. 5 (2024): 1085-94. https://doi.org/10.20965/jaciii.2024.p1085.
Crasto, Nieves. "Class Imbalance in Object Detection: An Experimental Diagnosis and Study of Mitigation Strategies." arXiv (Cornell University) (2024). https://doi.org/10.48550/arXiv.2403.07113.
"Generative Adversarial Networks." Medium, Updated 30 October 2023, 2023, accessed 24 December 2024, https://medium.com/@marcodelpra/generative-adversarial-networks-dba10e1….
"Soundscape Archaeology: A Visit to Paris in the Mid 18th/Early 19th Century." Fondation Napoléon, 2020, accessed 10 April 2025, https://www.napoleon.org/en/history-of-the-two-empires/videos/soundscap….
Doan, Anh-Dzung, Bach Long Nguyen, Surabhi Gupta, Ian Reid, Markus Wagner, and Tat-Jun Chin. "Assessing Domain Gap for Continual Domain Adaptation in Object Detection." Computer Vision and Image Understanding 238 (2024). https://doi.org/10.1016/j.cviu.2023.103885.
"Erfgoed Brugge." Erfgoed Brugge, accessed 10 October 2024, https://erfgoedbrugge.be/.
"Erfgoedbank Brussel." Erfgoedbank Brussel, accessed 10 October 2024, https://erfgoedbankbrussel.be/.
"Europeana." Europeana, accessed 10 October 2024, https://www.europeana.eu/nl.
"Explore Ultralytics Yolov8." Ultralytics, accessed 10 April 2025, https://docs.ultralytics.com/models/yolov8/.
"Fasterrcnn_Resnet50_Fpn_V2." PyTorch, accessed 10 January 2025, https://pytorch.org/vision/main/models/generated/torchvision.models.det….
"Felixarchief." Stad Antwerpen, accessed 10 October 2024, https://felixarchief.antwerpen.be/.
"Fine-Tuning." Ultralytics, accessed 10 April 2025, https://www.ultralytics.com/glossary/fine-tuning.
Fiorucci, Marco , Marina Khoroshiltseva, Massimiliano Pontil, Arianna Traviglia, Alessio Del Bue, and Stuart James. "Machine Learning for Cultural Heritage: A Survey." Pattern Recognition Letters 133 (2020): 102-08. https://doi.org/10.1016/j.patrec.2020.02.017.
Firat, Hasan Baran. "Acoustics as Tangible Heritage: Re-Embodying the Sensory Heritage in the Boundless Reign of Sight." Preservation, Digital Technology & Culture 50, no. 1 (2021): 3-14. https://doi.org/10.1515/pdtc-2020-0028.
106
Firat, Hasan Baran, and Luigi Maffei. "A Methodology for the Historically Informed Soundscape." Inter-.Noise 2020, e-congress, 2020.
Firat, Hasan Baran, Luigi Maffei, and Massimiliano Masullo. "Digital Humanities in the Historical Soundscape Research: Sounds of 18th Century Naples." The Acoustics of Ancient Theatres, Verona, Italy, 2022.
"Fotografisch Geheugen: De Reportages Van Fotopersbureau De Boer." VeleHanden, accessed 22 April 2025, https://velehanden.nl/projecten/bekijk/details/project/ranh_tagselectio….
Galvano, Francesco. "The Triad of Senses, Emotions, and Memory: Dynamic Interactions and Multidisciplinary Implications." (2015): 1-12.
"Launch: Synthetic Image Generation with Dall-E and Gpt-4 Vision." Roboflow, 2023, accessed 3 May 2025, https://blog.roboflow.com/synthetic-data-dall-e-roboflow/.
Ultralytics Yolov8.
Goodfellow, Ian, Jean Pouget-Abadie, Mehd Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. "Generative Adversarial Networks." arXiv (Cornell University) (2014): 1-9. https://doi.org/10.48550/arxiv.1406.2661.
"Google Colaboratory." Google, accessed 14 November 2024, https://colab.google/#:~:text=Google%20Colaboratory,%2C%20data%20scienc….
Graybill, Rhiannon. "‘Hear and Give Ear!’: The Soundscape of Jeremiah." Journal for the Study of the Old Testament 40, no. 4 (2016): 467-90. https://doi.org/10.1177/0309089216628414.
Hatir, Ergün, Korkanç Mustafa, Andreas Schacner, and İsmail İnce. "The Deep Learning Method Applied to the Detection and Mapping of Stone Deterioration in Open-Air Sanctuaries of the Hittite Period in Anatolia." Journal of Cultural Heritage 51 (2021): 37-49. https://doi.org/10.1016/j.culher.2021.07.004.
"Hearing the Past: Reconstructing the Aural Heritage of Antwerp in the 19th Century." University of Antwerp, accessed 10 April 2025, https://www.uantwerpen.be/en/projects/hearing-the-past/about/.
Hindarto, Djarot. "Revolution in Image Data Collection: Cyclegan as a Dataset Generator." Sinkron : Jurnal Dan Penelitian Teknik Informatika 8, no. 1 (2024): 444-54. https://doi.org/10.33395/sinkron.v9i1.13211.
Howes, David. "Introduction to Sensory Museology." The Senses and Society 9, no. 3 (2014): 259-67. https://doi.org/10.2752/174589314X14023847039917.
Ibrahim, Bekkouch Imad Eddine, Victoria Eyharabide, Valérie Le Page, and Frédéric Billiet. "Few-Shot Object Detection: Application to Medieval Musicological Studies." Journal of Imaging 8, no. 18 (2022). https://doi.org/10.3390/jimaging8020018.
Jabir, Brahim, Noureddine Falir, and Khalid Rahmani. "Accuracy and Efficiency Comparison of Object Detection Open-Source Models." International Journal of Online and Biomedical Engineering 15, no. 5 (2021): 165-83. https://doi.org/10.3991/ijoe.v17i05.21833.
Kadish, David, Sebastian Risi, and Anders Sundess Løvlie. "Improving Object Detection in Art Images Using Only Style Transfer." arXiv (Cornell University) (2021). https://doi.org/10.48550/arxiv.2102.06529.
Kang, Jian, Francesco Aletta, Truls T. Gjestland, Lex A. Brown, Dick Botteldooren, Brigitte Schulte-Fortkamp, Peter Lercher, et al. "Ten Questions on the Soundscapes of the Built Environment." Building and Environment 108 (2016): 284-94. https://doi.org/10.1016/j.buildenv.2016.08.011.
Karbouj, Bsher, Garabet A. Topalian-Rivas, and Jörg Krüger. "Comparative Performance Evaluation of One-Stage and Two-Stage Object Detectors for Screw Head Detection and Classification in Disassembly Processes." Procedia CIRP 122 (2024): 527-32. https://doi.org/10.1016/j.procir.2024.01.077.
Kim, Yongho, Chanjong Im, and Thomas Mandl. "Object Detection in Historical Images: Transfer Learning and Pseudo Labelling." ACM Journal on Computing and Cultural Heritage (2024): 1-15. https://doi.org/10.1145/3699963.
"Retinanet: Single-Stage Object Detector with Accuracy Focus." viso.ai, 2024, accessed 3 April 2025, https://viso.ai/deep-learning/retinanet/.
107
Kulkarni, Uday, Meena S.M., Sunil V. Gurlahosur, and Uma Mudengudi. "Classification of Cultural Heritage Sites Using Transfer Learning." IEEE Fifth International Conference on Multimedia Big Data (BigMM), Signapore, 2019.
Lee, Benjamin Charles Germain. "The “Collections as Ml Data” Checklist for Machine Learning and Cultural Heritage." Journal of the Association for Information Science and Technology 76, no. 2 (2023): 375-96. https://doi.org/10.1002/asi.24765.
Leon-Gomez, Eder Arley, Andrés Marino Álvarez-Meza, and German Castellanos-Dominguez. "Cross-Dataset Data Augmentation Using Umap for Deep Learning-Based Wind Speed Prediction." Computers 14, no. 4 (2025): 123-43. https://doi.org/10.3390/computers14040123.
"Library of Congress." Library of Congress, accessed 10 October 2024, https://www.loc.gov/.
Lin, Tsung-Yi, Priya Goyal, Ross Girshick, Kaiming He, and Piotr Dollár. "Focal Loss for Dense Object Detection." arXiv (Cornell University) (2017). https://doi.org/10.48550/arXiv.1708.02002.
Luo, Dan, Lieve Doucé, and Karin Nys. "Multisensory Museum Experience: An Integrative View and Future Research Directions." Museum Management and Curatorship (2024): 1-28. https://doi.org/10.1080/09647775.2024.2357071.
Luo, Vinson, Michael Straka, and Lucy Li. "Historical and Modern Image-to-Image Translation with Generative Adversarial Networks." (2017): 1-8.
Maiwald, Ferdinand. "Generation of a Benchmark Dataset Using Historical Photographs for Automated Evaluation of Different Feature Matching Methods." The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences 42, no. 2 (2019): 87-94. https://doi.org/10.5194/isprs-archives-XLII-2-W13-87-2019.
Malcolm-Davies, Jane. "Structuring Reconstructions: Recognising the Advantages of Interdisciplinary Data in Methodical Research." Heritage Science 11, no. 1 (2023): 1-16. https://doi.org/10.1186/s40494-023-00982-9.
"Ai ‘Opt-Outs’: Should Cultural Heritage Institutions (Dis)Allow the Mining of Cultural Heritage Data?" Europeana Pro, 2024, accessed 4 April 2025, https://pro.europeana.eu/post/ai-opt-outs-should-cultural-heritage-inst….
Monna, Fabrice, Tanguy Rolland, Anthony Denaire, Nicolas Navarro, Ludovic Granjon, Rémi Barbé, and Carmela Chateau-Smith. "Deep Learning to Detect Built Cultural Heritage from Satellite Imagery. - Spatial Distribution and Size of Vernacular Houses in Sumba, Indonesia." Journal of Cultural Heritage 52 (2021): 171-83. https://doi.org/10.1016/j.culher.2021.10.004.
Mumuni, Alhassan, and Fuseini Mumuni. "Data Augmentation: A Comprehensive Survey of Modern Approaches." Array 16 (2022). https://doi.org/10.1016/j.array.2022.100258.
Müske, Johannes. "Constructing Sonic Heritage: The Accumulation of Knowledge in the Context of Sound Archives." The Journal of Ethnology and Folkloristics 4, no. 1 (2011): 37-47.
"What Is Yolo? The Ultimate Guide [2025]." Roboflow, 2025, accessed 2 April 2025, https://blog.roboflow.com/guide-to-yolo-models/#yolov12.
Padilla, Rafael, Sergio L. Netto, and Eduardo A. B. da Silva. "A Survey on Performance Metrics for Object-Detection Algorithms." In Proceedings of the 2020 International Conference on Systems, Signals and Image Processing (Iwssip), edited by Anselmo C. Paiva, Aura Conci, Geraldo Braz Jr., João Dallyson S Almeida and Leandro A. F. Fernandes, 237-42: Institute of Computing at Fluminense Federal University (IC-UFF), 2020.
Pai, Suraj, Ibrahim Hadzic, Chinmay Rao, Ivan Zhovannik, Andre Dekker, Alberto Traverso, Stylianos Asteriadis, and Enrique Hortal. "Frequency-Domain-Based Structure Losses for Cyclegan-Based Cone-Beam Computed Tomography Translation." Sensors 23, no. 3 (2023). https://doi.org/10.3390/s23031089.
Pardoen, Mylène. "Projet Bretez: Une Pincée De Son Dans L’histoire.". Digital Studies/Le champ numérique 9, no. 1 (2019): 1-17. https://doi.org/10.16995/dscn.350.
"Restoring the Historical Sound of Paris." Hypotheses, 2016, accessed 10 April 2025, https://sms.hypotheses.org/8560.
Park, Jaihyun, David K. Han, and Hanseok Ko. "Adaptive Weighted Multi-Discriminator Cyclegan for Underwater Image Enhancement." Journal of Marine Science and Engineering 7, no. 7 (2019). https://doi.org/10.3390/jmse7070200.
108
Patra, Aditya, and Rae Crandall. "Machine Learning for Visually Impaired: Benchmarking Object Detection Models." Journal of Student Research 13, no. 2 (2024): 1-14. https://doi.org/10.47611/jsrhs.v13i2.6630.
Pietroni, Eva. "Mapping the Soundscape in Communicative Forms for Cultural Heritage: Between Realism and Symbolism." Heritage 4, no. 4 (2021): 4495-523. https://doi.org/10.3390/heritage4040248.
Poudel, Anil. "Face Recognition on Historical Photographs." Uppsala University, 2021.
"Pseudo Labeling: Leveraging the Power of Self-Supervision in Machine Learning." Medium, Updated 1 February 2024, 2024, accessed 27 December 2024, https://medium.com/@data-overload/pseudo-labeling-leveraging-the-power-….
Redmon, Joseph, Santosh Divvala, Ross Girshick, and Ali Farhadi. "You Only Look Once: Unified, Real-Time Object Detection." arXiv (Cornell University) (2015). https://doi.org/10.48550/arXiv.1506.02640.
"Regulation (Eu) 2024/1689 of the European Parliament and of the Council of 13 June 2024 Laying Down Harmonised Rules on Artificial Intelligence and Amending Regulations (Ec) No 300/2008, (Eu) No 167/2013, (Eu) No 168/2013, (Eu) 2018/858, (Eu) 2018/1139 and (Eu) 2019/2144 and Directives 2014/90/Eu, (Eu) 2016/797 and (Eu) 2020/1828 (Artificial Intelligence Act)." edited by European Union and European Council, 2024. http://data.europa.eu/eli/reg/2024/1689/oj.
Ren, Shaoqing, Kaiming He, Ross Girshick, and Jian Sun. "Faster R-Cnn: Towards Real-Time Object Detection with Region Proposal Networks." arXiv (Cornell University) (2015). https://doi.org/10.48550/arXiv.1506.01497.
"Retinanet_Resnet50_Fpn_V2." PyTorch, accessed 10 January 2025, https://pytorch.org/vision/0.20/models/generated/torchvision.models.det….
"Roboflow." Roboflow, accessed 14 November 2024, https://roboflow.com/.
"Train-Yolov8-Object-Detection-on-Custom-Dataset.Ipynb." Google Colab, accessed 24 January 2025, https://colab.research.google.com/github/roboflow-ai/notebooks/blob/mai….
Sapkota, Ranjan, and Manoj Karkee. "Generative Ai in Agriculture: Creating Image Datasets Using Dall.E's Advanced Large Language Model Capabilities." arXiv (Cornell University) (2023). https://doi.org/10.48550/arXiv.2307.08789.
Schafer, R. Murray. "Open Ears." Soundscape: The Journal of Acoustic Ecology 4, no. 2 (2003): 14-18.
———. The Soundscape: Our Sonic Environment and the Tuning of the World. New York: Alfred Knopf, Inc., 1977.
Sender Contell, Marina, Ana Planells, Ricardo Perelló Roso, Jaume Segura Garcia, and Alicia Giménez. "Virtual Acoustic Reconstruction of a Lost Church: Application to an Order of Saint Jerome Monastery in Alzira, Spain." Journal of Building Performance Simulation 11, no. 3 (2018): 369-90. https://doi.org/10.1080/19401493.2017.1340975.
Shin, Jeongmin, and Hyeryung Jang. "Data Augmentation Techniques Using Text-to-Image Diffusion Models for Enhanced Data Diversity." 2024 15th International Conference on Information and Communication Technology Convergence (ICTC), Jeju Island, Republic of Korea, IEEE, 2024.
Shorten, Connor, and Taghi M Khoshgoftaar. "A Survey on Image Data Augmentation for Deep Learning." Journal of Big Data 6, no. 1 (2019). https://doi.org/10.1186/s40537-019-0197-0.
"Smell Heritage – Sensory Mining." accessed 30 March 2025, https://odeuropa.eu/.
Sohan, Mupparaju, Thotakura SaiRam, and Ch. Venkata RamiReddy. "A Review on Yolov8 and Its Advancements." In Data Intelligence and Cognitive Informatics. Proceedings of Icdici 2023., edited by I. Jeena Jacob, Selwyn Piramuthu and Przemyslaw Falkowski-Gilski. Algorithms for Intelligent Systems, 529-45. Singapore: Springer Singapore, 2024.
"Sonic Heritage - Exploring the Sounds of the World's Most Famous Sights." Cities and Memory, Oxford, accessed 31 March 2025, https://citiesandmemory.com/heritage/.
"The Soundscape of Istanbul Collection - About Collection." Koç University, accessed 31 March 2025, https://librarydigitalcollections.ku.edu.tr/en/collection/soundscape-of….
109
Terven, Juan, Diana-Margarita Córdova-Esparza, and Julio-Alejandro Romero-González. "A Comprehensive Review of Yolo Architectures in Computer Vision: From Yolov1 to Yolov8 and Yolo-Nas." Machine Learning and Knowledge Extraction 5, no. 4 (2023): 1680-716. https://doi.org/10.3390/make5040083.
Tong, Zhaoxiang. "Exploring the Impact of Hyperparameters on the Generation Quality of Cyclegan." Transactions on Computer Science and Intelligent Systems Research 5 (2024): 265-71. https://doi.org/10.62051/01m93a63.
"Transfer Learning with Frozen Layers in Yolov5." Ultralytics, accessed 10 April 2025, https://docs.ultralytics.com/yolov5/tutorials/transfer_learning_with_fr….
Ulmer, Maximilian, Leonard Klüpfel, Maximilian Durner, and Rudolph Triebel. "How Important Are Data Augmentations to Close the Domain Gap for Object Detection in Orbit?". arXiv (Cornell University) (2024). https://doi.org/10.48550/arXiv.2410.15766.
"Ultralytics Yolo Hyperparameter Tuning Guide." Ultralytics, accessed 24 April 2025, https://docs.ultralytics.com/guides/hyperparameter-tuning/.
UNESCO. "39 C/49 the Importance of Sound in Today's World: Promoting Best Practices." General Conference, 39th session, Paris 2017.
———. Basic Texts of the 2003 Convention for the Safeguarding of the Intangible Cultural Heritage. 2022 ed. Paris: France, 2022.
"Unsplash." Unsplash, accessed 10 October 2024, https://unsplash.com/.
"All You Need to Know About Explainable Ai (Xai)." Ultralytics, 2024, accessed 17 April 2025, https://www.ultralytics.com/blog/all-you-need-to-know-about-explainable….
Wei, Jian, Qinzhao Wang, and Zixu Zhao. "Yolo-G: Improved Yolo for Cross-Domain Object Detection." PLoS ONE 18, no. 9 (2023). https://doi.org/10.1371/journal.pone.0291241.
Wevers, Melvin. "Scene Detection in De Boer Historical Photo Collection." In Proceedings of the 13th International Conference on Agents and Artificial Intelligence, edited by Ana Paula Rocha, Luc Steels and Jaap Van den Herik, 301-610. Cham: Springer, 2021.
"What Is Cities and Memory?" Cities and Memory, Oxford, accessed 31 March 2025, https://citiesandmemory.com/what-is-cities-and-memory-about/.
Yelmi, Pinar. "Protecting Contemporary Cultural Soundscapes as Intangible Cultural Heritage: Sounds of Istanbul." International Journal of Heritage Studies 22, no. 4 (2016): 302-3011. https://doi.org/10.1080/13527258.2016.1138237.
Yelmi, Pιnar, Hüseyin Kuşcu, and Asιm Evren Yantaç. "Towards a Sustainable Crowdsourced Sound Heritage Archive by Public Participation: The Soundsslike Project." 9th Nordic Conference on Human-Computer Interaction, Gothenburg, Sweden, Association for Computing Machinery, New York, 2016.
Yin, Yuwei, Jean Kaddour, Xiang Zhang, Yixin Nie, Zhenguang Liu, Lingpeng Kong, and Qi Liu. "Ttida: Controllable Generative Data Augmentation Via Text-to-Text and Text-to-Image Models." arXiv (Cornell University) (2023). https://doi.org/10.48550/arXiv.2304.08821.
"Pytorch-Cyclegan-and-Pix2pix." GitHub, Updated 22 March 2024, accessed 4 January 2025, https://github.com/junyanz/pytorch-CycleGAN-and-pix2pix/tree/master.
Zhu, Jun-Yan, Taesung Park, Phillip Isola, and Alexei A. Efros. "Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks." arXiv (Cornell University) (2017). https://doi.org/10.48550/arxiv.1703.10593.
Zinnen, Mathias, Prathmesh Madhu, Ronak Kosti, Peter Bellz, Andreas Maier, and Vincent Christlein. "Odor: The Icpr2022 Odeuropa Challenge on Olfactory Object Recognition." 26th International Conference on Pattern Recognition (ICPR), Montreal, QC, Canada, 2022."Add Fasterrcnn Improved Weights #5763." GitHub, accessed 10 April 2025, https://github.com/pytorch/vision/pull/5763.
"Add Retinanet Improved Weights #5756." GitHub, accessed 10 April 2025, https://github.com/pytorch/vision/pull/5756.
Ali, Momina Liaqat, and Zhou Zhang. "The Yolo Framework: A Comprehensive Review of Evolution, Applications, and Benchmarks in Object Detection." Computers 13, 12 (2024). https://doi.org/10.3390/computers13120336.
Andres, Alain, Aitor Martinez-Seras, Ibai Laña, and Javier Del Ser. "On the Black-Box Explainability of Object Detection Models for Safe and Trustworthy Industrial Applications." Results in Engineering 24 (2024). https://doi.org/10.1016/j.rineng.2024.103498.
"Artificial Intelligence and Copyright: Use of Generative Ai Tools to Develop New Content." European Commission, 2024, accessed 4 April 2025, https://intellectual-property-helpdesk.ec.europa.eu/news-events/news/ar….
"Automatically Label Image Data." Roboflow, accessed 14 November 2024, https://roboflow.com/auto-label.
Bai, Jianwei. "Ancient Chinese Painting Style Transfer Based on Cyclegan." Applied and Computational Engineering 51, no. 1 (2024): 129-36. https://doi.org/10.54254/2755-2721/51/20241192.
105
Bender, Ana, Manuela Guerreiro, Dora Agapito, Bernardete Dias Sequeira, and Júlio Mendes. "Sensory Experiences in Heritage Contexts: A Qualitative Approach.". European Journal of Tourism Research 36 (2024). https://doi.org/10.54055/ejtr.v36i.3060.
Bengamra, Siwar, Olfa Mzoughi, André Bigand, and Ezzeddine Zagrouba. "A Comprehensive Survey on Object Detection in Visual Art: Taxonomy and Challenge." Multimedia Tools and Applications 83, no. 5 (2024): 14637–70. https://doi.org/10.1007/s11042-023-15968-9.
Berardi, Umberto, Gino Iannace, and Luigi Maffei. "Virtual Reconstruction of the Historical Acoustics of the Odeon of Pompeii." Journal of Cultural Heritage 19 (2016): 555-66. https://doi.org/10.1016/j.culher.2015.12.004.
"Guide to Generative Adversarial Networks (Gans) in 2025." viso.ai, Updated 1 October 2024, 2024, accessed 24 December 2024, https://viso.ai/deep-learning/generative-adversarial-networks-gan/.
"Bretez." accessed 10 April 2025, https://sites.google.com/site/louisbretez/accueil.
Briatore, Samuele. "Immaginare I Suoni. Ricostruzione Del Paesaggio Sonoro Della Festa Barocca." Arti dello Spettacolo / Performing Arts 3, no. 3 (2017).
Capurro, Carlotta, and Gertjan Plets. "Europeana, Edm, and the Europeanisation of Cultural Heritage Institutions." Digital Culture & Society 6, no. 2 (2020): 163-90. https://doi.org/10.14361/dcs-2020-0209.
Carneiro, Tiago, Raul Victor Medeiros Da NóBrega, Thiago Nepomuceno, Gui-Bin Bian, Victor Hugo C. De Albuquerque, and Pedro Pedrosa Rebouças Filho. "Performance Analysis of Google Colaboratory as a Tool for Accelerating Deep Learning Applications." IEEE Access 6 (2018): 61677-85. https://doi.org/10.1109/ACCESS.2018.2874767.
Chen, Yen-Chia, Hiroki Shibata, Lieu-Hen Chen, and Yasufumi Takama. "Synthesis of Comic-Style Portraits Using Combination of Cyclegan and Pix2pix." Journal of Advanced Computational Intelligence and Intelligent Informatics 28, no. 5 (2024): 1085-94. https://doi.org/10.20965/jaciii.2024.p1085.
Crasto, Nieves. "Class Imbalance in Object Detection: An Experimental Diagnosis and Study of Mitigation Strategies." arXiv (Cornell University) (2024). https://doi.org/10.48550/arXiv.2403.07113.
"Generative Adversarial Networks." Medium, Updated 30 October 2023, 2023, accessed 24 December 2024, https://medium.com/@marcodelpra/generative-adversarial-networks-dba10e1….
"Soundscape Archaeology: A Visit to Paris in the Mid 18th/Early 19th Century." Fondation Napoléon, 2020, accessed 10 April 2025, https://www.napoleon.org/en/history-of-the-two-empires/videos/soundscap….
Doan, Anh-Dzung, Bach Long Nguyen, Surabhi Gupta, Ian Reid, Markus Wagner, and Tat-Jun Chin. "Assessing Domain Gap for Continual Domain Adaptation in Object Detection." Computer Vision and Image Understanding 238 (2024). https://doi.org/10.1016/j.cviu.2023.103885.
"Erfgoed Brugge." Erfgoed Brugge, accessed 10 October 2024, https://erfgoedbrugge.be/.
"Erfgoedbank Brussel." Erfgoedbank Brussel, accessed 10 October 2024, https://erfgoedbankbrussel.be/.
"Europeana." Europeana, accessed 10 October 2024, https://www.europeana.eu/nl.
"Explore Ultralytics Yolov8." Ultralytics, accessed 10 April 2025, https://docs.ultralytics.com/models/yolov8/.
"Fasterrcnn_Resnet50_Fpn_V2." PyTorch, accessed 10 January 2025, https://pytorch.org/vision/main/models/generated/torchvision.models.det….
"Felixarchief." Stad Antwerpen, accessed 10 October 2024, https://felixarchief.antwerpen.be/.
"Fine-Tuning." Ultralytics, accessed 10 April 2025, https://www.ultralytics.com/glossary/fine-tuning.
Fiorucci, Marco , Marina Khoroshiltseva, Massimiliano Pontil, Arianna Traviglia, Alessio Del Bue, and Stuart James. "Machine Learning for Cultural Heritage: A Survey." Pattern Recognition Letters 133 (2020): 102-08. https://doi.org/10.1016/j.patrec.2020.02.017.
Firat, Hasan Baran. "Acoustics as Tangible Heritage: Re-Embodying the Sensory Heritage in the Boundless Reign of Sight." Preservation, Digital Technology & Culture 50, no. 1 (2021): 3-14. https://doi.org/10.1515/pdtc-2020-0028.
106
Firat, Hasan Baran, and Luigi Maffei. "A Methodology for the Historically Informed Soundscape." Inter-.Noise 2020, e-congress, 2020.
Firat, Hasan Baran, Luigi Maffei, and Massimiliano Masullo. "Digital Humanities in the Historical Soundscape Research: Sounds of 18th Century Naples." The Acoustics of Ancient Theatres, Verona, Italy, 2022.
"Fotografisch Geheugen: De Reportages Van Fotopersbureau De Boer." VeleHanden, accessed 22 April 2025, https://velehanden.nl/projecten/bekijk/details/project/ranh_tagselectio….
Galvano, Francesco. "The Triad of Senses, Emotions, and Memory: Dynamic Interactions and Multidisciplinary Implications." (2015): 1-12.
"Launch: Synthetic Image Generation with Dall-E and Gpt-4 Vision." Roboflow, 2023, accessed 3 May 2025, https://blog.roboflow.com/synthetic-data-dall-e-roboflow/.
Ultralytics Yolov8.
Goodfellow, Ian, Jean Pouget-Abadie, Mehd Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. "Generative Adversarial Networks." arXiv (Cornell University) (2014): 1-9. https://doi.org/10.48550/arxiv.1406.2661.
"Google Colaboratory." Google, accessed 14 November 2024, https://colab.google/#:~:text=Google%20Colaboratory,%2C%20data%20scienc….
Graybill, Rhiannon. "‘Hear and Give Ear!’: The Soundscape of Jeremiah." Journal for the Study of the Old Testament 40, no. 4 (2016): 467-90. https://doi.org/10.1177/0309089216628414.
Hatir, Ergün, Korkanç Mustafa, Andreas Schacner, and İsmail İnce. "The Deep Learning Method Applied to the Detection and Mapping of Stone Deterioration in Open-Air Sanctuaries of the Hittite Period in Anatolia." Journal of Cultural Heritage 51 (2021): 37-49. https://doi.org/10.1016/j.culher.2021.07.004.
"Hearing the Past: Reconstructing the Aural Heritage of Antwerp in the 19th Century." University of Antwerp, accessed 10 April 2025, https://www.uantwerpen.be/en/projects/hearing-the-past/about/.
Hindarto, Djarot. "Revolution in Image Data Collection: Cyclegan as a Dataset Generator." Sinkron : Jurnal Dan Penelitian Teknik Informatika 8, no. 1 (2024): 444-54. https://doi.org/10.33395/sinkron.v9i1.13211.
Howes, David. "Introduction to Sensory Museology." The Senses and Society 9, no. 3 (2014): 259-67. https://doi.org/10.2752/174589314X14023847039917.
Ibrahim, Bekkouch Imad Eddine, Victoria Eyharabide, Valérie Le Page, and Frédéric Billiet. "Few-Shot Object Detection: Application to Medieval Musicological Studies." Journal of Imaging 8, no. 18 (2022). https://doi.org/10.3390/jimaging8020018.
Jabir, Brahim, Noureddine Falir, and Khalid Rahmani. "Accuracy and Efficiency Comparison of Object Detection Open-Source Models." International Journal of Online and Biomedical Engineering 15, no. 5 (2021): 165-83. https://doi.org/10.3991/ijoe.v17i05.21833.
Kadish, David, Sebastian Risi, and Anders Sundess Løvlie. "Improving Object Detection in Art Images Using Only Style Transfer." arXiv (Cornell University) (2021). https://doi.org/10.48550/arxiv.2102.06529.
Kang, Jian, Francesco Aletta, Truls T. Gjestland, Lex A. Brown, Dick Botteldooren, Brigitte Schulte-Fortkamp, Peter Lercher, et al. "Ten Questions on the Soundscapes of the Built Environment." Building and Environment 108 (2016): 284-94. https://doi.org/10.1016/j.buildenv.2016.08.011.
Karbouj, Bsher, Garabet A. Topalian-Rivas, and Jörg Krüger. "Comparative Performance Evaluation of One-Stage and Two-Stage Object Detectors for Screw Head Detection and Classification in Disassembly Processes." Procedia CIRP 122 (2024): 527-32. https://doi.org/10.1016/j.procir.2024.01.077.
Kim, Yongho, Chanjong Im, and Thomas Mandl. "Object Detection in Historical Images: Transfer Learning and Pseudo Labelling." ACM Journal on Computing and Cultural Heritage (2024): 1-15. https://doi.org/10.1145/3699963.
"Retinanet: Single-Stage Object Detector with Accuracy Focus." viso.ai, 2024, accessed 3 April 2025, https://viso.ai/deep-learning/retinanet/.
107
Kulkarni, Uday, Meena S.M., Sunil V. Gurlahosur, and Uma Mudengudi. "Classification of Cultural Heritage Sites Using Transfer Learning." IEEE Fifth International Conference on Multimedia Big Data (BigMM), Signapore, 2019.
Lee, Benjamin Charles Germain. "The “Collections as Ml Data” Checklist for Machine Learning and Cultural Heritage." Journal of the Association for Information Science and Technology 76, no. 2 (2023): 375-96. https://doi.org/10.1002/asi.24765.
Leon-Gomez, Eder Arley, Andrés Marino Álvarez-Meza, and German Castellanos-Dominguez. "Cross-Dataset Data Augmentation Using Umap for Deep Learning-Based Wind Speed Prediction." Computers 14, no. 4 (2025): 123-43. https://doi.org/10.3390/computers14040123.
"Library of Congress." Library of Congress, accessed 10 October 2024, https://www.loc.gov/.
Lin, Tsung-Yi, Priya Goyal, Ross Girshick, Kaiming He, and Piotr Dollár. "Focal Loss for Dense Object Detection." arXiv (Cornell University) (2017). https://doi.org/10.48550/arXiv.1708.02002.
Luo, Dan, Lieve Doucé, and Karin Nys. "Multisensory Museum Experience: An Integrative View and Future Research Directions." Museum Management and Curatorship (2024): 1-28. https://doi.org/10.1080/09647775.2024.2357071.
Luo, Vinson, Michael Straka, and Lucy Li. "Historical and Modern Image-to-Image Translation with Generative Adversarial Networks." (2017): 1-8.
Maiwald, Ferdinand. "Generation of a Benchmark Dataset Using Historical Photographs for Automated Evaluation of Different Feature Matching Methods." The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences 42, no. 2 (2019): 87-94. https://doi.org/10.5194/isprs-archives-XLII-2-W13-87-2019.
Malcolm-Davies, Jane. "Structuring Reconstructions: Recognising the Advantages of Interdisciplinary Data in Methodical Research." Heritage Science 11, no. 1 (2023): 1-16. https://doi.org/10.1186/s40494-023-00982-9.
"Ai ‘Opt-Outs’: Should Cultural Heritage Institutions (Dis)Allow the Mining of Cultural Heritage Data?" Europeana Pro, 2024, accessed 4 April 2025, https://pro.europeana.eu/post/ai-opt-outs-should-cultural-heritage-inst….
Monna, Fabrice, Tanguy Rolland, Anthony Denaire, Nicolas Navarro, Ludovic Granjon, Rémi Barbé, and Carmela Chateau-Smith. "Deep Learning to Detect Built Cultural Heritage from Satellite Imagery. - Spatial Distribution and Size of Vernacular Houses in Sumba, Indonesia." Journal of Cultural Heritage 52 (2021): 171-83. https://doi.org/10.1016/j.culher.2021.10.004.
Mumuni, Alhassan, and Fuseini Mumuni. "Data Augmentation: A Comprehensive Survey of Modern Approaches." Array 16 (2022). https://doi.org/10.1016/j.array.2022.100258.
Müske, Johannes. "Constructing Sonic Heritage: The Accumulation of Knowledge in the Context of Sound Archives." The Journal of Ethnology and Folkloristics 4, no. 1 (2011): 37-47.
"What Is Yolo? The Ultimate Guide [2025]." Roboflow, 2025, accessed 2 April 2025, https://blog.roboflow.com/guide-to-yolo-models/#yolov12.
Padilla, Rafael, Sergio L. Netto, and Eduardo A. B. da Silva. "A Survey on Performance Metrics for Object-Detection Algorithms." In Proceedings of the 2020 International Conference on Systems, Signals and Image Processing (Iwssip), edited by Anselmo C. Paiva, Aura Conci, Geraldo Braz Jr., João Dallyson S Almeida and Leandro A. F. Fernandes, 237-42: Institute of Computing at Fluminense Federal University (IC-UFF), 2020.
Pai, Suraj, Ibrahim Hadzic, Chinmay Rao, Ivan Zhovannik, Andre Dekker, Alberto Traverso, Stylianos Asteriadis, and Enrique Hortal. "Frequency-Domain-Based Structure Losses for Cyclegan-Based Cone-Beam Computed Tomography Translation." Sensors 23, no. 3 (2023). https://doi.org/10.3390/s23031089.
Pardoen, Mylène. "Projet Bretez: Une Pincée De Son Dans L’histoire.". Digital Studies/Le champ numérique 9, no. 1 (2019): 1-17. https://doi.org/10.16995/dscn.350.
"Restoring the Historical Sound of Paris." Hypotheses, 2016, accessed 10 April 2025, https://sms.hypotheses.org/8560.
Park, Jaihyun, David K. Han, and Hanseok Ko. "Adaptive Weighted Multi-Discriminator Cyclegan for Underwater Image Enhancement." Journal of Marine Science and Engineering 7, no. 7 (2019). https://doi.org/10.3390/jmse7070200.
108
Patra, Aditya, and Rae Crandall. "Machine Learning for Visually Impaired: Benchmarking Object Detection Models." Journal of Student Research 13, no. 2 (2024): 1-14. https://doi.org/10.47611/jsrhs.v13i2.6630.
Pietroni, Eva. "Mapping the Soundscape in Communicative Forms for Cultural Heritage: Between Realism and Symbolism." Heritage 4, no. 4 (2021): 4495-523. https://doi.org/10.3390/heritage4040248.
Poudel, Anil. "Face Recognition on Historical Photographs." Uppsala University, 2021.
"Pseudo Labeling: Leveraging the Power of Self-Supervision in Machine Learning." Medium, Updated 1 February 2024, 2024, accessed 27 December 2024, https://medium.com/@data-overload/pseudo-labeling-leveraging-the-power-….
Redmon, Joseph, Santosh Divvala, Ross Girshick, and Ali Farhadi. "You Only Look Once: Unified, Real-Time Object Detection." arXiv (Cornell University) (2015). https://doi.org/10.48550/arXiv.1506.02640.
"Regulation (Eu) 2024/1689 of the European Parliament and of the Council of 13 June 2024 Laying Down Harmonised Rules on Artificial Intelligence and Amending Regulations (Ec) No 300/2008, (Eu) No 167/2013, (Eu) No 168/2013, (Eu) 2018/858, (Eu) 2018/1139 and (Eu) 2019/2144 and Directives 2014/90/Eu, (Eu) 2016/797 and (Eu) 2020/1828 (Artificial Intelligence Act)." edited by European Union and European Council, 2024. http://data.europa.eu/eli/reg/2024/1689/oj.
Ren, Shaoqing, Kaiming He, Ross Girshick, and Jian Sun. "Faster R-Cnn: Towards Real-Time Object Detection with Region Proposal Networks." arXiv (Cornell University) (2015). https://doi.org/10.48550/arXiv.1506.01497.
"Retinanet_Resnet50_Fpn_V2." PyTorch, accessed 10 January 2025, https://pytorch.org/vision/0.20/models/generated/torchvision.models.det….
"Roboflow." Roboflow, accessed 14 November 2024, https://roboflow.com/.
"Train-Yolov8-Object-Detection-on-Custom-Dataset.Ipynb." Google Colab, accessed 24 January 2025, https://colab.research.google.com/github/roboflow-ai/notebooks/blob/mai….
Sapkota, Ranjan, and Manoj Karkee. "Generative Ai in Agriculture: Creating Image Datasets Using Dall.E's Advanced Large Language Model Capabilities." arXiv (Cornell University) (2023). https://doi.org/10.48550/arXiv.2307.08789.
Schafer, R. Murray. "Open Ears." Soundscape: The Journal of Acoustic Ecology 4, no. 2 (2003): 14-18.
———. The Soundscape: Our Sonic Environment and the Tuning of the World. New York: Alfred Knopf, Inc., 1977.
Sender Contell, Marina, Ana Planells, Ricardo Perelló Roso, Jaume Segura Garcia, and Alicia Giménez. "Virtual Acoustic Reconstruction of a Lost Church: Application to an Order of Saint Jerome Monastery in Alzira, Spain." Journal of Building Performance Simulation 11, no. 3 (2018): 369-90. https://doi.org/10.1080/19401493.2017.1340975.
Shin, Jeongmin, and Hyeryung Jang. "Data Augmentation Techniques Using Text-to-Image Diffusion Models for Enhanced Data Diversity." 2024 15th International Conference on Information and Communication Technology Convergence (ICTC), Jeju Island, Republic of Korea, IEEE, 2024.
Shorten, Connor, and Taghi M Khoshgoftaar. "A Survey on Image Data Augmentation for Deep Learning." Journal of Big Data 6, no. 1 (2019). https://doi.org/10.1186/s40537-019-0197-0.
"Smell Heritage – Sensory Mining." accessed 30 March 2025, https://odeuropa.eu/.
Sohan, Mupparaju, Thotakura SaiRam, and Ch. Venkata RamiReddy. "A Review on Yolov8 and Its Advancements." In Data Intelligence and Cognitive Informatics. Proceedings of Icdici 2023., edited by I. Jeena Jacob, Selwyn Piramuthu and Przemyslaw Falkowski-Gilski. Algorithms for Intelligent Systems, 529-45. Singapore: Springer Singapore, 2024.
"Sonic Heritage - Exploring the Sounds of the World's Most Famous Sights." Cities and Memory, Oxford, accessed 31 March 2025, https://citiesandmemory.com/heritage/.
"The Soundscape of Istanbul Collection - About Collection." Koç University, accessed 31 March 2025, https://librarydigitalcollections.ku.edu.tr/en/collection/soundscape-of….
109
Terven, Juan, Diana-Margarita Córdova-Esparza, and Julio-Alejandro Romero-González. "A Comprehensive Review of Yolo Architectures in Computer Vision: From Yolov1 to Yolov8 and Yolo-Nas." Machine Learning and Knowledge Extraction 5, no. 4 (2023): 1680-716. https://doi.org/10.3390/make5040083.
Tong, Zhaoxiang. "Exploring the Impact of Hyperparameters on the Generation Quality of Cyclegan." Transactions on Computer Science and Intelligent Systems Research 5 (2024): 265-71. https://doi.org/10.62051/01m93a63.
"Transfer Learning with Frozen Layers in Yolov5." Ultralytics, accessed 10 April 2025, https://docs.ultralytics.com/yolov5/tutorials/transfer_learning_with_fr….
Ulmer, Maximilian, Leonard Klüpfel, Maximilian Durner, and Rudolph Triebel. "How Important Are Data Augmentations to Close the Domain Gap for Object Detection in Orbit?". arXiv (Cornell University) (2024). https://doi.org/10.48550/arXiv.2410.15766.
"Ultralytics Yolo Hyperparameter Tuning Guide." Ultralytics, accessed 24 April 2025, https://docs.ultralytics.com/guides/hyperparameter-tuning/.
UNESCO. "39 C/49 the Importance of Sound in Today's World: Promoting Best Practices." General Conference, 39th session, Paris 2017.
———. Basic Texts of the 2003 Convention for the Safeguarding of the Intangible Cultural Heritage. 2022 ed. Paris: France, 2022.
"Unsplash." Unsplash, accessed 10 October 2024, https://unsplash.com/.
"All You Need to Know About Explainable Ai (Xai)." Ultralytics, 2024, accessed 17 April 2025, https://www.ultralytics.com/blog/all-you-need-to-know-about-explainable….
Wei, Jian, Qinzhao Wang, and Zixu Zhao. "Yolo-G: Improved Yolo for Cross-Domain Object Detection." PLoS ONE 18, no. 9 (2023). https://doi.org/10.1371/journal.pone.0291241.
Wevers, Melvin. "Scene Detection in De Boer Historical Photo Collection." In Proceedings of the 13th International Conference on Agents and Artificial Intelligence, edited by Ana Paula Rocha, Luc Steels and Jaap Van den Herik, 301-610. Cham: Springer, 2021.
"What Is Cities and Memory?" Cities and Memory, Oxford, accessed 31 March 2025, https://citiesandmemory.com/what-is-cities-and-memory-about/.
Yelmi, Pinar. "Protecting Contemporary Cultural Soundscapes as Intangible Cultural Heritage: Sounds of Istanbul." International Journal of Heritage Studies 22, no. 4 (2016): 302-3011. https://doi.org/10.1080/13527258.2016.1138237.
Yelmi, Pιnar, Hüseyin Kuşcu, and Asιm Evren Yantaç. "Towards a Sustainable Crowdsourced Sound Heritage Archive by Public Participation: The Soundsslike Project." 9th Nordic Conference on Human-Computer Interaction, Gothenburg, Sweden, Association for Computing Machinery, New York, 2016.
Yin, Yuwei, Jean Kaddour, Xiang Zhang, Yixin Nie, Zhenguang Liu, Lingpeng Kong, and Qi Liu. "Ttida: Controllable Generative Data Augmentation Via Text-to-Text and Text-to-Image Models." arXiv (Cornell University) (2023). https://doi.org/10.48550/arXiv.2304.08821.
"Pytorch-Cyclegan-and-Pix2pix." GitHub, Updated 22 March 2024, accessed 4 January 2025, https://github.com/junyanz/pytorch-CycleGAN-and-pix2pix/tree/master.
Zhu, Jun-Yan, Taesung Park, Phillip Isola, and Alexei A. Efros. "Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks." arXiv (Cornell University) (2017). https://doi.org/10.48550/arxiv.1703.10593.
Zinnen, Mathias, Prathmesh Madhu, Ronak Kosti, Peter Bellz, Andreas Maier, and Vincent Christlein. "Odor: The Icpr2022 Odeuropa Challenge on Olfactory Object Recognition." 26th International Conference on Pattern Recognition (ICPR), Montreal, QC, Canada, 2022.