ABSTRACT
Passive acoustic monitoring (PAM) through acoustic recorder units (ARUs) shows promise in detecting early landscape changes linked to functional and structural patterns, including species richness, acoustic diversity, community interactions, and human-induced threats. However, current approaches primarily rely on supervised methods, which require prior knowledge of collected datasets. This reliance poses challenges due to the large volumes of ARU data. In this work, we propose a non-supervised framework using autoencoders to extract soundscape features. We applied this framework to a dataset from Colombian landscapes captured by 31 audiomoth recorders. Our method generates clusters based on autoencoder features and represents cluster information with prototype spectrograms using centroid features and the decoder part of the neural network. Our analysis provides valuable insights into the distribution and temporal patterns of various sound compositions within the study area. By utilizing autoencoders, we identify significant soundscape patterns characterized by recurring and intense sound types across multiple frequency ranges. This comprehensive understanding of the study area's soundscape allows us to pinpoint crucial sound sources and gain deeper insights into its acoustic environment. Our results encourage further exploration of unsupervised algorithms in soundscape analysis as a promising alternative path for understanding and monitoring environmental changes.
ABSTRACT
Soundscape ecology is a promising area that studies landscape patterns based on their acoustic composition. It focuses on the distribution of biotic and abiotic sounds at different frequencies of the landscape acoustic attribute and the relationship of said sounds with ecosystem health metrics and indicators (e.g., species richness, acoustic biodiversity, vectors of structural change, gradients of vegetation cover, landscape connectivity, and temporal and spatial characteristics). To conduct such studies, researchers analyze recordings from Acoustic Recording Units (ARUs). The increasing use of ARUs and their capacity to record hours of audio for months at a time have created a need for automatic processing methods to reduce time consumption, correlate variables implicit in the recordings, extract features, and characterize sound patterns related to landscape attributes. Consequently, traditional machine learning methods have been commonly used to process data on different characteristics of soundscapes, mainly the presence-absence of species. In addition, it has been employed for call segmentation, species identification, and sound source clustering. However, some authors highlight the importance of the new approaches that use unsupervised deep learning methods to improve the results and diversify the assessed attributes. In this paper, we present a systematic review of machine learning methods used in the field of ecoacoustics for data processing. It includes recent trends, such as semi-supervised and unsupervised deep learning methods. Moreover, it maintains the format found in the reviewed papers. First, we describe the ARUs employed in the papers analyzed, their configuration, and the study sites where the datasets were collected. Then, we provide an ecological justification that relates acoustic monitoring to landscape features. Subsequently, we explain the machine learning methods followed to assess various landscape attributes. The results show a trend towards label-free methods that can process the large volumes of data gathered in recent years. Finally, we discuss the need to adopt methods with a machine learning approach in other biological dimensions of landscapes.
ABSTRACT
Anthropogenic noise is increasingly disturbing natural soundscapes and affecting the physiology, behavior, and fitness of wildlife. However, our knowledge about the impact of anthropogenic noise on wild primates is scant. Here, we assess the effects of anthropogenic noise on the behavior of male mantled howler monkeys (Alouatta palliata). Specifically, we describe the types, rates, and sound pressure level (SPL) of anthropogenic noise that occurs in areas inhabited by mantled howler monkeys and determine if the behavioral responses of males to anthropogenic noise are influenced by noise attributes. For 1 year (1753 h), we characterized anthropogenic noise in the Los Tuxtlas Biosphere Reserve (Veracruz, Mexico) and studied the behavior of males belonging to five groups. Anthropogenic noise was common, diverse, and varied among areas in terms of rate, type, and SPL. Males did not display behavioral responses toward most (60%) anthropogenic noises, but were more likely to respond to certain noise types (e.g., aerial traffic) and toward noise with high SPL. Group identity influenced the likelihood of displaying behavioral responses to noise. The most common behavioral responses were vocalizations and vigilance. Males vocalized in response to noise with high SPL, although this relationship depended on group identity. The effect of the number of noises on vocalizations also varied among groups. Males were more likely to display vigilance toward high SPL and infrequent noise, but, again, these relationships varied among groups. In sum, anthropogenic noise is pervasive in areas inhabited by mantled howler monkeys and influences male behavior. Experience and frequency of exposure may modulate the behavioral responses of male mantled howler monkeys to noise and explain the group differences.