Abstract
Traditional orangutan distribution and density monitoring requires costly line transect methods on the ground to detect their nests. Recently researchers have started to use unoccupied aerial vehicles, hereafter referred to as drones, to collect such data faster. However, manually inspecting the images acquired by the drone is time-consuming and hence costly. This study explored a deep learning method for the automated detection of orangutan nests in drone-captured aerial images, which can significantly improve the efficiency of orangutan monitoring efforts. The YOLO v10 model was trained using 868 images containing 1568 annotated orangutan nests collected from sites in Sabah, Malaysia, and Sumatra, Indonesia. Images were captured using multirotor and fixed-wing drones at varying altitudes. The model was trained using a transfer learning approach and achieved a mean Average Precision (mAP) of 0.831. The model was subsequently tested on two independent data sets with results showing a precision of 0.98 and recall of 0.88 for a multirotor drone and precision of 0.98 and a recall of 0.71 for a fixed-wing drone which has the benefit of being able to have longer duration flights. The high precision values indicate the model's accuracy in identifying true nest locations, while the recall values demonstrate its ability to detect a significant portion of the nests present in the images. The study highlights how using drones for data collection can reduce survey times compared to ground surveys, and the automation of nest detection further enhances the efficiency of drone surveys. However, the model's recall, especially for fixed-wing drone data, could be improved to ensure accurate population trend analyses. Further research should focus on expanding training data sets and refining models to account for different camera systems and environmental conditions.