Spatial Clustering for Species Distribution Modeling
Published:
Citizen science biodiversity data can span large extents of space and time, but usually lack the structure required for building species distribution models which account for imperfect detection, i.e., occupancy models. Existing approaches of introducing this structure either throw away too much data due to strict definitions of sites and/or do not account for similarity in environmental feature space, leading to weaker downstream occupancy models. Spatial clustering algortihms from machine learning literature offer lucrative advantages over these existing approaches.