GEO-Bench-2 is a large-scale, capability-aware benchmark that evaluates Geospatial Foundation Models by fine-tuning them across diverse sensing modalities, temporal contexts, and downstream applications. It emphasizes open licensing, reproducibility, and capability-specific evaluation, enabling the community to measure progress in perception, reasoning, and generalization within the geospatial domain.
For zero-shot evaluation of Geospatial Vision-Language Models, please refer to our complementary benchmark, GEO-Bench-VLM.