Cover Story: Improving OpenStreetMap missing building detection using few-shot transfer learning in sub-Saharan Africa

Cover image of Transactions in GIS

Volume 26, Issue 8 of Transactions in GIS, published in December 2022, has a cover story and it’s us- well, our article anyway. Besides uniting our love of math, mapping, and Mozambique, we’re hoping to use our moment of fame to explain our proposed solution to the issue of missing mapping data in the Global South. Read on for our inspiration, our method, and even an autograph (if you ask nicely).

Filling in the Blanks

What do you do when you have the tools, the motivation, and the need, but not the data?

In an innovative paper first published in Transactions in GIS in May 2022, a collaborative team of current and former researchers from HeiGIT and GIScience including Hao LiBenjamin HerfortSven LautenbachJiaoyan Chen, and Alexander Zipf apply a novel few-shot transfer learning method (FSTL) to the problem of improving humanitarian mapping workflows in the Global South.

Despite the recent much-needed leaps forward in data availability in the developing world from humanitarian mapping campaigns, many large rural areas remain completely unmapped, hindering disaster routing and other disaster preparation and response efforts. The criticality of such routing, preparation, and response efforts can be seen from our work with last year’s floods in Pakistan, the global coverage of our Open Healthcare Access map, and our publication assessing the damage done by Cyclone Idai.

Our team’s bid to tackle the speed and efficiency concerns for these workflows is proposing the FSTL method to increase the accuracy of OSM missing building detection. Even with only one training shot, the approach already demonstrated considerable success.

Figure 3 from paper: Maps of study areas: (1) the Tanzania training area for the base model (well-mapped in OSM); and (2)-(3) the target areas in Cameroon and Mozambique (completely missing in OSM).
Figure 3 from paper: Maps of study areas: (1) the Tanzania training area for the base model (well-mapped in OSM); and (2)-(3) the target areas in Cameroon and Mozambique (completely missing in OSM).

The Nitty-Gritty

The team trained two popular object detection models (Faster R-CNN and SSD) in a training area in Tanzania and transferred the model to target areas in Cameroon and Mozambique. The FSTL method significantly improved the base model performance even with only one training shot.

They also successfully produced a grid-based OSM missing building map (DeepVGI) of 10 m spatial resolution with over 96% overall accuracy (ACC) and 0.85 Matthews correlation coefficient (MCC) in both Cameroon and Mozambique. Such maps show great potential to assess and estimate the overall completeness of OSM buildings to support humanitarian mapping activities, especially in places where other (e.g., buildings, roads) datasets are not available.

Figure 4 from paper: Instance-based building detection results in Cameroon. SSD: (a) the base model; (c) the FSTL model with one shot; (e) the FSTL model with 50 shots. Faster R-CNN: (b) the base model; (d) the FSTL model with one shot; and (f) the FSTL model with 50 shots. TP refers to correctly detected building instances; FP refers to building instances detected by ML models which did not exist in the reference layer; FN denotes those building instances mapped in the reference layer but not captured by our ML models
Figure 4 from paper: Instance-based building detection results in Cameroon. SSD: (a) the base model; (c) the FSTL model with one shot; (e) the FSTL model with 50 shots. Faster R-CNN: (b) the base model; (d) the FSTL model with one shot; and (f) the FSTL model with 50 shots. TP refers to correctly detected building instances; FP refers to building instances detected by ML models which did not exist in the reference layer; FN denotes those building instances mapped in the reference layer but not captured by our ML models.

 

OSM missing building maps (DeepVGI) produced by the proposed FSTL method showed superior mapping accuracies in both target areas. The proposed FSTL method required only limited training shots (e.g., only one shot) to achieve a fast and remarkable accuracy improvement. The proposed FSTL method of reusing a building detection model trained on remote areas provides a promising solution to existing challenges of humanitarian mapping.

FIGURE 5 from paper: Instance-based building detection results in Mozambique. SSD: (a) the base model; (c) the FSTL model with one shot; (e) the FSTL model with 50 shots. Faster R-CNN: (b) the base model; (d) the FSTL model with one shot; (f) the FSTL model with 50 shots; and (g) the Google Open Building layer. TP refers to correctly detected building instances; FP refers to building instances detected by ML models which did not exist in the reference layer; FN denotes those building instances mapped in the reference layer but not captured by our ML models
Figure 5 from paper: Instance-based building detection results in Mozambique. SSD: (a) the base model; (c) the FSTL model with one shot; (e) the FSTL model with 50 shots. Faster R-CNN: (b) the base model; (d) the FSTL model with one shot; (f) the FSTL model with 50 shots; and (g) the Google Open Building layer. TP refers to correctly detected building instances; FP refers to building instances detected by ML models which did not exist in the reference layer; FN denotes those building instances mapped in the reference layer but not captured by our ML models.

Current Limitations

The team identified three limitations for this case study.

  1. The current work focused on areas where OSM buildings were completely missing and future research is needed to validate the proposed method in areas where buildings are partly mapped in OSM. In those partly mapped area, one could directly consider existing OSM buildings as the FSTL training shots and seek to assess the relative completeness of OSM using a similar DeepVGI layer.

  2. As the proposed method was implemented and validated only in sub-Saharan African, a global overview of such a method applied in different geographically remote rural areas (e.g., urban contexts, climate, vegetation, settlement types) is still missing.

  3. Both the current Faster R-CNN and the SSD model only detect bounding boxes of buildings but not their footprints. Further research is encouraged to extend the FSTL method toward the automatic extraction of building footprints in vector formats, for example, using the Mask R-CNN architecture (He et al., 2017) and with post-processing similar as (Sirko et al., 2021).

 

Future Research Avenues

To address the aforementioned limitations, future works are encouraged to adopt or extend the FSTL method proposed in this article and further investigate several research directions.

  1. Besides assessing the relative completeness, the availability of ML-generated layers (e.g., DeepVGI) opens new opportunities of revisiting classic OSM extrinsic quality assessment methods (Hagenauer & Helbich, 2012; Hecht et al., 2013), which commonly rely on external reference datasets.

  2. Thanks to the contribution of volunteers, the large amount of existing finished humanitarian mapping projects worldwide, for instance from MapSwipe App and HOT Tasking Manager (Herfort et al., 2021), provides a great overview of vulnerable rural areas with a pressing need of mapping in OSM. One can think about scaling up the FSTL method to these remote areas and further investigating the impact of different types (or levels) of urban contexts on the model performance.

  3. By thinking about how ML methods can best benefit OSM, especially in rural areas, recent discussion in Mooney & Galvan (2021) and Vargas-Muñoz et al. (2020) shared important insights from either remote sensing or OSM community perspectives. As an intuitive application, one can apply the FSTL method to estimate the overall time and effort required to map all OSM missing buildings, then take these factors into consideration when organizing humanitarian mapping campaigns and volunteer contributions.

  4. The previous work in Herfort et al. (2019) reported an inspiring finding that advanced ML algorithms can potentially improve the accuracy and accelerate the speed of current humanitarian mapping method using the MapSwipe App. Therefore, another potential application of the FSTL method could be assisting human mappers in rural areas by giving them mapping hints based on machine-generated results, such as an early attempt with the RapiD tool from Facebook (RapiD, 2019).

In this context, the team aimed to emphasize the necessity of machine-assisted mapping with the lessons learned in this article. Our future work will seek to develop intelligent OSM mapping workflows by keeping humans and machines in a loop.

In A Sentence

In conclusion, the lessons learned in this article sheds light on facilitating and accelerating current humanitarian mapping workflow by efficiently reusing existing ML models via few-shot learning and accurately detecting OSM missing buildings in rural sub-Saharan Africa areas.

Figure 7: Classification maps of OSM missing building detection in Mozambique: (a) The HRSL layer; (b) the GOB layer; (c) the Faster R-CNN base model; and (d) the Faster R-CNN FSTL model with fifty shots, so called the DeepVGI layer
Figure 7 from paper: Classification maps of OSM missing building detection in Mozambique: (a) The HRSL layer; (b) the GOB layer; (c) the Faster R-CNN base model; and (d) the Faster R-CNN FSTL model with fifty shots, so called the DeepVGI layer.

 

For full paper and all references:

 

Li, H., Herfort, B., Lautenbach, S., Chen, J. & Zipf, A. (2022). Improving OpenStreetMap missing building detection using few-shot transfer learning in sub-Saharan Africa. Transactions in GIS, 00, 1– 22. https://doi.org/10.1111/tgis.12941

Previous related work:

General Overview:

  • Yan, Y., C. Feng, W. Huang, H. Fan, Y. Wang & A. Zipf (2020): Volunteered geographic information research in the first decade: a narrative review of selected journal articles in GIScience. International Journal of Geographical Information Science, DOI: 10.1080/13658816.2020.1730848

  • Degrossi L.C., J. Porto de Albuquerque, R. dos Santos Rocha, A. Zipf (2018): A taxonomy of quality assessment methods for volunteered and crowdsourced geographic information. Transactions in GIS (TGIS). Wiley. DOI:10.1111/tgis.12329.

  • Jokar Arsanjani, J., Zipf, A., Mooney, P., Helbich, M. (Eds.)(2015): OpenStreetMap in GIScience: Experiences, Research, and Applications. Series: Lecture Notes in Geoinformation and Cartography. 2015, VII, 373 p. Sringer Science. Heidelberg, Berlin. ISBN 978-3-319-14279-1

Comments are closed.