6 Economic and social research questions using spatial data

In their critical appraisal of spatial econometrics, Gibbons and Overman (2012) state that:

In many (micro) economic fields—particularly development, education, environment, labor, health, and public finance—empirical work is increasingly concerned with questions about causality … If we increase an individual’s years of education, what happens to their wages? If we decrease class sizes, what happens to student grades? These questions are fundamentally of the type ‘if we change x, what do we expect to happen to y.’ Just as with economics more generally, such questions are fundamental to our understanding of spatial economics (Gibbons and Overman 2012, 172–73).

In the years preceding their description of spatial econometrics risking becoming mostly pointless, practicioners had been realising that interpretation of model output was not as simple as had been assumed. In aspatial models, there is no spatial spillover to consider, and in time-series models, only past observations can influence the present. However, in spatial autoregressive models using for example the average of neighbours’ values of the dependent variable as an explanatory variable, the coefficient of the spatially lagged dependent variable interacts with the coefficients of the independent variables (Kelejian, Tavlas, and Hondroyiannis 2006; LeSage and Fischer 2008; Ward and Gleditsch 2008; LeSage and Pace 2009). The correct interpretation of the estimated coefficients of models using spatial data is, naturally, central to exploring “‘if we change x, what do we expect to happen to y.’” See also Corrado and Fingleton (2012) for further discussion of the importance of thinking of spatial econometric models as tools for studying causality as understood at that time.

A further, associated, bundle of strands of development in spatial econometrics following 2010 also involved causality. Delgado and Florax (2015) draw on Rubin (1974); Rubin (1978) to highlight the risk posed to the stable unit treatment value assumption (SUTVA) by unmodelled spatial dependency in the data. This key assumption for causal alaysis “implies that potential outcomes for person i are unrelated to the treatment status of other individuals” (Angrist, Imbens, and Rubin 1996). The assumption and the risk of its violation when spatial data is modelled aspatially is also discussed at that time by Koschinsky (2013) and Baylis and Ham (2015).

Work by Delgado and Florax (2015) has been followed up by Bardaka, Delgado, and Florax (2018) and Bardaka, Delgado, and Florax (2019), showing practically how a difference-in-difference (DID) econometric design measuring the impact of change on a chosen dependent variable. Dubé et al. (2014) approach spatial difference-in-difference in a similar way, followed up in Dubé et al. (2017) and Dubé, AbdelHalim, and Devaux (2021). Bardaka, Delgado, and Florax (2018) express the two approaches:

Spatial DID models have been proposed by Dubé et al. (2014) and Delgado and Florax (2015); Delgado and Florax (2015) focus on violations of SUTVA in the case of spillover eﬀects local to the treatment whereas Dubé et al. (2014) focus on global eﬀects (Bardaka, Delgado, and Florax 2018, 17).

Dubé et al. (2014) and work derived from this, such as Sunak and Madlener (2016) and Diao, Leonard, and Sing (2017), has been referred to directly and indirectly by an increasing number of applied studies, including Fang (2021), Jia, Shao, and Yang (2021), Qiu and Tong (2021), Liu et al. (2022), Chen et al. (2023), D. Gao and Wang (2023), Pan et al. (2023), Yu and Jin (2023) and Zeng, Blanco-González-Tejero, and Sendra (2023). Some refer to both spatial DID origins: Chagas, Azzoni, and Almeida (2016) and Qiu and Tong (2021), while some derive from Delgado and Florax (2015) alone: Han et al. (2018) and Kosfeld et al. (2021). It is clear that the demand for assessments of the consequences of for example infrastructure investments on house prices or environmental measures is enormous, so it is likely that such studies will continue to proliferate. Both Dubé et al. (2014) and Delgado and Florax (2015) presuppose that applied researchers using their approaches are adequately trained in spatial econometrics, so that these researchers are more than familiar with the spatial extensions to aspatial econometric techniques. These spatial extensions are the core of our book.

Before continuing to present the structure of this book, it is also sensible to cover three “breaking” topics related to causality in a spatial context. The first of the “breaking” topics extends regression discontinuity designs, from discussion in Gibbons, Machin, and Silva (2013) through Calonico, Cattaneo, and Titiunik (2014) and Keele and Titiunik (2015) to Butts (2023a) and Butts (2023b), with Cattaneo and Titiunik (2022) as an up-to-date general review. While neither Dubé et al. (2014) nor Delgado and Florax (2015) appear to be directly backed by software, software packages for R, Python and Stata for work by Sebastian Calonico, Matias D. Cattaneo and Rocío Titiunik is published; ongoing work by Kyle Butts is also available.

The second two “breaking” topics relate directly to Olsson (1970), in which he addressed the question of the extent to which prediction and explanation could be seen as symmetric. Summarising his findings, Olsson asserted that:

… an adequate explanation may lead to a successful prediction, but … successful prediction is not the same as successful explanation. (Olsson 1970, 230)

This relates directly to: “‘if we change x, what do we expect to happen to y’”, which is more about explanation (and hence causality) than prediction. Many contemporary quantitative methods utilise predictive success to attempt to improve model fit, and having achieved predictive success try to re-construct the meaning of the output model in order to work back to explanation.

Secondly, three important surveys of causality in spatial data analysis have appeared recently: Kolak and Anselin (2020), B. Gao et al. (2022), and Akbari, Winter, and Tomko (2023). All of these take up spatial challenges to the stable unit treatment value assumption, Kolak and Anselin (2020) with an example of the impact of changes in minimum legal drinking age laws on mortality for US states. B. Gao et al. (2022) point to the rapid extension of spatial statistics to other knowledge domains including bioinformatics, in which causal inference is clearly important. The main use cases considered by Akbari, Winter, and Tomko (2023) are in spatial cognition, including wayfinding processes and navigation systems, because these are so much broader in impact than program evaluations. Because Akbari, Winter, and Tomko (2023) is a literature review, it does not propose methods, but describes those available. They also comment that of the minority of articles included in the review that reported what software was used, the most commonly used software was R. Only 12 percent of the articles cited code needed to reproduce their results (see also Wolf 2023). They comment:

This low rate of accessibility to code is a big challenge that not only limits reproducibility of the reviewed papers, but also affects the portability and translation of approaches to other case studies in spatial causal inference. … In sum, in most of the reviewed research, there are no clear procedures related to reproducibility and validation. We can trust more the results of papers with straightforward approaches with a sufficient level of details. (Akbari, Winter, and Tomko 2023, 79)

The final “breaking” topic is the influence of spatial autocorrelation on machine learning, statistical learning, and convolutional neural networks. Kattenborn et al. (2022) study the impact of spatial autocorrelation on the training of convolutional neural networks for data acquired by drones, and find:

Our results suggest that violating spatial independence between training and test data can severely inflate model apparent performance (up to almost 30%) and, hence, lead to an overly optimistic evaluation of the generalization of such models. (Kattenborn et al. 2022, 7)

This observation, that the violation of the assumption of spatial independence between training, validation and test data sets prejudices outcomes, has been recognised in much of spatial data science for years, at least from Brenning (2012). There is now an extensive literature both on the split between training and test data sets, and on the use of fitted models for prediction to areas that were un- or under-represented in the data used to fit the model (Meyer et al. 2018, 2019; Valavi et al. 2019; Schratz et al. 2019; Meyer and Pebesma 2021, 2022; Mila et al. 2022; Linnenbrink et al. 2023). These articles are accompanied by software permitting the reproduction of their findings and the application of suggested adaptations, replacing random permutations in machine learning model fitting and tuning by spatially-aware procedures.

Kopczewska (2022) summarises the current research position with regard to spatial data use in machine learning in this way:

It is clear from many studies that unaddressed spatial autocorrelation generates problems, such as overoptimistic fit of models, omitted information and/or biased (suboptimal) prediction. Thus, an up-to-date toolbox dealing with spatial autocorrelation should be used in all ML models in order to ensure methodological appropriateness. (Kopczewska 2022, 732)

She does, however, provide encouraging examples of the spatially-informed use of machine learning methods, and a concise overview of concepts and a listing of relevant R packages in two appendices (Kopczewska 2022, 735–49).

Wagner and Zeileis (2019) use model-based recursive partitioning handling the spatial dependencies by including the spatially lagged dependent variable in a spatial econometric model to study heterogeneous growth. Vidoli, Pignataro, and Benedetti (2022) approach heterogeneity through spatial regimes, as do Piras and Sarrias (2023); all three articles are backed by software.

Nikparvar and Thill (2021) give a broad review of machine learning methods and applications to spatial data, including attention to spatial autocorrelation, spatial scale and spatial heterogeneity. Credit (2022) considers the intersection of random forest models - a machine learning method aggregating the outout of many decision trees - and spatial econometrics models. In addition to the inclusion of the spatially lagged dependent variable (Wagner and Zeileis 2019), spatially lagged independent variables were considered. Yoshida, Murakami, and Seya (2022) compare a selection of spatially-aware machine learning approaches and a nearest-neighbour Gaussian process model for predicting apartment rents, following some of the suggestions made by Credit (2022); references to software are provided.

Consideration of machine learning and the application of deep learning/neural networks overlaps in a fair number of cases, raising similar concerns about how the probable lack of independence between proximate observations in space will be handled. Ahmed et al. (2021) stress the need for interpretability and explainability in deep learning/neural networks (also known as artificial intelligence) as well as in machine learning, and explore model agnostic greedy explanations of model predictions; references to software are provided.

Zhang et al. (2022) and Deng, He, and Liu (2023) also focus on interpretability of machine learning methods for predicting crime risk; Deng, He, and Liu (2023) provide their complete data set. Zhu et al. (2023b; minor correction Zhu et al. 2023a) propose spatial regression graph convolutional neural networks for modelling and predicting multivariate spatial data, also considering what is known as feature selection (or engineering), which is related to interpretability; references to software are provided.

Li et al. (2023), like Credit (2022), introduce the spatially lagged dependent variable (whether a manhole in an urban drainage system overflows or not) into a deep neural network model. Xiao, Song, and Wang (2023) and Wang and Song (2023) in two articles with overlapping authorships look more closely at integrating adaptations of classical spatial econometrics models - spatial autoregressive models - into deep neural network models, both making nonparametric additions to the classical models.

The literatures covering the interpretation of spatial econometric models, causality when the data used are spatial and so challenge standard econometric assumptions, and training/test set data splits affecting machine learning and artificial intelligence applications, are all burgeoning. It might be thought that the superficial mentioning of these questions in this section should direct us to focus attention in this book on emerging research opportunities. However, our reading of contributions to the current literature, taken with readings of the many other articles published since 2020 - Dubé et al. (2014) has been cited hundreds of times, mostly since 2020 - suggests that many authors would benefit substantially from a clearer grasp of the background to spatial econometric models, and many of their internal characteristcs. Hence, based in part on this motivation, we will now move to present the structure of this book.

Ahmed, Zia U., Kang Sun, Michael Shelly, and Lina Mu. 2021. “Explainable Artificial Intelligence (XAI) for Exploring Spatial Variability of Lung and Bronchus Cancer (LBC) Mortality Rates in the Contiguous USA.” Scientific Reports 11: 24090. https://doi.org/10.1038/s41598-021-03198-8.

Akbari, Kamal, Stephan Winter, and Martin Tomko. 2023. “Spatial Causality: A Systematic Review on Spatial Causal Inference.” Geographical Analysis 55 (1): 56–89. https://doi.org/10.1111/gean.12312.

Angrist, Joshua D., Guido W. Imbens, and Donald B. Rubin. 1996. “Identification of Causal Effects Using Instrumental Variables.” Journal of the American Statistical Association 91 (434): 444–72.

Bardaka, Eleni, Michael S. Delgado, and Raymond J. G. M. Florax. 2018. “Causal Identification of Transit-Induced Gentrification and Spatial Spillover Effects: The Case of the Denver Light Rail.” Journal of Transport Geography 71: 15–31. https://doi.org/10.1016/j.jtrangeo.2018.06.025.

———. 2019. “A Spatial Multiple Treatment/Multiple Outcome Difference-in-Differences Model with an Application to Urban Rail Infrastructure and Gentrification.” Transportation Research Part A: Policy and Practice 121: 325–45. https://doi.org/10.1016/j.tra.2019.01.028.

Baylis, Kathy, and Andrés Ham. 2015. “How Important Is Spatial Correlation in Randomized Controlled Trials?” https://ageconsearch.umn.edu/record/205586/.

Brenning, Alexander. 2012. “Spatial Cross-Validation and Bootstrap for the Assessment of Prediction Rules in Remote Sensing: The r Package Sperrorest.” In 2012 IEEE International Geoscience and Remote Sensing Symposium, 5372–75. https://doi.org/10.1109/IGARSS.2012.6352393.

Butts, Kyle. 2023a. “Geographic Difference-in-Discontinuities.” Applied Economics Letters 30 (5): 615–19. https://doi.org/10.1080/13504851.2021.2005236.

———. 2023b. “JUE Insight: Difference-in-Differences with Geocoded Microdata.” Journal of Urban Economics 133: 103493. https://doi.org/10.1016/j.jue.2022.103493.

Calonico, Sebastian, Matias D. Cattaneo, and Rocio Titiunik. 2014. “Robust Nonparametric Confidence Intervals for Regression-Discontinuity Designs.” Econometrica 82 (6): 2295–2326. https://doi.org/10.3982/ECTA11757.

Cattaneo, Matias D., and Rocı́o Titiunik. 2022. “Regression Discontinuity Designs.” Annual Review of Economics 14 (1): 821–51. https://doi.org/10.1146/annurev-economics-051520-021409.

Chagas, André L. S., Carlos R. Azzoni, and Alexandre N. Almeida. 2016. “A Spatial Difference-in-Differences Analysis of the Impact of Sugarcane Production on Respiratory Diseases.” Regional Science and Urban Economics 59: 24–36. https://doi.org/10.1016/j.regsciurbeco.2016.04.002.

Chen, Jinyu, Wenjing Luo, Xiaohang Ren, and Tianqi Liu. 2023. “The Local-Neighborhood Effects of Low-Carbon City Pilots Program on PM2.5 in China: A Spatial Difference-in-Differences Analysis.” Science of The Total Environment 857: 159511. https://doi.org/10.1016/j.scitotenv.2022.159511.

Corrado, Luisa, and Bernard Fingleton. 2012. “Where Is the Economics in Spatial Econometrics?” Journal of Regional Science 52: 210–39.

Credit, Kevin. 2022. “Spatial Models or Random Forest? Evaluating the Use of Spatially Explicit Machine Learning Methods to Predict Employment Density Around New Transit Stations in Los Angeles.” Geographical Analysis 54 (1): 58–83. https://doi.org/10.1111/gean.12273.

Delgado, Michael S., and Raymond J. G. M. Florax. 2015. “Difference-in-Differences Techniques for Spatial Data: Local Autocorrelation and Spatial Interaction.” Economics Letters 137: 123–26. https://doi.org/10.1016/j.econlet.2015.10.035.

Deng, Yue, Rixing He, and Yang Liu. 2023. “Crime Risk Prediction Incorporating Geographical Spatiotemporal Dependency into Machine Learning Models.” Information Sciences 646: 119414. https://doi.org/10.1016/j.ins.2023.119414.

Diao, Mi, Delon Leonard, and Tien Foo Sing. 2017. “Spatial-Difference-in-Differences Models for Impact of New Mass Rapid Transit Line on Private Housing Values.” Regional Science and Urban Economics 67: 64–77. https://doi.org/10.1016/j.regsciurbeco.2017.08.006.

Dubé, Jean, Maha AbdelHalim, and Nicolas Devaux. 2021. “Evaluating the Impact of Floods on Housing Price Using a Spatial Matching Difference-in-Differences (SM-DID) Approach.” Sustainability 13 (2). https://doi.org/10.3390/su13020804.

Dubé, Jean, Diègo Legros, Marius Thériault, and François Des Rosiers. 2014. “A Spatial Difference-in-Differences Estimator to Evaluate the Effect of Change in Public Mass Transit Systems on House Prices.” Transportation Research Part B: Methodological 64: 24–40. https://doi.org/10.1016/j.trb.2014.02.007.

———. 2017. “Measuring and Interpreting Urban Externalities in Real-Estate Data: A Spatio-Temporal Difference-in-Differences (STDID) Estimator.” Buildings 7 (2). https://doi.org/10.3390/buildings7020051.

Fang, Jing. 2021. “Impacts of High-Speed Rail on Urban Smog Pollution in China: A Spatial Difference-in-Difference Approach.” Science of The Total Environment 777: 146153. https://doi.org/10.1016/j.scitotenv.2021.146153.

Gao, Bingbo, Jinfeng Wang, Alfred Stein, and Ziyue Chen. 2022. “Causal Inference in Spatial Statistics.” Spatial Statistics 50: 100621. https://doi.org/10.1016/j.spasta.2022.100621.

Gao, Da, and Guimei Wang. 2023. “Does the Opening of High-Speed Rails Improve Urban Carbon Efficiency? Evidence from a Spatial Difference-in-Difference Method.” Environmental Science and Pollution Research 30: 101873–87.

Gibbons, Stephen, Stephen Machin, and Olmo Silva. 2013. “Valuing School Quality Using Boundary Discontinuities.” Journal of Urban Economics 75: 15–28.

Gibbons, Stephen, and Henry G. Overman. 2012. “Mostly Pointless Spatial Econometrics?” Journal of Regional Science 52: 172–91.

Han, Mengjie, Oana Mihaescu, Yujiao Li, and Niklas Rudholm. 2018. “Comparison and One-Stop Shopping After Big-Box Retail Entry: A Spatial Difference-in-Difference Analysis.” Journal of Retailing and Consumer Services 40: 175–87. https://doi.org/10.1016/j.jretconser.2017.10.003.

Jia, Ruining, Shuai Shao, and Lili Yang. 2021. “High-Speed Rail and CO2 Emissions in Urban China: A Spatial Difference-in-Differences Approach.” Energy Economics 99: 105271. https://doi.org/10.1016/j.eneco.2021.105271.

Kattenborn, Teja, Felix Schiefer, Julian Frey, Hannes Feilhauer, Miguel D. Mahecha, and Carsten F. Dormann. 2022. “Spatially Autocorrelated Training and Validation Samples Inflate Performance Assessment of Convolutional Neural Networks.” ISPRS Open Journal of Photogrammetry and Remote Sensing 5: 100018. https://doi.org/10.1016/j.ophoto.2022.100018.

Keele, Luke J., and Rocío Titiunik. 2015. “Geographic Boundaries as Regression Discontinuities.” Political Analysis 23 (1): 127–55. https://doi.org/10.1093/pan/mpu014.

Kelejian, Harry H., G. Tavlas, and G. Hondroyiannis. 2006. “A Spatial Modelling Approach to Contagion Among Emerging Economies.” Open Economies Review 17: 423–41.

Kolak, Marynia, and Luc Anselin. 2020. “A Spatial Perspective on the Econometrics of Program Evaluation.” International Regional Science Review 43 (1-2): 128–53. https://doi.org/10.1177/0160017619869781.

Kopczewska, Katarzyna. 2022. “Spatial Machine Learning: New Opportunities for Regional Science.” The Annals of Regional Science 68 (3): 713–55. https://doi.org/10.1007/s00168-021-01101-x.

Koschinsky, Julia. 2013. “The Case for Spatial Analysis in Evaluation to Reduce Health Inequities.” Evaluation and Program Planning 36 (1): 172–76. https://doi.org/10.1016/j.evalprogplan.2012.03.004.

Kosfeld, Reinhold, Timo Mitze, Johannes Rode, and Klaus Wälde. 2021. “The Covid-19 Containment Effects of Public Health Measures: A Spatial Difference-in-Differences Approach.” Journal of Regional Science 61 (4): 799–825. https://doi.org/10.1111/jors.12536.

LeSage, James P., and Manfred M. Fischer. 2008. “Spatial Growth Regression: Model Specification, Estimation and Interpretation.” Spatial Economic Analysis 3: 275–304.

LeSage, James P., and R. Kelley Pace. 2009. Introduction to Spatial Econometrics. Boca Raton FL: Chapman; Hall/CRC.

Li, Heng, Chunxiao Zhang, Min Chen, Dingtao Shen, and Yunyun Niu. 2023. “Data-Driven Surrogate Modeling: Introducing Spatial Lag to Consider Spatial Autocorrelation of Flooding Within Urban Drainage Systems.” Environmental Modelling & Software 161: 105623. https://doi.org/10.1016/j.envsoft.2023.105623.

Linnenbrink, J., C. Milà, M. Ludwig, and H. Meyer. 2023. “kNNDM: K-Fold Nearest Neighbour Distance Matching Cross-Validation for Map Accuracy Estimation.” EGUsphere 2023: 1–16. https://doi.org/10.5194/egusphere-2023-1308.

Liu, Jun, Yu Qian, Shun-feng Song, and Rong-rong Duan. 2022. “Industrial Symbiotic Agglomeration and Green Economic Growth: A Spatial Difference-in-Differences Approach.” Journal of Cleaner Production 364: 132560. https://doi.org/10.1016/j.jclepro.2022.132560.

Meyer, Hanna, and Edzer Pebesma. 2021. “Predicting into Unknown Space? Estimating the Area of Applicability of Spatial Prediction Models.” Methods in Ecology and Evolution 12 (9): 1620–33. https://doi.org/10.1111/2041-210X.13650.

———. 2022. “Machine Learning-Based Global Maps of Ecological Variables and the Challenge of Assessing Them.” Nature Communincations 13. https://doi.org/ 10.1038/s41467-022-29838-9 .

Meyer, Hanna, Christoph Reudenbach, Tomislav Hengl, Marwan Katurji, and Thomas Nauss. 2018. “Improving Performance of Spatio-Temporal Machine Learning Models Using Forward Feature Selection and Target-Oriented Validation.” Environmental Modelling & Software 101: 1–9. https://doi.org/10.1016/j.envsoft.2017.12.001.

Meyer, Hanna, Christoph Reudenbach, Stephan Wöllauer, and Thomas Nauss. 2019. “Importance of Spatial Predictor Variable Selection in Machine Learning Applications – Moving from Data Reproduction to Spatial Prediction.” Ecological Modelling 411: 108815. https://doi.org/10.1016/j.ecolmodel.2019.108815.

Mila, Carles, Jorge Mateu, Edzer Pebesma, and Hanna Meyer. 2022. “Nearest Neighbour Distance Matching Leave-One-Out Cross-Validation for Map Validation.” Methods in Ecology and Evolution 13 (6): 1304–16. https://doi.org/10.1111/2041-210X.13851.

Nikparvar, Behnam, and Jean-Claude Thill. 2021. “Machine Learning of Spatial Data.” ISPRS International Journal of Geo-Information 10 (9). https://doi.org/10.3390/ijgi10090600.

Olsson, Gunnar. 1970. “Explanation, Prediction, and Meaning Variance: An Assessment of Distance Interaction Models.” Economic Geography 46: 223–33. http://www.jstor.org/stable/143140.

Pan, Minjie, Weiyong Zou, Kangjuan Lv, and Xinlei Qian. 2023. “Can Environmental Protection Interview Policy Reduce Air Pollution? -a Spatial Difference-in-Differences Approach.” Applied Economics 55 (11): 1217–33. https://doi.org/10.1080/00036846.2022.2096869.

Piras, Gianfranco, and Mauricio Sarrias. 2023. “Heterogeneous Spatial Models in r: Spatial Regimes Models.” Journal of Spatial Econometrics 4. https://doi.org/10.1007/s43071-023-00034-1.

Qiu, Feng, and Qingmeng Tong. 2021. “A Spatial Difference-in-Differences Approach to Evaluate the Impact of Light Rail Transit on Property Values.” Economic Modelling 99: 105496. https://doi.org/10.1016/j.econmod.2021.03.015.

Rubin, Donald B. 1974. “Estimating Causal Effects of Treatments in Randomized and Nonrandomized Studies.” Journal of Educational Psychology 66: 688–701. https://doi.org/10.1037/h0037350.

———. 1978. “Bayesian Inference for Causal Effects: The Role of Randomization.” The Annals of Statistics 6 (1): 34–58.

Schratz, Patrick, Jannes Muenchow, Eugenia Iturritxa, Jakob Richter, and Alexander Brenning. 2019. “Hyperparameter Tuning and Performance Assessment of Statistical and Machine-Learning Algorithms Using Spatial Data.” Ecological Modelling 406: 109–20. https://doi.org/10.1016/j.ecolmodel.2019.06.002.

Sunak, Yasin, and Reinhard Madlener. 2016. “The Impact of Wind Farm Visibility on Property Values: A Spatial Difference-in-Differences Analysis.” Energy Economics 55: 79–91. https://doi.org/10.1016/j.eneco.2015.12.025.

Valavi, Roozbeh, Jane Elith, José J. Lahoz-Monfort, and Gurutzeta Guillera-Arroita. 2019. “blockCV: An R Package for Generating Spatially or Environmentally Separated Folds for k-Fold Cross-Validation of Species Distribution Models.” Methods in Ecology and Evolution 10 (2): 225–32. https://doi.org/10.1111/2041-210X.13107.

Vidoli, Francesco, Giacomo Pignataro, and Roberto Benedetti. 2022. “Identification of Spatial Regimes of the Production Function of Italian Hospitals Through Spatially Constrained Cluster-Wise Regression.” Socio-Economic Planning Sciences 82: 101223. https://doi.org/10.1016/j.seps.2022.101223.

Wagner, Martin, and Achim Zeileis. 2019. “Heterogeneity and Spatial Dependence of Regional Growth in the EU: A Recursive Partitioning Approach.” German Economic Review 20 (1): 67–82. https://doi.org/10.1111/geer.12146.

Wang, Zhijian, and Yunquan Song. 2023. “Deep Learning for the Spatial Additive Autoregressive Model with Nonparametric Endogenous Effect.” Spatial Statistics 55: 100743. https://doi.org/10.1016/j.spasta.2023.100743.

Ward, M. D., and K. S. Gleditsch. 2008. Spatial Regression Models. Thousand Oaks, CA: Sage.

Wolf, Levi J. 2023. “Beyond Open Science: Data, Code, and Causality.” Environment and Planning B: Urban Analytics and City Science 50 (9): 2333–36. https://doi.org/10.1177/23998083231210180.

Xiao, Shuyue, Yunquan Song, and Zhijian Wang. 2023. “Nonparametric Spatial Autoregressive Model Using Deep Neural Networks.” Spatial Statistics 57: 100766. https://doi.org/10.1016/j.spasta.2023.100766.

Yoshida, Takahiro, Daisuke Murakami, and Hajime Seya. 2022. “Spatial Prediction of Apartment Rent Using Regression-Based and Machine Learning-Based Approaches with a Large Dataset.” Journal of Real Estate Finance & Economics, 1–28. https://doi.org/10.1007/s11146-022-09929-6.

Yu, Nannan, and Ying Jin. 2023. “The Unintended Economic Impact of High-Speed Rail on China’s Non-Core Cities: A Spatial-Difference-in-Differences Analysis.” Cities 143: 104618. https://doi.org/10.1016/j.cities.2023.104618.

Zeng, Juying, Cristina Blanco-González-Tejero, and F. Javier Sendra. 2023. “The Spatial Difference-in-Difference Measurement of Policy Effect of Environmental Protection Interview on Green Innovation.” Technological Forecasting and Social Change 191: 122511. https://doi.org/10.1016/j.techfore.2023.122511.

Zhang, Xu, Lin Liu, Minxuan Lan, Guangwen Song, Luzi Xiao, and Jianguo Chen. 2022. “Interpretable Machine Learning Models for Crime Prediction.” Computers, Environment and Urban Systems 94: 101789. https://doi.org/10.1016/j.compenvurbsys.2022.101789.

Zhu, Di, Yu Liu, Xin Yao, and Manfred M. Fischer. 2023a. “Correction to: Spatial Regression Graph Convolutional Neural Networks: A Deep Learning Paradigm for Spatial Multivariate Distributions.” GeoInformatica 27: 641–42. https://doi.org/10.1007/s10707-022-00461-6.

———. 2023b. “Spatial Regression Graph Convolutional Neural Networks: A Deep Learning Paradigm for Spatial Multivariate Distributions.” GeoInformatica 26: 645–76. https://doi.org/10.1007/s10707-021-00454-x.