Geographical-Data-Analysis-of-Poverty-in-Italy-with-R

Geographical Data Analysis of Poverty in Italy

This project explores poverty in Italy through a geographical and statistical perspective, using data from ISTAT and BES (Benessere Equo e Sostenibile).
The analysis combines descriptive statistics, spatial autocorrelation tests, and spatial regression models to highlight territorial inequalities and socio-economic drivers of poverty.

The analysis is based on a poverty rate dataset from Italian Statistical Institute ISTAT avaiable here that contains 8 sociodemographic variables listed here and is analized by 5 main scripts in src descripted here

Objectives


Workflow

  1. 01_data_preparation.R → Load shapefile and poverty dataset, merge, clean, and explore.
  2. 02_exploratory_analysis.R → Descriptive statistics, plots, and OLS baseline models.
  3. 03_spatial_weights.R → Build spatial weight matrices (Queen contiguity).
  4. 04_spatial_tests.R → Moran’s I, Geary’s C, and Local Indicators of Spatial Autocorrelation (LISA).
  5. 05_spatial_models.R → Estimate SAR and SAC spatial regression models, compare AIC, compute direct and indirect effects.

Technologies


Key Results

Regression Analysis

The linear regression model identifies the main drivers of poverty risk:

The model shows excellent fit (Adjusted R² = 0.98), confirming that these socio-economic variables explain most of the variance in poverty risk.


Spatial Effects (SAC Model)

The spatial autoregressive model highlights that poverty is not randomly distributed, but spreads across neighboring regions:

These findings imply the need for coordinated regional policies, since poverty dynamics do not stop at administrative borders.


Spatial Autocorrelation (Moran’s I and Local Moran)


Conclusions


References

## From a F.Cecere, G.Masiello & S.Spagnuolo collaboration.