Data
This data product contains the classification of the entire building stock of 10 European countries, geographically split based on NUTS1 regions (2024).
Available data
Below is an overview of the available data.
Coverage
The current coverage includes Austria, Belgium, Czechia, France, Germany, Lithuania, the Netherlands, Poland, Slovakia, and Spain. Additional European countries are expected in 2026.
Releases
The dataset is currently split into two releases. The first, from September 2025, covers Austria, Czechia, Germany, Lithuania, Poland, and Slovakia. The second, from November 2025, covers Belgium, France, the Netherlands, and Spain (without the Basque region). For now, the countries from the second release are assigned to the existing taxonomic tree.
Contents
Structure of all releases:
- {NUTS1}.parquet: Hierarchical Morphotope Classification results linked to building geometry
- {NUTS1}_morphotopes.parquet: Hierarchical Morphotope Classification results linked to morphotope geometry
- {NUTS1}_data.parquet: Underlying morphometric data linkable to building geometry
- morphotope_data.parquet: Aggregated data linkable to individual morphotopes
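The naming pattern above can be sketched as a small helper that lists the files expected for one region. This is only an illustration: the function name `release_files`, the dictionary keys, and the assumption that `{NUTS1}` is replaced by the bare NUTS1 code (for example `CZ0`) are not taken from the data product itself.

```python
def release_files(nuts1_code):
    """Return the file names expected for one NUTS1 region.

    Assumes `{NUTS1}` in the documented pattern is the plain NUTS1 code.
    The dictionary keys are hypothetical labels chosen for this sketch.
    """
    return {
        "buildings": f"{nuts1_code}.parquet",
        "morphotopes": f"{nuts1_code}_morphotopes.parquet",
        "building_data": f"{nuts1_code}_data.parquet",
        # Not region-specific: a single file with aggregated morphotope data.
        "morphotope_data": "morphotope_data.parquet",
    }
```

Calling `release_files("CZ0")` would give the four names to look for in a release, which can then be checked against the actual repository listing.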
Additional files available in the initial release (2025-09):
- label_name.json: Names of the cluster labels for the first three levels of taxonomy
- pen_portraits.json: Descriptions of the clusters for the first three levels of taxonomy
- ward_linkage.npy: The taxonomy itself encoded as a scipy linkage object
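Because the taxonomy is stored as a SciPy linkage object, it can in principle be cut into flat clusters at any level. The following is a minimal sketch, assuming `ward_linkage.npy` holds a standard `(n - 1, 4)` linkage matrix that `numpy.load` can read directly. The helper names `load_linkage` and `cut_taxonomy` are hypothetical, and how the resulting cluster ids map to the published labels is not covered here.

```python
import numpy as np
from scipy.cluster.hierarchy import fcluster


def load_linkage(path):
    """Load a linkage matrix and check it has the standard SciPy shape."""
    linkage = np.load(path)
    if linkage.ndim != 2 or linkage.shape[1] != 4:
        raise ValueError(f"unexpected linkage shape: {linkage.shape}")
    return linkage


def cut_taxonomy(linkage, n_clusters):
    """Return one flat cluster id per leaf, forming at most `n_clusters` clusters."""
    return fcluster(linkage, t=n_clusters, criterion="maxclust")
```

For example, `cut_taxonomy(load_linkage("ward_linkage.npy"), 8)` would return an array with one integer id per leaf of the tree. The ids produced by `fcluster` are arbitrary integers and need not match the names in `label_name.json`.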
Additionally, the repository managed by Charles University contains a PMTiles representation:
- complete.pmtiles: Hierarchical Morphotope Classification results linked to building geometry as a single PMTiles object
File formats
The majority of the data is stored as Apache Parquet, with geospatial data being encoded as GeoParquet.
The data is split regionally based on NUTS1 regions.
- {NUTS1}.parquet: GeoParquet 1.1.0, ZSTD compressed, with GeoArrow geometry encoding and covering bbox for fast spatial queries.
- {NUTS1}_morphotopes.parquet: GeoParquet 1.1.0, ZSTD compressed, with GeoArrow geometry encoding and covering bbox for fast spatial queries.
- {NUTS1}_data.parquet: Parquet, ZSTD compressed
- morphotope_data.parquet: Parquet, Snappy compressed. All characters for non-outlier morphotopes, used to build the taxonomy.
- label_name.json: JSON
- pen_portraits.json: JSON
- ward_linkage.npy: NumPy NPY
- complete.pmtiles: PMTiles
Data repositories
Zenodo
The main data repository is Zenodo, which contains a complete copy of the data product and issues a DOI to each version.
2025-09 release
The initial release, covering AT, CZ, DE, LT, PL, and SK, is available from doi.org/10.5281/zenodo.17076283.
2025-11 release
The first expansion, covering BE, NL, FR, and ES, is available from doi.org/10.5281/zenodo.17600617.
Direct download
You can also download the data directly from the repository managed by Charles University.
2025-09 release
2025-11 release
License
This dataset is based on data from multiple sources. The combined database is licensed under the Open Database License (ODbL) v1.0.
Attribution: Contains data © Charles University, OpenStreetMap contributors, TomTom, GeoBasis-DE/LGB, GeoSN, LGL Baden-Württemberg, GeoBasis-DE/MV, GeoBasis-DE/LVGL-SL, IT.NRW, GeoBasis-DE/LGLN, GDI-Th, LVermGeo ST, GeoBasis-DE/LVermGeo SH, Bayerische Vermessungsverwaltung, LGV Hamburg, Landesamt GeoInformation Bremen, GUGiK (PL), ČÚZK (CZ), ÚGKK SR (SK), BEV (AT), Geoportal.lt (LT), IGN, 3DGI, DGC, and others.
Individual source building data is available under the following licenses and attributions:
- Brandenburg: GeoBasis-DE/LGB — Datenlizenz Deutschland – Namensnennung – Version 2.0
- Saxony: State Office for Geobasis Information Saxony (GeoSN) — Datenlizenz Deutschland – Namensnennung – Version 2.0
- Baden-Württemberg: LGL, www.lgl-bw.de — Datenlizenz Deutschland – Namensnennung – Version 2.0
- Mecklenburg-Vorpommern: GeoBasis-DE/MV — CC-BY 4.0
- Rheinland-Pfalz: GeoBasis-DE/LGB — Datenlizenz Deutschland – Namensnennung – Version 2.0
- Saarland: © GeoBasis-DE/LVGL-SL (2024) — Datenlizenz Deutschland – Namensnennung – Version 2.0
- Nordrhein-Westfalen: Geoinformationszentrum, Information und Technik Nordrhein-Westfalen — Datenlizenz Deutschland – Namensnennung – Version 2.0
- Niedersachsen: © GeoBasis-DE/LGLN — CC-BY 4.0
- Hessen: Datenlizenz Deutschland – Zero – Version 2.0
- Thuringia: © GDI-Th — Datenlizenz Deutschland – Namensnennung – Version 2.0
- Saxony-Anhalt: GeoBasis-DE / LVermGeo ST — Datenlizenz Deutschland – Namensnennung – Version 2.0
- Schleswig-Holstein: © GeoBasis-DE/LVermGeo SH — CC-BY-SA 4.0
- Bavaria: Bayerische Vermessungsverwaltung — CC-BY 4.0
- Berlin: Datenlizenz Deutschland – Zero – Version 2.0
- Hamburg: © LGV Hamburg — Datenlizenz Deutschland – Namensnennung – Version 2.0
- Bremen: Landesamt GeoInformation Bremen — CC-BY 4.0
- Poland: Główny Urząd Geodezji i Kartografii — CC-BY 4.0
- Czechia: ČÚZK — CC-BY 4.0
- Slovakia: Úrad geodézie, kartografie a katastra SR — CC-BY 4.0
- Austria: BEV — CC-BY 4.0
- Lithuania: Geoportal.lt — CC-BY 4.0
- France: Licence Ouverte / Open Licence 2.0 © IGN
- Belgium: Open Database License (ODbL) OpenStreetMap Contributors
- Netherlands: CC-BY-4.0 © 3DBAG by tudelft3d and 3DGI
- Spain: LICENCIA DE ACCESO Y USO DE LOS SERVICIOS Y CONJUNTOS DE DATOS INSPIRE DE LA DIRECCIÓN GENERAL DEL CATASTRO © DGC