Data Catalogue

This catalogue describes the starter sources defined in the example data catalogue, configs/sources.yml. They are examples for the hackathon, not a closed list: participants can use these sources directly or copy one entry as a template for adding another official or open source.

Readiness Labels

| Label | Meaning |
| --- | --- |
| Analysis-ready | The payload already contains rows or records that can be loaded directly. |
| Needs cleanup | The payload contains useful records, but labels, dates, or URIs should be normalized before analysis. |
| Catalogue only | The payload mostly contains metadata and links to the real downloadable files. |
| Publication page | The payload is a page snapshot with links and text, useful for document discovery or text mining. |

Starter Sources of the Example Data Catalogue

| Source ID | What the payload contains | Important variables | Readiness | Good first use |
| --- | --- | --- | --- | --- |
| opendata_swiss | CKAN package metadata for the Swiss energy-balance dataset. | result.title, result.description, result.keywords, result.resources[].format, result.resources[].download_url, result.resources[].rights, result.metadata_modified. | Catalogue only | Find the official CSV/SPARQL resources and license references for the SFOE energy balance. |
| sfoe_energy_balance_csv | Direct CSV table from the Swiss Federal Office of Energy. | Jahr, Rubrik, Energietraeger, TJ. | Analysis-ready | Build energy dependency, consumption mix, or resilience indicators over time. |
| meteo_swiss_smn | STAC collection metadata for MeteoSwiss automatic weather stations. | id, title, description, extent.spatial, extent.temporal, assets, links. | Catalogue only | Use assets for station/parameter inventories and links for actual STAC items. |
| geoadmin_army_nature_landscape | STAC collection metadata for nature and landscape values on military-used land. | id, title, description, extent.spatial, extent.temporal, links. | Catalogue only | Download geodata assets for military land, environmental constraints, and spatial exposure. |
| geoadmin_civil_protection_meeting_points | STAC collection metadata for emergency meeting points. | id, title, description, extent.spatial, links. | Catalogue only | Download point geometries and join them with population or hazard layers. |
| geoadmin_surface_runoff_hazard | STAC collection metadata for the overland-flow hazard map. | id, title, description, extent.spatial, extent.temporal, links. | Catalogue only | Download hazard geodata and combine it with critical infrastructure or population exposure. |
| geoadmin_nuclear_emergency_zones | STAC collection metadata for nuclear emergency planning zones. | id, title, description, extent.spatial, extent.temporal, links. | Catalogue only | Download polygon zones for emergency-planning or cascading-risk scenarios. |
| armasuisse_st_publications | HTML snapshot of the official armasuisse Science and Technology publications page. | Page text, href links, PDF URLs, topic terms such as cyber, artificial intelligence, drones, quantum, research. | Publication page | Discover publications and PDFs for technology trend scanning. |
| aramis_armasuisse_research_projects | JSON list of armasuisse projects from the ARAMIS federal research database. | CountOfMatchedProjects, Projects[].Id, Projects[].ProjectNumber, Projects[].Titel.Text, Projects[].Abstract.Text, Projects[].Department.Text, Projects[].Status.Text, Projects[].StartDate, Projects[].EndDate. | Analysis-ready | Mine federal research projects for technology signals, defence priorities, and emerging capability areas. |
| parliament_affairs | JSON list of recent Swiss parliamentary affairs matching strategic terms. | ID, BusinessShortNumber, BusinessTypeName, Title, BusinessStatusText, ResponsibleDepartmentName, SubmissionDate, Modified, TagNames. | Needs cleanup | Track political signals around energy, infrastructure, security, climate, supply, and the armed forces. |
| lindas | CSV SPARQL result for SFOE energy-balance linked data. | observation, year, rubric, energy_carrier, tj. | Needs cleanup | Use linked-data identifiers for reproducible energy indicators; map URI values to readable labels before presentation. |
| fedlex | JSON SPARQL result for legal and regulatory signals. | act, title, date. | Analysis-ready | Track federal gazette and legal signals around energy, supply, infrastructure, and climate. |
| bfs_pxweb | CSV table from BFS STAT-TAB with canton population exposure. | Year, Canton, Population type, Sex, Marital status, Age - total. | Analysis-ready | Use canton population as exposure/context data for hazard or resilience indicators. |
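
For a Catalogue only source such as opendata_swiss, a typical first step is to pull the download links out of the metadata. The following is a minimal sketch, assuming the payload is a JSON object with the result.resources[] fields listed above. The function name list_resources and the sample values are hypothetical and only illustrate the shape of the data.

```python
def list_resources(payload: dict, wanted_formats=None) -> list[dict]:
    """Return format, download URL, and rights for each resource in the payload."""
    wanted = {f.upper() for f in wanted_formats} if wanted_formats else None
    rows = []
    for res in payload.get("result", {}).get("resources", []):
        fmt = (res.get("format") or "").upper()
        # Skip resources whose format was not requested.
        if wanted is not None and fmt not in wanted:
            continue
        rows.append({
            "format": fmt,
            "download_url": res.get("download_url"),
            "rights": res.get("rights"),
        })
    return rows


# Illustrative payload only; URLs and rights values are placeholders.
sample = {
    "result": {
        "title": "Example dataset",
        "resources": [
            {"format": "CSV", "download_url": "https://example.org/data.csv", "rights": "placeholder-terms"},
            {"format": "SPARQL", "download_url": None, "rights": "placeholder-terms"},
        ],
    }
}

for row in list_resources(sample, wanted_formats=["csv"]):
    print(row["format"], row["download_url"], row["rights"])
```

Keeping the rights value next to each URL makes it easier to carry the license reference into later analysis steps.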

Folder Pattern

Each run writes a timestamped snapshot:

data/raw/<source_id>/<timestamp>/payload.json
data/raw/<source_id>/<timestamp>/metadata.json

The payload can also be payload.csv or payload.html, depending on the source type. In RenkuLab, the same logical raw folder may resolve to the mounted connector ../govtech-raw when that folder exists next to the repository.
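
The folder pattern can be resolved in code roughly as follows. This is a sketch under stated assumptions, not the project's own loader: the helper names raw_root and latest_snapshot are hypothetical, the mounted connector is assumed to contain the same <source_id>/<timestamp> layout as data/raw, and timestamp folder names are assumed to sort chronologically as plain strings.

```python
from pathlib import Path


def raw_root(repo_root: Path) -> Path:
    """Prefer the mounted connector ../govtech-raw when it exists, else data/raw."""
    mounted = repo_root.parent / "govtech-raw"
    return mounted if mounted.is_dir() else repo_root / "data" / "raw"


def latest_snapshot(repo_root: Path, source_id: str):
    """Return (payload_path, metadata_path) for the newest complete snapshot, or None."""
    source_dir = raw_root(repo_root) / source_id
    if not source_dir.is_dir():
        return None
    # Assumption: timestamp folder names sort chronologically as strings.
    snapshots = sorted((p for p in source_dir.iterdir() if p.is_dir()), reverse=True)
    for snap in snapshots:
        payloads = sorted(snap.glob("payload.*"))  # payload.json, .csv, or .html
        metadata = snap / "metadata.json"
        if payloads and metadata.is_file():
            return payloads[0], metadata
    return None
```

Calling latest_snapshot(Path("."), "sfoe_energy_balance_csv") from the repository root would then return the newest payload and its metadata file, or None if no complete snapshot exists yet.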

Metadata

Every snapshot has a metadata.json file. It records the source name, connector type, format, retrieval timestamp, documentation URL, access path, license or terms, geographic reference, update logic, and the exact request details such as URL, query, parameters, or POST body.

For a hackathon submission, this metadata is the reproducibility evidence: it shows where the data came from, when it was retrieved, and how another participant can retrieve it again.
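
Before submitting, it can help to check that a snapshot's metadata is complete. The sketch below assumes metadata.json is a flat JSON object; the key names in EXPECTED_KEYS are hypothetical stand-ins for the fields described above and should be replaced with the keys found in a real metadata.json.

```python
import json
from pathlib import Path

# Hypothetical key names; check a real metadata.json for the actual ones.
EXPECTED_KEYS = [
    "source_name",
    "connector_type",
    "format",
    "retrieved_at",
    "documentation_url",
    "license",
    "request",
]


def load_metadata(path) -> dict:
    """Read a snapshot's metadata.json into a dictionary."""
    return json.loads(Path(path).read_text(encoding="utf-8"))


def missing_metadata_keys(metadata: dict, expected=EXPECTED_KEYS) -> list[str]:
    """Return the expected keys that are absent or empty in the metadata."""
    return [key for key in expected if metadata.get(key) in (None, "", {}, [])]
```

An empty list from missing_metadata_keys(load_metadata(path)) suggests the snapshot carries enough information for another participant to repeat the retrieval.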