Data Catalogue¶
This catalogue explains the starter sources of the example data catalogue in configs/sources.yml. They are examples for the hackathon, not a closed list. Participants can use these sources directly or copy one entry to add another official/open source.
Readiness Labels¶
| Label | Meaning |
|---|---|
| Analysis-ready | The payload already contains rows or records that can be loaded directly. |
| Needs cleanup | The payload contains useful records, but labels, dates, or URIs should be normalized before analysis. |
| Catalogue only | The payload mostly contains metadata and links to the real downloadable files. |
| Publication page | The payload is a page snapshot with links and text, useful for document discovery or text mining. |
Starter Sources of the Example Data Catalogue¶
| Source ID | What the payload contains | Important variables | Readiness | Good first use |
|---|---|---|---|---|
opendata_swiss |
CKAN package metadata for the Swiss energy-balance dataset. | result.title, result.description, result.keywords, result.resources[].format, result.resources[].download_url, result.resources[].rights, result.metadata_modified. |
Catalogue only | Find the official CSV/SPARQL resources and license references for the SFOE energy balance. |
sfoe_energy_balance_csv |
Direct CSV table from the Swiss Federal Office of Energy. | Jahr, Rubrik, Energietraeger, TJ. |
Analysis-ready | Build energy dependency, consumption mix, or resilience indicators over time. |
meteo_swiss_smn |
STAC collection metadata for MeteoSwiss automatic weather stations. | id, title, description, extent.spatial, extent.temporal, assets, links. |
Catalogue only | Use assets for station/parameter inventories and links for actual STAC items. |
geoadmin_army_nature_landscape |
STAC collection metadata for nature and landscape values on military-used land. | id, title, description, extent.spatial, extent.temporal, links. |
Catalogue only | Download geodata assets for military land, environmental constraints, and spatial exposure. |
geoadmin_civil_protection_meeting_points |
STAC collection metadata for emergency meeting points. | id, title, description, extent.spatial, links. |
Catalogue only | Download point geometries and join them with population or hazard layers. |
geoadmin_surface_runoff_hazard |
STAC collection metadata for the overland-flow hazard map. | id, title, description, extent.spatial, extent.temporal, links. |
Catalogue only | Download hazard geodata and combine it with critical infrastructure or population exposure. |
geoadmin_nuclear_emergency_zones |
STAC collection metadata for nuclear emergency planning zones. | id, title, description, extent.spatial, extent.temporal, links. |
Catalogue only | Download polygon zones for emergency-planning or cascading-risk scenarios. |
armasuisse_st_publications |
HTML snapshot of the official armasuisse Science and Technology publications page. | Page text, href links, PDF URLs, topic terms such as cyber, artificial intelligence, drones, quantum, research. |
Publication page | Discover publications and PDFs for technology trend scanning. |
aramis_armasuisse_research_projects |
JSON list of armasuisse projects from the ARAMIS federal research database. | CountOfMatchedProjects, Projects[].Id, Projects[].ProjectNumber, Projects[].Titel.Text, Projects[].Abstract.Text, Projects[].Department.Text, Projects[].Status.Text, Projects[].StartDate, Projects[].EndDate. |
Analysis-ready | Mine federal research projects for technology signals, defence priorities, and emerging capability areas. |
parliament_affairs |
JSON list of recent Swiss parliamentary affairs matching strategic terms. | ID, BusinessShortNumber, BusinessTypeName, Title, BusinessStatusText, ResponsibleDepartmentName, SubmissionDate, Modified, TagNames. |
Needs cleanup | Track political signals around energy, infrastructure, security, climate, supply, and the armed forces. |
lindas |
CSV SPARQL result for SFOE energy-balance linked data. | observation, year, rubric, energy_carrier, tj. |
Needs cleanup | Use linked-data identifiers for reproducible energy indicators; map URI values to readable labels before presentation. |
fedlex |
JSON SPARQL result for legal and regulatory signals. | act, title, date. |
Analysis-ready | Track federal gazette and legal signals around energy, supply, infrastructure, and climate. |
bfs_pxweb |
CSV table from BFS STAT-TAB with canton population exposure. | Year, Canton, Population type, Sex, Marital status, Age - total. |
Analysis-ready | Use canton population as exposure/context data for hazard or resilience indicators. |
Folder Pattern¶
Each run writes a timestamped snapshot:
data/raw/<source_id>/<timestamp>/payload.json
data/raw/<source_id>/<timestamp>/metadata.json
The payload can also be payload.csv or payload.html, depending on the source type.
In RenkuLab, the same logical raw folder may resolve to the mounted connector
../govtech-raw when that folder exists next to the repository.
Metadata¶
Every snapshot has a metadata.json file. It records the source name, connector type, format, retrieval timestamp, documentation URL, access path, license or terms, geographic reference, update logic, and the exact request details such as URL, query, parameters, or POST body.
For a hackathon submission, this metadata is the reproducibility evidence: it shows where the data came from, when it was retrieved, and how another participant can retrieve it again.