Skip to content

Catalogs management

Emanuele Tajariol edited this page Nov 14, 2017 · 24 revisions

Testing the existing PA's catalogs across the current developments

Below some examples of some relevant catalogs to test during the ongoing developments:

1. CKAN catalogs not compliant with the DCAT-AP_IT profile:

  • Bari http://opendata.comune.bari.it/

    • Harvester: "CKAN harvester for DCATAPIT"

      • Groups are mapped to themes
    • Configuration:

      {"default_extras":{
          "source_catalog_title":"Comune di Bari",
          "source_catalog_description":"Catalogo OpenData del Comune di Bari",  
          "source_catalog_homepage":"http://opendata.comune.bari.it/", 
          "source_catalog_language":"ITA", 
          "source_catalog_modified":"2017-01-01T00:00:00"}}
      
  • Basilicata http://dati.regione.basilicata.it/

    • Harvester: "CKAN harvester for DCATAPIT"

      • Groups are mapped to themes
    • Configuration:

      {"default_extras":{
          "source_catalog_title":"Regione Basilicata",
          "source_catalog_description":"Catalogo OpenData della Regione Basilicata",  
          "source_catalog_homepage":"http://dati.regione.basilicata.it/", 
          "source_catalog_language":"ITA", 
          "source_catalog_modified":"2017-01-01T00:00:00"}}
      
  • Umbria http://dati.umbria.it/

    • Harvester: "CKAN harvester for DCATAPIT"

      • Groups are mapped to themes
    • Configuration:

      {"clean_tags":true,
       "default_extras":{
           "source_catalog_title":"Regione Umbria",
           "source_catalog_description":"Catalogo OpenData della Regione Umbria",  
           "source_catalog_homepage":"http://dati.umbria.it/", 
           "source_catalog_language":"ITA", 
           "source_catalog_modified":"2017-01-01T00:00:00"}}
      
  • Marche http://goodpa.regione.marche.it/

    • Questo catalogo non suddivide i dataset tramite gruppi.
  • Veneto http://dati.veneto.it/

    • Harvester: "CKAN harvester for DCATAPIT"

      • Groups are mapped to themes
    • Configuration:

      {"clean_tags":true,
       "default_extras":{
           "source_catalog_title":"Regione Veneto",
           "source_catalog_description":"Catalogo OpenData della Regione Veneto",
           "source_catalog_homepage":"http://dati.veneto.it/", 
           "source_catalog_language":"ITA", 
           "source_catalog_modified":"2017-01-01T00:00:00"}}
      

2. CKAN catalogs not compliant with the DCAT-AP_IT profile which expose json file (data.json DCAT JSON harvester):

Genova http://dati.comune.genova.it/data.json

Lombardia https://www.dati.lombardia.it/data.json —> here there are blockers during harvesting but they are not still compliant with dcatapit profile (work in progress)

3. CKAN catalogs with several problems on Tags:

  • Regione Sardegna (several errors on tags, 801 dataset, only the half part can be currently harvested due to the exsisting issues on tags validation), http://opendata.regione.sardegna.it/

    • URL: http://dati.regione.sardegna.it/

    • Harvester: "CKAN harvester for DCATAPIT"

    • Configuration:

      {"clean_tags":true,
       "default_extras":{
           "source_catalog_title":"Regione Sardegna",
           "source_catalog_description":"Catalogo OpenData della Regione Sardegna", 
           "source_catalog_homepage":"http://opendata.regione.sardegna.it/", 
           "source_catalog_language":"ITA", 
           "source_catalog_modified":"2017-01-01T00:00:00"}}
      
  • Lombardia (https://www.dati.lombardia.it/data.json)

The involved tags are mainly in geospatial data (same problem for Sardegna, Bolzano and Trentino)

4. Good RDF catalogs:

MIBACT (http://dati.beniculturali.it/sparql?default-graph-uri=&query=construct+%7B%3Fs+%3Fp+%3Fo%7Dwhere%7Bgraph+%3Chttp%3A%2F%2Fdati.beniculturali.it%2Fdescrizione_dataset%3E+%7B%3Fs+%3Fp+%3Fo%7D%7D&format=application%2Frdf%2Bxml))

Camera di Commercio di Macerata (http://www.mc.camcom.it/uploaded/linked-open-data/dcat-opendata-catalog.rdf)

Protezione Civile (http://dati.protezionecivile.it/geoportalOD/GetCatalogXSL)

Regione Piemonte (some small error that depends on Regione Piemonte, nothing to do at the moment http://api.smartdatanet.it/metadataapi/api/dcat/dataset_list?opendata=true&tenant=regpie&rows=1000&linkedData=true&outputFormat=turtle)

Complete list of Harvest Sources

HERE a complete list of Harvest Sources useful for testing

Clone this wiki locally