Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Include results from OpenAire #4

Open
dgarijo opened this issue Jul 4, 2020 · 0 comments
Open

Include results from OpenAire #4

dgarijo opened this issue Jul 4, 2020 · 0 comments

Comments

@dgarijo
Copy link
Contributor

dgarijo commented Jul 4, 2020

Once we have a first version of the prototype, we could expand with the results from OpenAIRE:
https://zenodo.org/record/3516918#.Xv-9j-d7lPY

This is a result sample (this one is not particularly interesting because it doesn't have github id, but others do. We can

<?xml version="1.0" encoding="UTF-8"?>
<record>
  <result xmlns:dri="http://www.driver-repository.eu/namespace/dri" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
    <header>
      <dri:objIdentifier>datacite____::04386576aee3ff62d84758dc2f8a4080</dri:objIdentifier>
      <dri:dateOfCollection>2018-10-28T00:39:04.337Z</dri:dateOfCollection>
      <dri:dateOfTransformation/>
      <counters>
          <counter_doi value="2"/>
          </counters>
    </header>
    <metadata>
      <oaf:entity xmlns:oaf="http://namespace.openaire.eu/oaf" xsi:schemaLocation="http://namespace.openaire.eu/oaf https://www.openaire.eu/schema/1.0/oaf-1.0.xsd">
                <oaf:result>
                        <publisher>nanoHUB</publisher><creator rank="1" name="Hong-Hyun" surname="Park">Park, Hong-Hyun</creator><creator rank="2" name="Zhengping" surname="Jiang">Jiang, Zhengping</creator><creator rank="3" name="Arun" surname="Akkala">Akkala, Arun</creator><creator rank="4" name="Sebastian" surname="Steiger">Steiger, Sebastian</creator><creator rank="5" name="Michael" surname="Povolotskyi">Povolotskyi, Michael</creator><creator rank="6" name="Tillmann" surname="Kubis">Kubis, Tillmann</creator><creator rank="7" name="Jean" surname="Sellier">Sellier, Jean</creator><creator rank="8" name="Yaohua" surname="Tan">Tan, Yaohua</creator><creator rank="9" name="Sunggeun" surname="Kim">Kim, SungGeun</creator><creator rank="10" name="Mathieu" surname="Luisier">Luisier, Mathieu</creator><creator rank="11" name="Samarth" surname="Agarwal">Agarwal, Samarth</creator><creator rank="12" name="Michael" surname="Mclennan">McLennan, Michael</creator><creator rank="13" name="Gerhard" surname="Klimeck">Klimeck, Gerhard</creator><creator rank="14" name="Junzhe" surname="Geng">Geng, Junzhe</creator><dateofacceptance>2008-01-01</dateofacceptance><size/><resourcetype classid="Simulation Tool" classname="Simulation Tool" schemeid="dnet:dataCite_resource" schemename="dnet:dataCite_resource"/><programmingLanguage classid="" classname="" schemeid="dnet:programming_languages" schemename="dnet:programming_languages"/><language classid="en" classname="en" schemeid="dnet:languages" schemename="dnet:languages"/><relevantdate classid="UNKNOWN" classname="UNKNOWN" schemeid="dnet:dataCite_date" schemename="dnet:dataCite_date">2008-09-05 18:32:34</relevantdate><storagedate>2008</storagedate><title classid="main title" classname="main title" schemeid="dnet:dataCite_title" schemename="dnet:dataCite_title">RTD Tool</title><resulttype classid="software" classname="software" schemeid="dnet:result_typologies" schemename="dnet:result_typologies"/><version>None</version><country classid="" classname="" schemeid="" schemename=""/><subject classid="" classname="" schemeid="" schemename=""/><description/><embargoenddate/><source/><fulltext/><format/><contributor/><coverage/><refereed/><device/><lastmetadataupdate/><metadataversionnumber/><documentationUrl/><codeRepositoryUrl/><contactperson/><contactgroup/><tool/><collectedfrom name="Datacite" id="openaire____::9e3be59865b2c1c335d32dae2fe7b254"/><pid classid="doi" classname="doi" schemeid="dnet:pid_types" schemename="dnet:pid_types">10.4231/d35t3fz9c</pid><pid classid="doi" classname="doi" schemeid="dnet:pid_types" schemename="dnet:pid_types">https://doi.org/10.4231/d35t3fz9c</pid><originalId>http://dx.doi.org/10.4231/d35t3fz9c</originalId><originalId>10.4231/d35t3fz9c</originalId><originalId>https://doi.org/10.4231/d35t3fz9c</originalId><bestaccessright classid="UNKNOWN" classname="not available" schemeid="dnet:access_modes" schemename="dnet:access_modes"/><datainfo><inferred>false</inferred><deletedbyinference>false</deletedbyinference><trust>0.9</trust><inferenceprovenance/><provenanceaction classid="sysimport:crosswalk:datasetarchive" classname="sysimport:crosswalk:datasetarchive" schemeid="dnet:provenanceActions" schemename="dnet:provenanceActions"/></datainfo>
                  <rels>
                  </rels>
                  <children>
                        <instance id="openaire____::55045bd2a65019fd8e6741a755395c8c">
                            <collectedfrom name="Datacite" id="openaire____::9e3be59865b2c1c335d32dae2fe7b254"/><license/><distributionlocation/><accessright classid="UNKNOWN" classname="UNKNOWN" schemeid="dnet:access_modes" schemename="dnet:access_modes"/><hostedby name="Unknown Repository" id="openaire____::55045bd2a65019fd8e6741a755395c8c"/><instancetype classid="0029" classname="Software" schemeid="dnet:dataCite_resource" schemename="dnet:dataCite_resource"/><dateofacceptance>2008-01-01</dateofacceptance>
                                <webresource>
                                  <url>http://dx.doi.org/10.4231/d35t3fz9c</url>
                                </webresource>

                        </instance>

                  </children>
                </oaf:result>

      </oaf:entity>
    </metadata>
  </result>

Command:
gunzip -c file.json.gz | head -n 10 | jq '.body."$binary"' -r | while IFS= read -r line; do echo "$line" | base64 --decode | bsdtar -x -O ; done (e.g., for 20)

@dgarijo dgarijo added this to the New sources of data milestone Jul 7, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant