Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

RMLStreamer produces no output, no errors, no processing #38

Open
sparkling opened this issue May 17, 2022 · 1 comment
Open

RMLStreamer produces no output, no errors, no processing #38

sparkling opened this issue May 17, 2022 · 1 comment

Comments

@sparkling
Copy link

sparkling commented May 17, 2022

>  pwd
/Users/blah/source/myorg/data
> ls -al
total 2649984
drwxrwxrwx   5 blah  staff         160 17 May 21:52 .
drwxr-xr-x  13 blah  staff         416 17 May 21:56 ..
-rwxrwxrwx@  1 blah  staff  1356784150 17 May 13:19 datafeed_1025648.csv
-rwxrwxrwx   1 blah  staff         671 17 May 21:52 map.ttl
-rw-r--r--   1 blah  staff           0 17 May 21:52 output.nq

map.ttl:

@prefix rr: <http://www.w3.org/ns/r2rml#> .
@prefix foaf: <http://xmlns.com/foaf/0.1/> .
@prefix ex: <http://example.com/> .
@prefix xsd: <http://www.w3.org/2001/XMLSchema#> .
@prefix rml: <http://semweb.mmlab.be/ns/rml#> .
@prefix ql: <http://semweb.mmlab.be/ns/ql#> .

@base <http://myorg.com/ns/> .

<TriplesMap1>
  a rr:TriplesMap;
        
  rml:logicalSource [ 
    rml:source "/mnt/data/datafeed_1025648.csv";
    rml:referenceFormulation ql:CSV
  ] ;
	
  rr:subjectMap [ 
    rr:template "http://myorg.com/{aw_product_id}" 
  ]; 
	
  rr:predicateObjectMap [ 
    rr:predicate foaf:aw_deep_link ; 
    rr:objectMap [ 
      rml:reference "aw_deep_link" 
    ]
  ].

Works fine with the RMLMapper java app. When I send this command to the Flink cluster:
toFile --mapping-file /mnt/data/map.ttl --output-path /mnt/data/output.nq

no errors are shown, and no output is generated (see screenshot below)

docker-compose.yml:

version: '3'
services:
  jobmanager:
    image: flink:1.14.0-scala_2.11-java11
    expose:
      - "6123"
    ports:
      - "8081:8081"
    command: jobmanager
    environment:
      - JOB_MANAGER_RPC_ADDRESS=jobmanager
    volumes:
      - /Users/blah/source/myorg/data:/mnt/data

  taskmanager:
    image: flink:1.14.0-scala_2.11-java11
    expose:
      - "6121"
      - "6122"
    depends_on:
      - jobmanager
    command: taskmanager
    links:
      - "jobmanager:jobmanager"
    environment:
      - JOB_MANAGER_RPC_ADDRESS=jobmanager
    volumes:
      - /Users/blah/source/myorg/data:/mnt/data

volumes:
  # This volume will show with 'docker volume ls' as 'docker_data'
  data:

Screenshot 2022-05-17 at 22 01 38

@ghsnd
Copy link
Contributor

ghsnd commented May 19, 2022

Hi, Thanks for the detailed description. I tried to reproduce it, but everything goes well. It would be helpful if you could also provide:

  • the version of RMLStreamer (release number or commit nr or current branch name...)
  • the CSV file or, if the data is sensitive, a sample with fake data but in the same format, same nr of fields, ...
  • the Docker version.

Thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants