This repository has been archived by the owner on Oct 5, 2021. It is now read-only.

Configuring SparkGraphComputer for OLAP #269

Open
sandeepdoctily opened this issue Apr 14, 2018 · 2 comments

sandeepdoctily commented Apr 14, 2018

Hi all,

I am trying to configure SparkGraphComputer with DynamoDB Local. Please find my configuration below; kindly help me out.

# TinkerPop Hadoop Graph for OLAP
gremlin.graph=org.apache.tinkerpop.gremlin.hadoop.structure.HadoopGraph

# Set the default OLAP computer for graph.traversal().withComputer()
gremlin.hadoop.defaultGraphComputer=org.apache.tinkerpop.gremlin.spark.process.computer.SparkGraphComputer
gremlin.hadoop.graphInputFormat=org.apache.hadoop.dynamodb.read.DynamoDBInputFormat
gremlin.hadoop.graphOutputFormat=org.apache.hadoop.dynamodb.write.DynamoDBOutputFormat

####################################
# SparkGraphComputer Configuration
####################################
spark.master=local[*]
spark.executor.memory=200m
spark.serializer=org.apache.spark.serializer.KryoSerializer
spark.akka.timeout=500000
#spark.kryo.registrationRequired=false
spark.storage.memoryFraction=0.2
spark.eventLog.enabled=true
spark.eventLog.dir=/tmp/spark-event-logs
spark.ui.killEnabled=true
spark.dynamicAllocation.enabled=false
spark.network.timeout=60000
spark.rpc.askTimeout=80000
spark.sql.broadcastTimeout=90000
#spark.serializer=org.apache.spark.serializer.KryoSerializer

#janusgraphmr.ioformat.conf.storage.backend=com.amazon.janusgraph.diskstorage.dynamodb.DynamoDBStoreManager
#janusgraphmr.ioformat.conf.storage.dynamodb.client.credentials.class-name=com.amazonaws.auth.BasicAWSCredentials
#janusgraphmr.ioformat.conf.storage.dynamodb.client.credentials.constructor-args=access,secret
#janusgraphmr.ioformat.conf.storage.dynamodb.client.signing-region=us-east-1
#janusgraphmr.ioformat.conf.storage.dynamodb.client.endpoint=http://localhost:8000
#gremlin.graph=org.janusgraph.core.JanusGraphFactory

#metrics.enabled=true
#metrics.prefix=j
#metrics.csv.interval=1000
#metrics.csv.directory=metrics

storage.write-time=1 ms
storage.read-time=1 ms
storage.backend=com.amazon.janusgraph.diskstorage.dynamodb.DynamoDBStoreManager
storage.dynamodb.client.credentials.class-name=com.amazonaws.auth.BasicAWSCredentials
storage.dynamodb.client.credentials.constructor-args=access,secret
storage.dynamodb.client.signing-region=us-east-1
storage.dynamodb.client.endpoint=http://localhost:8000

When I run a query I get the exception below:
gremlin> g.V().count()

java.lang.RuntimeException: class org.apache.hadoop.dynamodb.read.DynamoDBInputFormat not org.apache.hadoop.mapreduce.InputFormat
at org.apache.hadoop.conf.Configuration.getClass(Configuration.java:2221)
at org.apache.tinkerpop.gremlin.spark.process.computer.SparkGraphComputer.lambda$submitWithExecutor$0(SparkGraphComputer.java:177)
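The stack trace points at the likely root cause: Hadoop's `Configuration.getClass` verifies that the configured class is assignable to the expected interface, and the EMR connector's `DynamoDBInputFormat` implements the older `org.apache.hadoop.mapred.InputFormat` API rather than the newer `org.apache.hadoop.mapreduce.InputFormat` that SparkGraphComputer requests. A minimal, self-contained illustration of that check — the interfaces and classes below are stand-ins, not the real Hadoop types:

```java
// Stand-ins for the two incompatible Hadoop APIs.
interface NewApiInputFormat {}   // stands in for org.apache.hadoop.mapreduce.InputFormat
interface OldApiInputFormat {}   // stands in for org.apache.hadoop.mapred.InputFormat

// The EMR connector's DynamoDBInputFormat implements only the old API.
class StandInDynamoDBInputFormat implements OldApiInputFormat {}

public class InputFormatCheck {
    // Mirrors the assignability check inside Configuration.getClass:
    // the configured class must implement the expected interface.
    static Class<?> getClassChecked(Class<?> configured, Class<?> expected) {
        if (!expected.isAssignableFrom(configured)) {
            throw new RuntimeException(
                "class " + configured.getName() + " not " + expected.getName());
        }
        return configured;
    }

    public static void main(String[] args) {
        try {
            getClassChecked(StandInDynamoDBInputFormat.class, NewApiInputFormat.class);
        } catch (RuntimeException e) {
            // Same shape as the error in the issue:
            // class StandInDynamoDBInputFormat not NewApiInputFormat
            System.out.println(e.getMessage());
        }
    }
}
```

The same class name passes when checked against the old API and fails against the new one, which is exactly the mismatch the exception reports.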

@sandeepdoctily sandeepdoctily changed the title Using SparkGraphComputer for OLAP Configuring SparkGraphComputer for OLAP Apr 14, 2018
amcp (Contributor) commented May 9, 2018

DynamoDBInputFormat is not implemented yet, but could be implemented by copying from or depending on the DynamoDB EMR connector. https://github.com/awslabs/emr-dynamodb-connector/blob/master/emr-dynamodb-hadoop/src/main/java/org/apache/hadoop/dynamodb/read/DynamoDBInputFormat.java
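In other words, wiring the EMR connector class straight into `gremlin.hadoop.graphInputFormat` cannot work: TinkerPop's HadoopGraph expects a `org.apache.hadoop.mapreduce.InputFormat` that emits `VertexWritable` records, which the connector does not provide. If such an input format were implemented as suggested above, the configuration would be wired roughly as sketched below — the input-format class name is hypothetical, no such class exists in this repository:

```properties
gremlin.graph=org.apache.tinkerpop.gremlin.hadoop.structure.HadoopGraph
gremlin.hadoop.defaultGraphComputer=org.apache.tinkerpop.gremlin.spark.process.computer.SparkGraphComputer

# HYPOTHETICAL class name: an input format like this would first have to be
# written, e.g. by building on the emr-dynamodb-connector as suggested above.
gremlin.hadoop.graphInputFormat=com.amazon.janusgraph.hadoop.DynamoDBVertexInputFormat

# Storage settings for the OLAP input format are passed through the
# janusgraphmr.ioformat.conf.* prefix (note: single '=', not '==').
janusgraphmr.ioformat.conf.storage.backend=com.amazon.janusgraph.diskstorage.dynamodb.DynamoDBStoreManager
janusgraphmr.ioformat.conf.storage.dynamodb.client.endpoint=http://localhost:8000
```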

@danielwhatmuff

Is DynamoDBInputFormat now implemented?
