Skip to content

Running Spark Tasks on Flyte #1902

Answered by SandraGH5
SandraGH5 asked this question in Q&A
Discussion options

You must be logged in to vote

I think your example code doesn't have flytekitplugins-spark. Please check the example: https://github.com/flyteorg/flytesnacks/blob/master/cookbook/integrations/kubernetes/k8s_spark/requirements.in

requirements.in

-r ../../../common/requirements-common.in
pyspark

flytekitplugins-spark>=0.16.0

Also, if Java is installed, set JAVA_HOME env variable in the Dockerfile, using a docker file like this:
https://github.com/flyteorg/flytesnacks/blob/master/cookbook/integrations/kubernetes/k8s_spark/Dockerfile which has java installed

Dockerfile

FROM ubuntu:focal
LABEL org.opencontainers.image.source https://github.com/flyteorg/flytesnacks

WORKDIR /root
ENV VENV /opt/venv
ENV LANG C.UTF-8
ENV LC…

Replies: 1 comment

Comment options

SandraGH5
Dec 6, 2021
Collaborator Author

You must be logged in to vote
0 replies
Answer selected by SandraGH5
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
1 participant