-
Notifications
You must be signed in to change notification settings - Fork 92
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Why clearml agent worker ignores PYTHONPATH and CLEARML_AGENT_EXTRA_PYTHON_PATH? #220
Comments
UPD |
Hi @gzzv, How are you passing the PYTHONPATH and the other env var? Can you include logs for this failure? |
Hi @jkhenning!
|
My values.yaml: imageCredentials:
enabled: true
existingSecret: regcredcloud
clearml:
existingAgentk8sglueSecret: agentk8sglue
agentk8sglue:
apiServerUrlReference: "https://example.com/"
fileServerUrlReference: "https://example.com/"
webServerUrlReference: "https://example.com/"
defaultContainerImage: test_image:0.0.1
queue: Processing
basePodTemplate:
env:
- name: CLEARML_AGENT_SKIP_CONTAINER_APT
value: 'true'
- name: CLEARML_AGENT_NO_UPDATE
value: '1'
- name: CLEARML_AGENT_SKIP_PYTHON_ENV_INSTALL
value: '1'
tolerations:
- key: "nvidia.com/gpu"
operator: Exists
effect: "NoSchedule"
resources:
limits:
nvidia.com/gpu: "1"
requests:
nvidia.com/gpu: "1" |
Logs: Using environment access key CLEARML_API_ACCESS_KEY=5X487RWJ4KAW3OB8AUS1OLFW062QDV
Using environment secret key CLEARML_API_SECRET_KEY=********
Current configuration (clearml_agent v1.9.2, location: /tmp/.clearml_agent.9_sf_1el.cfg):
----------------------
*****Long clearml-config description******
Executing task id [12572f62ea75456daa3128419a735a4a]:
repository =
branch =
version_num =
tag =
docker_cmd = swr.ru-moscow-1.hc.sbercloud.ru/srobotics-cheburator/cheburator:x86.0.0.31
entry_point = task_remote_run.py
working_dir = .
Running task id [12572f62ea75456daa3128419a735a4a]:
[.]$ /usr/bin/python3.10 -u /root/.clearml/venvs-builds/code/task_remote_run.py
Summary - installed python packages:
*****Long pip summary******
Environment setup completed successfully
Starting Task Execution:
Traceback (most recent call last):
File "/root/.clearml/venvs-builds/code/task_remote_run.py", line 19, in <module>
from cheburator_description.description_parameters import DescriptionParametersNoROS
ModuleNotFoundError: No module named 'cheburator_description' |
UPD |
I've manually added PYTHONPATH with value to values.yaml file and it works |
I would assume that this is because the PYTHONPATH env var is somehow added to bashrc or something that's loaded when you "manually" execute into the container, but for some reason not when the agent command is executed inside that container... |
I try to run remote clearml agent in k8s cluster. I have several python packages which are located in the different paths. This paths are specified in PYTHONPATH, but agent worker can't import this packages.
I also tried to add path to the CLEARML_AGENT_EXTRA_PYTHON_PATH, but again nothing. Can anyone help me and give me some advice?
The text was updated successfully, but these errors were encountered: