ElasticManager: Uni-directional communication in ElasticManager? #8

Open
@marius311


I'm using ClusterManagers.jl on a cluster where the compute nodes (which run the workers) can connect to the login nodes (which run the master process), but connections in the other direction are blocked by firewalls. The current code that sets up an ElasticManager first has each worker connect to the master, which works. Then, via code in Base.Distributed, the master establishes a second connection back to each worker, which in my case is blocked. I'm wondering:

  1. What's the reason for the second connection? Naively, it seems unnecessary to me, since we already have one.

  2. Is there any reason we can't just use the original worker->master connection for all communication? I've been hacking at the code a bit and it kind of seems to work, but I was wondering whether there's a showstopper I should know about before spending more energy getting it fully working.
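
For illustration, here is a minimal sketch of the idea in question 2, written in Python rather than Julia so that it stands alone. It only shows that a single TCP connection opened by the worker is full-duplex, so the master can send requests back over it without ever dialing the worker. The line-based protocol and the names `recv_line`, `worker`, and `run_demo` are hypothetical; this is not how Distributed or ElasticManager actually frames its messages.

```python
import socket
import threading


def recv_line(sock):
    """Read bytes up to a newline and return the decoded line without it."""
    buf = b""
    while not buf.endswith(b"\n"):
        chunk = sock.recv(1)
        if not chunk:
            raise ConnectionError("peer closed the connection")
        buf += chunk
    return buf[:-1].decode()


def worker(master_addr):
    # The worker dials out: the only direction the firewall allows.
    with socket.create_connection(master_addr) as sock:
        sock.sendall(b"hello from worker\n")
        # It then serves requests that arrive over the same socket.
        while True:
            request = recv_line(sock)
            if request == "quit":
                break
            sock.sendall(f"result: {request.upper()}\n".encode())


def run_demo():
    # The master only listens (on an ephemeral local port); it never connects out.
    listener = socket.create_server(("127.0.0.1", 0))
    addr = listener.getsockname()
    t = threading.Thread(target=worker, args=(addr,))
    t.start()
    conn, _ = listener.accept()
    with conn:
        greeting = recv_line(conn)       # worker -> master
        conn.sendall(b"do some work\n")  # master -> worker, same connection
        reply = recv_line(conn)          # worker -> master again
        conn.sendall(b"quit\n")
    t.join()
    listener.close()
    return greeting, reply
```

Calling `run_demo()` returns the worker's greeting and its reply to the master's request, both carried over the one worker-initiated connection. The sketch says nothing about whether Distributed's internals assume a separate master-initiated connection, which is the open question here.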

Thanks.
