Processing job infra spin-up takes 16x longer than the job itself #4358

@jlonge4

Describe the feature you'd like
I am proposing some form of warm pool (similar to the one available for estimators) or a scheduled run that keeps the infrastructure up, so that a processing job takes only as long as the script itself runs. I am requesting this because my script runs for 30 s, but the job takes 8 min 17 s in total because of infrastructure allocation.
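For reference, here is a quick sketch of the arithmetic behind the numbers above. It uses only the two durations reported in this issue; the variable names are illustrative, and it assumes that everything outside the script runtime is infrastructure allocation.

```python
from datetime import timedelta

# Durations reported in this issue.
script_runtime = timedelta(seconds=30)
total_runtime = timedelta(minutes=8, seconds=17)

# Assumption: all time not spent in the script is infrastructure allocation.
provisioning_overhead = total_runtime - script_runtime

# Dividing one timedelta by another yields a plain float ratio.
total_vs_script = total_runtime / script_runtime
overhead_vs_script = provisioning_overhead / script_runtime

print(f"overhead: {provisioning_overhead.total_seconds():.0f} s")  # overhead: 467 s
print(f"total vs script: {total_vs_script:.1f}x")                  # total vs script: 16.6x
print(f"overhead vs script: {overhead_vs_script:.1f}x")            # overhead vs script: 15.6x
```

With a warm pool, the hope is that the overhead term shrinks toward zero for back-to-back runs, so the total approaches the 30 s script runtime.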

How would this feature be used? Please describe.
This feature would cut the total wall-clock time of processing jobs and reduce latency.

Describe alternatives you've considered
Alternatives I have considered are SageMaker notebook jobs and a Lambda container.

Additional context
The goal of my processing job is to read in a file, then process and index it (create vector embeddings and add them to a docstore) using the library of my choice (LangChain, Haystack, etc.).
