processing job infra spin up takes 16x longer than the job itself 

**Describe the feature you'd like**
The feature I am proposing would be to implement some sort of warm pool (similar to estimator) or scheduled run to keep infra up to allow the processing job to take only as long as the script runtime. Requesting this due to my 30s script runtime taking 8min 17s total bc of infra allocation.

**How would this feature be used? Please describe.**
This feature would be used to cut down on processing time and reduce latency

**Describe alternatives you've considered**
Alternatives I have considered are sagemaker notebook jobs, and lambda container.

**Additional context**
The goal of my processing job would read in a file, process and index it (create vector embeddings and add to docstore) using the library of my choice (langchain, haystack, etc)


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

processing job infra spin up takes 16x longer than the job itself #4358

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

processing job infra spin up takes 16x longer than the job itself #4358

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions