Improvements for the indexer manager #11

Open
1 of 5 tasks
DSuveges opened this issue Dec 9, 2019 · 1 comment

Comments

@DSuveges
Contributor

DSuveges commented Dec 9, 2019

Implementing the following features would improve the performance of the indexer manager significantly:

  • Before jobs are submitted to LSF, the log directory should be wiped. If log files are overwritten, it is more difficult to parse failed jobs.
  • When checking job statuses, the script needs to be more resilient to missing statuses.
  • When the script finds a failed job, it should check why the job failed and take appropriate action.
  • If a study is stuck in UNKWN (unknown) status, it should be killed and resubmitted.
  • Certain seemingly active jobs with RUN status are just hanging. I think it makes sense to implement a test to see whether the job is still actively running: if a job doesn't do anything for days, it should be restarted.
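
The status-handling rules in the list above could be sketched roughly as follows. This is only an illustration, not the indexer manager's actual code: the function names, the action labels, the `<job id> <status>` input format, the two-day stall threshold, and the choice to flag a missing status for inspection are all assumptions.

```python
from datetime import datetime, timedelta
from typing import Dict, Iterable, Optional

# Hypothetical action labels; the real indexer manager may use different ones.
WAIT, RESUBMIT, INSPECT, FINISHED = "wait", "resubmit", "inspect", "finished"


def parse_statuses(lines: Iterable[str]) -> Dict[str, str]:
    """Parse '<job id> <status>' lines, skipping blank or malformed ones."""
    statuses = {}
    for line in lines:
        fields = line.split()
        if len(fields) >= 2:
            statuses[fields[0]] = fields[1]
    return statuses


def decide_action(
    status: Optional[str],
    last_activity: Optional[datetime],
    now: datetime,
    stall_after: timedelta = timedelta(days=2),  # assumed threshold
) -> str:
    """Map a job status (None if missing) to one of the action labels."""
    if status is None:
        # The job is missing from the status report (assumed handling).
        return INSPECT
    if status == "DONE":
        return FINISHED
    if status == "EXIT":
        # Failed job: the cause has to be looked up before acting.
        return INSPECT
    if status == "UNKWN":
        # Stuck in unknown state: kill and resubmit.
        return RESUBMIT
    if (
        status == "RUN"
        and last_activity is not None
        and now - last_activity > stall_after
    ):
        # Nominally running, but no recorded activity for too long.
        return RESUBMIT
    return WAIT
```

Here `last_activity` could, for instance, be the modification time of the job's log file, and looking statuses up with `statuses.get(job_id)` yields `None` for jobs that are missing from the report instead of raising an error.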
@DSuveges
Contributor Author

Deleting old log files is now part of the data release plan: before the jobs are submitted, old log files are deleted as part of the Solr indexing tasks.
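
For reference, a minimal sketch of such a cleanup step is shown below. The function name and the `*.out` / `*.err` file patterns are assumptions for illustration; the actual release task may remove the files differently.

```python
from pathlib import Path
from typing import Iterable


def clear_log_files(
    log_dir: Path,
    patterns: Iterable[str] = ("*.out", "*.err"),  # assumed log file patterns
) -> int:
    """Delete matching files directly inside log_dir; return how many were removed."""
    removed = 0
    if not log_dir.is_dir():
        # Nothing to clean if the directory does not exist yet.
        return removed
    for pattern in patterns:
        for path in log_dir.glob(pattern):
            if path.is_file():
                path.unlink()
                removed += 1
    return removed
```

Restricting the deletion to known patterns, rather than removing the whole directory, leaves any unrelated files in place.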
