openstack-queue

a batch job queueing system for OpenStack-based cloud services

building and installing

Here are some steps for building and installing on Ubuntu Trusty. Easily transferrable to other distros!

Get the dependencies: apt-get install maven2 openjdk-7-jdk redis-server
Clone this repository: git clone https://github.com/jasonrig/openstack-queue.git
Build the source: mvn install

Move the jars somewhere you think is suitable, e.g. /opt/openstack-queue/:

mkdir /opt/openstack-queue
cp ./target/openstack-queue-0.0.1-SNAPSHOT.jar /opt/openstack-queue/
cp -R ./target/dependency-jars/ /opt/openstack-queue/

Create a directory for configuration files: mkdir /etc/openstack-queue
Create a user account under which the queue will run: sudo useradd -d /etc/openstack-queue -r -s /bin/bash openstack-queue
Create a configuration file (warning: should be readable only by root and the openstack-queue user or else other users may take control of your VMs). Use the openstack-queue.properties example file as a template. It should be stored in the home directory of the openstack-queue user, i.e. /etc/openstack-queue, and must be called openstack-queue.properties.
Create a directory for ssh authorised keys, allowing nodes to log in to the queue server node: mkdir /etc/openstack-queue/.ssh
Set appropriate permissions for the .ssh directory: chown openstack-queue:openstack-queue /etc/openstack-queue/.ssh && chmod 700 /etc/openstack-queue/.ssh

If you run the queue interactively, i.e. not as a service, the configuration file location should be as described by the Apache Commons Configuration docs (http://commons.apache.org/proper/commons-configuration/userguide/howto_properties.html):
- in the current directory
- in the user home directory
- in the classpath
Create an upstart script so you can run this as a service. Here is an example (/etc/init/openstack-queue.conf):
```
description     "OpenStack Queue"
stop on runlevel [!2345]

umask 002

script
    mkdir -p /tmp/openstack-queue-tmp/
    chown openstack-queue:root /tmp/openstack-queue-tmp/
    chmod 700 /tmp/openstack-queue-tmp
    cd /tmp/openstack-queue-tmp/
    su -p -s /bin/bash -c "/usr/bin/java -jar /opt/openstack-queue/openstack-queue-0.0.1-SNAPSHOT.jar" openstack-queue >> /var/log/openstack-queue.log 2>&1
end script
```
This will create a space in /tmp where openstack-queue will store its temporary files, and log to /var/log/openstack-queue.log. This being a development version, lots of logs are produced and this file will get big quickly. To reduce the verbosity, edit src/main/resources/logback.xml and rebuild the jar.
Start the required services:

service redis-server start
service openstack-queue start

Notes on security: This queue trusts the users to do the right thing! Any user may delete or submit or even terminate the queue by sending the correct commands to the redis message queue. The best way to implement this as far as I see it is to use redis authentication and then create a client that will enforce whatever security policies you want. This has not been done yet!

notes on logging

The upstart script above provides basic logging, however in its default state, DEBUG level logging is generated. Therefore, this file grows rapidly. More sophisticated logging (including log rotation) can be achieved using svlogd, which is part of the runit package in Ubuntu. Setting this up is fairly straightforward.

Create a logging directory (e.g. mkdir /var/log/openstack-queue)
Set the ownership appropriately (e.g. chown openstack-queue:root /var/log/openstack-queue)
Edit the upstart config file, replacing the file redirection to /var/log/openstack-queue.log with a pipe to svlogd /var/log/openstack-queue/

In the end, you should have a log file called "current" located in /var/log/openstack-queue with default log rotation options. These can be specified explicitly by creating a file called /var/log/openstack-queue/config -- see svlogd docs (man svlogd)

emulating PBS(ish) qsub, qstat and qterm

Here are some crude examples of how to emulate qsub, qstat and qterm in my own special way. Yes, I use PHP... sorry. But it works!

Get the dependencies: apt-get install php5-cli php5-redis php5-json
Create scripts as desired:

qstat

#!/usr/bin/php
<?php

function processRedisMessage($redis, $channel, $message) {
        $redis->close();
        $queueStatus = json_decode($message);

        echo "Job ID\tName\tStart time\tLatest finish\tStatus\n";
        foreach ($queueStatus as &$job) {
                $job = get_object_vars($job);
                $startTime = ($job['startTime']==0)?"not started":date("r", floor($job['startTime'] / 1000));
                $latestFinish = ($job['timeLimit'] == 0)?"unlimited":date("r", floor(($job['startTime']+$job['timeLimit']) / 1000));
                echo $job['id']."\t".$job['name']."\t".$startTime."\t".$latestFinish."\t".$job['status']."\n";
        }

}

$redis=new Redis() or die("PHP redis package not available.");
$redis->connect('127.0.0.1');

echo "Waiting for queue status...\n";
$redis->subscribe(array('jobqueue-status'), 'processRedisMessage');

?>

qsub

#!/usr/bin/php
<?php

$redis=new Redis() or die("PHP redis package not available.");
$redis->connect('127.0.0.1');

$jobData = file_get_contents($argv[1]);
$redis->publish('jobqueue-submit', $jobData);

?>

qdel

#!/usr/bin/php
<?php

$redis=new Redis() or die("PHP redis package not available.");
$redis->connect('127.0.0.1');

$redis->publish('jobqueue-admin', "killjob ".$argv[1]);

?>

qterm

#!/usr/bin/php
<?php

$redis=new Redis() or die("PHP redis package not available.");
$redis->connect('127.0.0.1');

$redis->publish('jobqueue-admin', "shutdown");

?>

a "hello world" MPI job

Get the build dependencies: apt-get install build-essential libopenmpi-dev openmpi-bin

Build the following MPI code (taken from http://mpitutorial.com/mpi-hello-world/) using mpicc:

#include <mpi.h>

int main(int argc, char** argv) {
 // Initialize the MPI environment
 MPI_Init(NULL, NULL);

 // Get the number of processes
 int world_size;
 MPI_Comm_size(MPI_COMM_WORLD, &world_size);

 // Get the rank of the process
 int world_rank;
 MPI_Comm_rank(MPI_COMM_WORLD, &world_rank);

 // Get the name of the processor
 char processor_name[MPI_MAX_PROCESSOR_NAME];
 int name_len;
 MPI_Get_processor_name(processor_name, &name_len);

 // Print off a hello world message
 printf("Hello world from processor %s, rank %d"
        " out of %d processors\n",
        processor_name, world_rank, world_size);

 // Finalize the MPI environment.
 MPI_Finalize();
}

Create a submit json file (note: logPath and resultsPath must be writable by the openstack-queue user or group):

{ 'groupName':'hello', 'minNodes':1, 'maxNodes':1, 'minNodeSize':8, 'logPath':'/somewhere/to/store/server/logs/', 'resultsPath':'/somewhere/to/copy/results/files/', 'payloadFiles':['/path/to/compiled/program/a.out'], 'bootstrapScript':'apt-get -y update && apt-get -y install openmpi-bin', 'executeScript':'mpirun --hostfile /tmp/hostfile /tmp/a.out', 'cleanupScript':'' }

```

Submit the json file, e.g. qsub myjob.json
Monitor the logs to see it in action tail -f /var/log/openstack-queue.log or check with qstat

some notes on the json file

The example given above is about the most minimal set of parameters required for something to run. The fields are as follows:

groupName: The name given to the server group used for this calculation. Synonymous with a job name.
minNodes: The smallest acceptable number of nodes before this job can run
maxNodes: The maximum number of nodes that can be allocated. The queue aims for this number.
minNodeSize: The smallest number of CPUs in the nodes created
logPath: Where stdout and stderr for scripts executed on the nodes are stored (two files per node)
resultsPath: Any data stored in /tmp/copyback on the nodes created will be returned to this directory in gzipped tarballs
payloadFiles: Data required for the computation that is copied to every node created
bootstrapScript: A script run after the payload is copied but before the job begins. Suitable for installing dependencies.
executeScript: A script run after boostrapping is complete in order to run the real calculation
cleanupScript: Run after the execute script is complete. Suitable for putting relevant results files into /tmp/copyback

There are more possible fields that are not listed here. See the javadoc for jrigby.openstack_queue.request_templates.ThreeStageJobResourceRequestJsonMessage for a complete list.

more notes

All scripts run as root on the compute nodes created. This isn't a big issue since they generally don't have privileged access to anything outside of the confines of the calculation/job you're running.
Payload files are given 777 permissions
The IP address of each participating node is printed at the top of each server log file
The private key for the default user of the VM image is also written to the logPath. This should be kept somewhere secret where only you and the openstack-queue user may read/write, otherwise someone could take over your VMs. Not a huge issue in the end though, since these servers only last for the duration of the job.

Name		Name	Last commit message	Last commit date
Latest commit History 35 Commits
.idea/libraries		.idea/libraries
.settings		.settings
doc		doc
src/main		src/main
.classpath		.classpath
.gitignore		.gitignore
.project		.project
LICENSE		LICENSE
README.md		README.md
openstack-queue.properties		openstack-queue.properties
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

openstack-queue

building and installing

notes on logging

emulating PBS(ish) qsub, qstat and qterm

a "hello world" MPI job

some notes on the json file

more notes

About

Releases

Packages

Languages

License

jasonrig/openstack-queue

Folders and files

Latest commit

History

Repository files navigation

openstack-queue

building and installing

notes on logging

emulating PBS(ish) qsub, qstat and qterm

a "hello world" MPI job

some notes on the json file

more notes

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages