PerfectQueue is a highly available distributed queue built on top of RDBMS. PerfectQueue provides similar API to Amazon SQS, while PerfectQueue focuses on reliability and flexible scheduling rather than scalability.
PerfectQueue introduces following concepts:
- At-least-once semantics: Even if a worker node fails during processing a task, another worker takes over the task.
- Multiuser-aware fair scheduling: PerfectQueue schedules tasks submitted by users who have larger resource assignment.
- Decider: Decider is a simple mechanism to implement complex workflows on top of queues while keeping loose coupling.
- Graceful and live restarting: PerfectQueue continues processing of long-running tasks even during restarting.
- Idempotent task submission: All tasks have unique identifier and PerfectQueue prevents storing same task twice.
- Note: client applications should consider how to always generate a same string for a (semantically) same task.
- Idempotent task processing support: Workers can use a constant unique identifier to process tasks idempotently.
All you have to consider is implementing idempotent worker programs. PerfectQueue manages the other problems.
# open a queue
PerfectQueue.open(config, &block) #=> #<Queue>
# submit a task
Queue#submit(task_id, type, data, options={})
# poll a task
# (you don't have to use this method directly. see following sections)
Queue#poll #=> #<AcquiredTask>
# get data associated with a task
AcquiredTask#data #=> #<Hash>
# finish a task
AcquiredTask#finish!
# retry a task
AcquiredTask#retry!
# create a task reference
Queue#[](key) #=> #<Task>
# chack the existance of the task
Task#exists?
# force finish a task
# be aware that worker programs can't detect it
Task#force_finish!
TaskError
##
# Workers may get these errors:
#
AlreadyFinishedError < TaskError
PreemptedError < TaskError
ProcessStopError < RuntimeError
ImmediateProcessStopError < ProcessStopError
GracefulProcessStopError < ProcessStopError
##
# Client or other situation:
#
ConfigError < RuntimeError
NotFoundError < TaskError
AlreadyExistsError < TaskError
NotSupportedError < TaskError
# submit tasks
PerfectQueue.open(config) {|queue|
data = {'key'=>"value"}
queue.submit("task-id", "type1", data)
}
class TestHandler < PerfectQueue::Application::Base
# implement run method
def run
# do something ...
puts "acquired task: #{task.inspect}"
# call task.finish!, task.retry! or task.release!
task.finish!
end
end
class Dispatch < PerfectQueue::Application::Dispatch
# describe routing
route "type1" => TestHandler
route /^regexp-.*$/ => :TestHandler # String or Regexp => Class or Symbol
end
In a launcher script or rake file:
system('perfectqueue run -I. -rapp/workers/dispatch Dispatch')
or:
require 'perfectqueue'
require 'app/workers/dispatch'
PerfectQueue::Worker.run(Dispatch) {
# this method is called when the worker process is restarted
raw = File.read('config/perfectqueue.yml')
yml = YAJL.load(raw)
yml[ENV['RAILS_ENV'] || 'development']
}
- TERM: graceful shutdown
- QUIT: immediate shutdown
- USR1: graceful restart
- HUP: immediate restart
- USR2: reopen log files
- INT: detach process for live restarting
- type: backend type (required; see following sections)
- log: log file path (default: use stderr)
- processors: number of child processes (default: 1)
- processor_type: type of processor ('process' or 'thread') (default: 'process')
- poll_interval: interval to poll tasks in seconds (default: 1.0 sec)
- retention_time: duration to retain finished tasks (default: 300 sec)
- task_heartbeat_interval: interval to send heartbeat requests (default: 2 sec)
- alive_time: duration to continue a heartbeat request (default: 300 sec)
- retry_wait: duration to retry a retried task (default: 300 sec)
- child_kill_interval: interval to send signals to a child process (default: 2.0 sec)
- child_graceful_kill_limit: threshold time to switch SIGTERM to SIGKILL (default: never)
- child_heartbeat_interval: interval to send heartbeat packets to a child process (default: 2 sec)
- child_heartbeat_limit: threshold time to detect freeze of a child process (default: 10.0 sec)
- detach_wait: sleep for this seconds before detaching process for live restarting (default: 10.0 sec)
additional configuration:
- url: URL to the RDBMS (example: 'mysql://user:password@host:port/database')
- table: name of the table to use
Not implemented yet.
development:
type: rdb_compat
url: mysql2://root:@localhost:3306/perfectqueue
table: queues
Usage: perfectqueue [options] <command>
commands:
list Show list of tasks
submit <key> <type> <data> Submit a new task
force_finish <key> Force finish a task
run <class> Run a worker process
init Initialize a backend database
options:
-e, --environment ENV Framework environment (default: development)
-c, --config PATH.yml Path to a configuration file (default: config/perfectqueue.yml)
options for submit:
-u, --user USER Set user
-t, --time UNIXTIME Set time to run the task
options for run:
-I, --include PATH Add $LOAD_PATH directory
-r, --require PATH Require files before starting
# assume that the config/perfectqueue.yml exists
$ perfectqueue init
$ perfectqueue submit k1 user_task '{"uid":1}' -u user_1
$ perfectqueue list
key type user status created_at timeout data
k1 user_task user_1 waiting 2012-05-18 13:05:31 -0700 2012-05-18 14:27:36 -0700 {"uid"=>1, "type"=>"user_task"}
k2 user_task user_2 waiting 2012-05-18 13:35:33 -0700 2012-05-18 14:35:33 -0700 {"uid"=>2, "type"=>"user_task"}
k3 system_task waiting 2012-05-18 14:04:02 -0700 2012-05-22 15:04:02 -0700 {"task_id"=>32, "type"=>"system_task"}
3 entries.
$ perfectqueue force_finish k2
$ perfectqueue run -I. -Ilib -rconfig/boot.rb -rapps/workers/task_dispatch.rb TaskDispatch
Running spec utilize 'mysql2://root:@localhost/perfectqueue_test' as the connection string. Please install MySQL server at localhost then run;
$ mysql -h localhost -u root -e 'create database perfectqueue_test;'
You can run spec.
$ bundle exec rake spec