-
Notifications
You must be signed in to change notification settings - Fork 10
v2.5.0 and Earlier
This page is for version 2.5.0, which provides an older, deprecated Command API. Documentation for versions 2.6.0 and later are available in the rest of the wiki, starting at Home.
Version 2.6.0 is a backwards compatible release, so functionality describe here will still work, but moving to the newer SyncCommand
and AsyncCommand
base classes is preferred.
What dangerous code might you wrap in a Mjolnir Command?
- Network operations (inter-cluster, database, cache, search, etc.).
- Operations that use files or read/write from/to disk.
- Long-running or high-resource (CPU, Memory) operations.
Installation is fairly minimal - just grab Hudl.Mjolnir from NuGet (www.nuget.org) using your Package Manager GUI or Console.
Mjolnir requires you to specify a configuration provider. The configurable values within the app all have sensible defaults, but you'll want to adjust them after observing how your application behaves.
Configuration
Mjolnir ships with a configuration provider implementation that can read configuration values from a file; set the provider on application startup (before you use any Commands):
using Hudl.Config;
//...
ConfigProvider.UseProvider(new FileConfigurationProvider(@"c:\path\to", "config-file.txt"));
Configuration file contents are just Key=Value
pairs, e.g.:
mjolnir.command.core-client.GetUser.Timeout=5000
mjolnir.pools.core-user.threadCount=20
If a specific configuration value (e.g. for a breaker or command) isn't defined, you can also define fallback defaults via config, e.g.:
mjolnir.breaker.default.windowMillis=10000
mjolnir.pools.default.threadCount=10
See the different sections of this README for available Configuration.
If you don't set a configuration provider, default values (defined in the code) will be used for everything in Mjolnir (which isn't ideal, because you'll want to tune components and commands for your application).
Metrics/Stats
You can also inject your own metrics handler. We typically use Riemann, but other services like statsd should work well, also.
You'll need to implement Hudl.Mjolnir.External.IStats
and set it on CommandContext
.
using Hudl.Mjolnir;
using Hudl.Mjolnir.External;
//...
CommandContext.Stats = new MyStats();
You'll want to set CommandContext.Stats
early on application startup; breakers and pools will cache their stats implementations, and won't pick up a new one if you set it after they've been created.
See the list of available metrics.
SmartThreadPool dependency
Mjolnir uses SmartThreadPool, but has to package and reference it directly. See Issue #16.
Commands are the heart of Mjolnir, and are how you use its protections. You can either:
- extend
Command<TResult>
and put dangerous code in itsExecuteAsync()
method, or - add
[Command]
to aninterface
, which wraps all of its methods in aCommand
.
Extend Command<TResult>
Example
public class GetUserCommand : Command<UserDto>
{
private readonly string _userId;
public GetUserCommand(string userId)
: base("core-client", "core-user", TimeSpan.FromMilliseconds(15000))
{
_userId = userId;
}
protected override Task<UserDto> ExecuteAsync(CancellationToken token)
{
// Go over the network to query the UserDto using _userId.
}
}
Constructor
Command
needs to be constructed with a few required values. Here are the base
constructor signatures:
// group: A logical grouping for the command. Commands within the same
// client package or collection of commands typically get grouped
// together.
// breakerKey: The named circuit breaker to use for the Command.
// poolKey: The named thread pool to use for the Command.
// defaultTimeout: If not overridden via config, the timeout after which the
// Command will be cancelled.
Command(string group, string breakerKey, string poolKey, TimeSpan defaultTimeout)
// isolationKey: Sets both the breakerKey and poolKey.
Command(string group, string isolationKey, TimeSpan defaultTimeout)
For more information on the keys, see Circuit Breakers and Thread Pools.
[Command]
attribute
Because extending Command
can get very boilerplatey for lots of service calls, you can use Mjolnir's [Command]
attribute if you're okay with sacrificing a little flexibility. The attribute lets you wrap each method of an interface
within a Command
.
Example
[Command("core-client", "core-user", 15000)]
public interface IUserService
{
UserDto GetUser(string userId, CancellationToken? token = null);
void UpdateUser(UserDto userDto, CancellationToken? token = null);
}
public class UserService : IUserService { /* implementation */ }
// ...
void Main() {
// This is typically done in a service locator of some sort and cached.
var proxy = CommandInterceptor.CreateProxy<IUserService>(new UserService());
var userDto = proxy.GetUser("1234");
...
}
Using [Command]
provides the same benefits as extending Command<TResult>
, but can be more convenient.
Note the presence of CancellationToken
s on the interface methods. If your interface method signature contains a CancellationToken
(which is optional), Mjolnir will pass a token created from the timeout through to your method, allowing you to cooperatively cancel your operation. If no CancellationToken
parameter is present, the Command timeout may be less effective. Mjolnir will only pass its token through if it doesn't already see a non-null or non-empty token as the parameter value.
Fallbacks aren't supported when using [Command]
.
If you defined the Command by extending Command<TResult>
, you can invoke it using InvokeAsync()
:
var result = await new GetUserCommand("1234").InvokeAsync();
Though InvokeAsync()
is preferred, we realize that it's difficult to convert lower-level code to use async
. For those situations, you can use Invoke()
, which safely unwraps and returns the result.
var result = new GetUserCommand("1234").Invoke();
If you defined the Command using [Command]
, you can simply call the methods.
[Command("foo", "bar")]
public interface ICommandInterface {
Task<bool> MyMethod(CancellationToken? token = null);
}
//...
// Assumes your service locator has used CommandInterceptor.CreateProxy() on ICommandInterface.
var result = myServiceLocator.GetService<ICommandInterface>().MyMethod();
See CommandInterceptor
for details on how different method return types affect the way the Command behaves when using [Command]
.
Thread pools help guard against one type of operation consuming more than its share of resources and starving out other operations.
Example:
Imagine we have an operation that makes a network call to a different cluster. If that downstream cluster becomes very slow, our calls will start blocking and waiting for responses.
Under high enough traffic volume, our cluster will start building up pending operations that are waiting for the downstream cluster to respond, which means they're taking up increasingly more threads - potentially as many as they can - leaving fewer threads for normal, unrelated operations to work with.
To prevent this, Commands are grouped into thread pools. Each thread pool receives a fixed number of threads to work with, along with a small queue in front of it.
When the thread pool is at capacity, operations will begin getting rejected from the pool, resulting in an immediately-thrown CommandFailedException
.
Configuration + Defaults
# Number of threads to allocate.
mjolnir.pools.<pool-key>.threadCount=10
mjolnir.pools.default.threadCount=10
# Length of the queue that fronts the pool.
mjolnir.pools.<pool-key>.queueLength=10
mjolnir.pools.default.queueLength=10
Changing these values requires an application restart (i.e. pools don't dynamically resize after creation).
Breakers track the success/failure rates of operations, and trip if the failure rate exceeds a configured threshold. A tripped breaker immediately rejects operations that attempt to go through it.
After a configured period, the breaker sends a test operation through. If the operation succeeds, the breaker fixes itself and allows operations again.
Configuration + Defaults
# The minimum operation count the breaker must see before considering tripping.
mjolnir.breaker.<breaker-key>.minimumOperations=10
mjolnir.breaker.default.minimumOperations=10
# The error percentage at which the breaker should trip.
mjolnir.breaker.<breaker-key>.thresholdPercentage=50
mjolnir.breaker.default.thresholdPercentage=50
# When the breaker trips, the duration to wait before allowing a test operation.
mjolnir.breaker.<breaker-key>.trippedDurationMillis=10000
mjolnir.breaker.default.trippedDurationMillis=10000
# Forces the breaker tripped. Takes precedence over forceFixed.
mjolnir.breaker.<breaker-key>.forceTripped=false
mjolnir.breaker.default.forceTripped=false
# Forces the breaker fixed.
mjolnir.breaker.<breaker-key>.forceFixed=false
mjolnir.breaker.default.forceFixed=false
# Period to accumulate metrics in. Resets at the end of every window.
mjolnir.metrics.<breaker-key>.windowMillis=30000
mjolnir.metrics.default.windowMillis=30000
# How long to cache the metrics snapshot that breakers read. Probably doesn't need adjusting.
mjolnir.metrics.<breaker-key>.snapshotTtlMillis=1000
mjolnir.metrics.default.snapshotTtlMillis=1000
These values can be changed at runtime.
Note: Fallbacks aren't very proven out yet, and need some more work. For now, we recommend wrapping your command invocations in try/catch blocks and falling back that way.
A Command
can optionally define a Fallback()
implementation. The fallback will execute if the command's ExecuteAsync()
throws an Exception.
[Command]
attributes do not support fallbacks - you must extend Command<TResult>
to implement one.
Examples of what a fallback might do:
- Return an empty collection or
null
- Return a default value
- Query from a snapshot/nightly database or cache
Every command supports cancellation using a CancellationToken.
Cancellation is cooperative. Mjolnir will not terminate/abort threads when the timeout is reached. Instead, it relies on implementations to use the CancellationToken
or pass it through to things that support it (e.g. network operations). ExecuteAsync(CancellationToken token)
receives the token as an argument for this reason.
If using [Command]
, Mjolnir will attempt to pass its CancellationToken
through. The token will be passed if the method
- has a
CancellationToken
orCancellationToken?
parameter, and - the value for that parameter is null or
CancellationToken.None
If the method does not have a CancellationToken
parameter, you lose the timeout protections Mjolnir provides.
Configuration + Defaults
# Timeouts do not have a global default value. The default is provided via the
# implementation's constructor.
command.<name>.Timeout=<millis>
See Command Names for details on how command names are generated.
Commands automatically receive a Name
property that's used for configuring the command and tracking the command's metrics.
The Name
is built from the Command's group and a generated component. If the Command's group is "my-group", names will look like:
- If you extend
Command<TResult>
, the name will be your Command class name, minus any command suffix. Examples:-
class MyFooCommand : Command<int>
=> "my-group.MyFoo" -
class MyCommandThatFoos : Command<int>
=> "my-group.MyCommandThatFoos"
-
- If you use
[Command]
, the name will be built from the interface name and method name, separated by a dash. Examples:-
IMyFooInterface.MyBarMethod()
=> "my-group.IMyFooInterface-MyBarMethod"
-
Command names will always have exactly one dot (.
) separator. Configuring a Command's timeout, therefore, might look like:
command.my-group.MyFoo.Timeout=15000
The default timeouts are fairly permissive; timeouts should be tuned after observing the Command's typical behavior in production.
- The Netflix Engineering team, authors of Hystrix OSS, which inspired much of Mjolnir.
- Michael Nygard, for Release It! Design and Deploy Production-Ready Software, which does a fantastic job of discussing patterns like circuit breakers and bulkheads. Buy it, it's good.
- Ami Bar, for SmartThreadPool.
-
Stephen Cleary, author of many great articles and StackOverflow answers about
async
/await
and TAP.