- Introduction
- Documentation
- Requirements
- Installation
- Support Me
- General Information
- Options
- Locks
- Conflict Strategy
- Usage
- Debugging
- Communication
- Testing
- Contributing
- Contributors
The goal of this gem is to ensure your Sidekiq jobs are unique. We do this by creating unique keys in Redis based on how you configure uniqueness.
This is the documentation for the master branch. You can find the documentation for each release by navigating to its tag: https://github.com/mhenrixon/sidekiq-unique-jobs/tree/v5.0.10.
Below are links to the latest major versions (4 & 5):
See https://github.com/mperham/sidekiq#requirements for what is required. Starting from 5.0.0 only sidekiq >= 4 is supported and support for MRI <= 2.1 is dropped. ActiveJob is not supported
Version 6 requires Redis >= 3 and pure Sidekiq, no ActiveJob supported anymore. See About ActiveJob for why.
Add this line to your application's Gemfile:
gem 'sidekiq-unique-jobs'
And then execute:
$ bundle
Or install it yourself as:
$ gem install sidekiq-unique-jobs
Want to show me some ❤️ for the hard work I do on this gem? You can use the following PayPal link https://paypal.me/mhenrixon. Any amount is welcome and let me tell you it feels good to be appreciated. Even a dollar makes me super excited about all of this.
See Interaction w/ Sidekiq on how the gem interacts with Sidekiq.
See Locking & Unlocking for an overview of the differences on when the various lock types are locked and unlocked.
This is probably not the configuration option you want...
Since the client and the server are disconnected and not running inside the same process, setting a lock expiration is probably not what you want. Any keys that are used by this gem WILL be removed at the time of the expiration. For jobs that are scheduled in the future the key will expire when that job is scheduled + whatever expiration you have set.
In previous versions there was a default expiration of 30 minutes which didn't work for a lot of long running jobs. Since version 6 there will be no expiration of any jobs from the default configuration. Please don't use lock_expiration
unless you really know what you are doing.
sidekiq_options lock_expiration: nil # default - don't expire keys
sidekiq_options lock_expiration: 20.days # expire this lock in 20 days
This is the timeout (how long to wait) when creating the lock. By default we don't use a timeout so we won't wait for the lock to be created. If you want it is possible to set this like below.
sidekiq_options lock_timeout: 0 # default - don't wait at all
sidekiq_options lock_timeout: 5 # wait 5 seconds
sidekiq_options lock_timeout: nil # lock indefinitely, this process won't continue until it gets a lock. VERY DANGEROUS!!
This configuration option is slightly misleading. It doesn't disregard the queue on other jobs. Just on itself, this means that a worker that might schedule jobs into multiple queues will be able to have uniqueness enforced on all queues it is pushed to.
class Worker
include Sidekiq::Worker
sidekiq_options unique_across_queues: true, queue: 'default'
def perform(args); end
end
Now if you push override the queue with Worker.set(queue: 'another').perform_async(1)
it will still be considered unique when compared to Worker.perform_async(1)
(that was actually pushed to the queue default
).
This configuration option is slightly misleading. It doesn't disregard the worker class on other jobs. Just on itself, this means that a worker that the worker class won't be used for generating the unique digest. The only way this option really makes sense is when you want to have uniqueness between two different worker classes.
class WorkerOne
include Sidekiq::Worker
sidekiq_options unique_across_workers: true, queue: 'default'
def perform(args); end
end
class WorkerTwo
include Sidekiq::Worker
sidekiq_options unique_across_workers: true, queue: 'default'
def perform(args); end
end
WorkerOne.perform_async(1)
# => 'the jobs unique id'
WorkerTwo.perform_async(1)
# => nil because WorkerOne just stole the lock
Locks from when the client pushes the job to the queue. Will be unlocked before the server starts processing the job.
NOTE this is probably not so good for jobs that shouldn't be running simultaneously (aka slow jobs).
sidekiq_options lock: :until_executing
Locks from when the client pushes the job to the queue. Will be unlocked when the server has successfully processed the job.
sidekiq_options lock: :until_executed
Locks from when the client pushes the job to the queue. Will be unlocked when the specified timeout has been reached.
sidekiq_options lock: :until_expired
Locks when the client pushes the job to the queue. The queue will be unlocked when the server starts processing the job. The server then goes on to creating a runtime lock for the job to prevent simultaneous jobs from being executed. As soon as the server starts processing a job, the client can push the same job to the queue.
sidekiq_options lock: :until_and_while_executing
With this lock type it is possible to put any number of these jobs on the queue, but as the server pops the job from the queue it will create a lock and then wait until other locks are done processing. It looks like multiple jobs are running at the same time but in fact the second job will only be waiting for the first job to finish.
NOTE Unless this job is configured with a lock_timeout: nil
or lock_timeout: > 0
then all jobs that are attempted to be executed will just be dropped without waiting.
sidekiq_options lock: :while_executing, lock_timeout: nil
There is an example of this to try it out in the rails_example
application. Run foreman start
in the root of the directory and open the url: localhost:5000/work/duplicate_while_executing
.
In the console you should see something like:
0:32:24 worker.1 | 2017-04-23T08:32:24.955Z 84404 TID-ougq4thko WhileExecutingWorker JID-400ec51c9523f41cd4a35058 INFO: start
10:32:24 worker.1 | 2017-04-23T08:32:24.956Z 84404 TID-ougq8csew WhileExecutingWorker JID-8d6d9168368eedaed7f75763 INFO: start
10:32:24 worker.1 | 2017-04-23T08:32:24.957Z 84404 TID-ougq8crt8 WhileExecutingWorker JID-affcd079094c9b26e8b9ba60 INFO: start
10:32:24 worker.1 | 2017-04-23T08:32:24.959Z 84404 TID-ougq8cs8s WhileExecutingWorker JID-9e197460c067b22eb1b5d07f INFO: start
10:32:24 worker.1 | 2017-04-23T08:32:24.959Z 84404 TID-ougq4thko WhileExecutingWorker JID-400ec51c9523f41cd4a35058 WhileExecutingWorker INFO: perform(1, 2)
10:32:34 worker.1 | 2017-04-23T08:32:34.964Z 84404 TID-ougq4thko WhileExecutingWorker JID-400ec51c9523f41cd4a35058 INFO: done: 10.009 sec
10:32:34 worker.1 | 2017-04-23T08:32:34.965Z 84404 TID-ougq8csew WhileExecutingWorker JID-8d6d9168368eedaed7f75763 WhileExecutingWorker INFO: perform(1, 2)
10:32:44 worker.1 | 2017-04-23T08:32:44.965Z 84404 TID-ougq8crt8 WhileExecutingWorker JID-affcd079094c9b26e8b9ba60 WhileExecutingWorker INFO: perform(1, 2)
10:32:44 worker.1 | 2017-04-23T08:32:44.965Z 84404 TID-ougq8csew WhileExecutingWorker JID-8d6d9168368eedaed7f75763 INFO: done: 20.009 sec
10:32:54 worker.1 | 2017-04-23T08:32:54.970Z 84404 TID-ougq8cs8s WhileExecutingWorker JID-9e197460c067b22eb1b5d07f WhileExecutingWorker INFO: perform(1, 2)
10:32:54 worker.1 | 2017-04-23T08:32:54.969Z 84404 TID-ougq8crt8 WhileExecutingWorker JID-affcd079094c9b26e8b9ba60 INFO: done: 30.012 sec
10:33:04 worker.1 | 2017-04-23T08:33:04.973Z 84404 TID-ougq8cs8s WhileExecutingWorker JID-9e197460c067b22eb1b5d07f INFO: done: 40.014 sec
Decides how we handle conflict. We can either reject the job to the dead queue or reschedule it. Both are useful for jobs that absolutely need to run and have been configured to use the lock WhileExecuting
that is used only by the sidekiq server process.
The last one is log which can be be used with the lock UntilExecuted
and UntilExpired
. Now we write a log entry saying the job could not be pushed because it is a duplicate of another job with the same arguments
This strategy is intended to be used with UntilExecuted
and UntilExpired
. It will log a line about that this is job is a duplicate of another.
sidekiq_options lock: :until_executed, on_conflict: :log
This strategy is intended to be used with WhileExecuting
. Basically it will allow us to let the server process crash with a specific error message and be retried without messing up the Sidekiq stats.
sidekiq_options lock: :while_executing, on_conflict: :raise, retry: 10
This strategy is intended to be used with WhileExecuting
and will push the job to the dead queue on conflict.
sidekiq_options lock: :while_executing, on_conflict: :reject
This strategy is intended to be used with client locks like UntilExecuted
.
It will delete any existing job for these arguments from retry, schedule and
queue and retry the lock again.
This is slightly dangerous and should probably only be used for jobs that are always scheduled in the future. Currently only attempting to retry one time.
sidekiq_options lock: :until_executed, on_conflict: :replace
This strategy is intended to be used with WhileExecuting
and will delay the job to be tried again in 5 seconds. This will mess up the sidekiq stats but will prevent exceptions from being logged and confuse your sysadmins.
sidekiq_options lock: :while_executing, on_conflict: :reschedule
All that is required is that you specifically set the sidekiq option for unique to a valid value like below:
sidekiq_options lock: :while_executing
Requiring the gem in your gemfile should be sufficient to enable unique jobs.
Sometimes it is desired to have a finer control over which arguments are used in determining uniqueness of the job, and others may be transient. For this use-case, you need to define either a unique_args
method, or a ruby proc.
The unique_args method need to return an array of values to use for uniqueness check.
The method or the proc can return a modified version of args without the transient arguments included, as shown below:
class UniqueJobWithFilterMethod
include Sidekiq::Worker
sidekiq_options lock: :until_and_while_executing,
unique_args: :unique_args # this is default and will be used if such a method is defined
def self.unique_args(args)
[ args[0], args[2][:type] ]
end
...
end
class UniqueJobWithFilterProc
include Sidekiq::Worker
sidekiq_options lock: :until_executed,
unique_args: ->(args) { [ args.first ] }
...
end
It is also quite possible to ensure different types of unique args based on context. I can't vouch for the below example but see #203 for the discussion.
class UniqueJobWithFilterMethod
include Sidekiq::Worker
sidekiq_options lock: :until_and_while_executing, unique_args: :unique_args
def self.unique_args(args)
if Sidekiq::ProcessSet.new.size > 1
# sidekiq runtime; uniqueness for the object (first arg)
args.first
else
# queuing from the app; uniqueness for all params
args
end
end
end
If you need to perform any additional work after the lock has been released you can provide an #after_unlock
instance method. The method will be called when the lock has been unlocked. Most times this means after yield but there are two exceptions to that.
Exception 1: UntilExecuting unlocks and calls back before yielding. Exception 2: UntilExpired expires eventually, no after_unlock hook is called.
class UniqueJobWithFilterMethod
include Sidekiq::Worker
sidekiq_options lock: :while_executing,
def after_unlock
# block has yielded and lock is released
end
...
end.
To see logging in sidekiq when duplicate payload has been filtered out you can enable on a per worker basis using the sidekiq options. The default value is false
class UniqueJobWithFilterMethod
include Sidekiq::Worker
sidekiq_options lock: :while_executing,
log_duplicate_payload: true
...
end
For sidekiq versions before 5.1 a sidekiq_retries_exhausted
block is required per worker class.
class MyWorker
sidekiq_retries_exhausted do |msg, _ex|
SidekiqUniqueJobs::Digests.del(digest: msg['unique_digest']) if msg['unique_digest']
end
end
Starting in v5.1, Sidekiq can also fire a global callback when a job dies:
# this goes in your initializer
Sidekiq.configure_server do |config|
config.death_handlers << ->(job, _ex) do
SidekiqUniqueJobs::Digests.del(digest: job['unique_digest']) if job['unique_digest']
end
end
There are several ways of removing keys that are stuck. The prefered way is by using the unique extension to Sidekiq::Web
. The old console and command line versions still work but might be deprecated in the future. It is better to search for the digest itself and delete the keys matching that digest.
To use the web extension you need to require it in your routes.
# app/config/routes.rb
require 'sidekiq_unique_jobs/web'
mount Sidekiq::Web, at: '/sidekiq'
There is no need to require 'sidekiq/web'
since sidekiq_unique_jobs/web
already does this.
To filter/search for keys we can use the wildcard *
. If we have a unique digest 'uniquejobs:9e9b5ce5d423d3ea470977004b50ff84
we can search for it by enter *ff84
and it should return all digests that end with ff84
.
There is a for praise or scorn. This would be a good place to have lengthy discuss or brilliant suggestions or simply just nudge me if I forget about anything.
This has been probably the most confusing part of this gem. People get really confused with how unreliable the unique jobs have been. I there for decided to do what Mike is doing for sidekiq enterprise. Read the section about unique jobs.
https://www.dailydrip.com/topics/sidekiq/drips/sidekiq-enterprise-unique-jobs
SidekiqUniqueJobs.configure do |config|
config.enabled = !Rails.env.test?
end
If you truly wanted to test the sidekiq client push you could do something like below. Note that it will only work for the jobs that lock when the client pushes the job to redis (UntilExecuted, UntilAndWhileExecuting and UntilExpired).
RSpec.describe Workers::CoolOne do
before do
SidekiqUniqueJobs.config.enabled = false
end
# ... your tests that don't test uniqueness
context 'when Sidekiq::Testing.disabled?' do
before do
Sidekiq::Testing.disable!
Sidekiq.redis(&:flushdb)
end
after do
Sidekiq.redis(&:flushdb)
end
it 'prevents duplicate jobs from being scheduled' do
SidekiqUniqueJobs.use_config(enabled: true) do
expect(described_class.perform_async(1)).not_to eq(nil)
expect(described_class.perform_async(1)).to eq(nil)
end
end
end
end
I would strongly suggest you let this gem test uniqueness. If you care about how the gem is integration tested have a look at the following specs:
- spec/integration/sidekiq_unique_jobs/lock/until_and_while_executing_spec.rb
- spec/integration/sidekiq_unique_jobs/lock/until_executed_spec.rb
- spec/integration/sidekiq_unique_jobs/lock/until_expired_spec.rb
- spec/integration/sidekiq_unique_jobs/lock/while_executing_reject_spec.rb
- spec/integration/sidekiq_unique_jobs/lock/while_executing_spec.rb
- Fork it
- Create your feature branch (
git checkout -b my-new-feature
) - Commit your changes (
git commit -am 'Add some feature'
) - Push to the branch (
git push origin my-new-feature
) - Create new Pull Request
You can find a list of contributors over on https://github.com/mhenrixon/sidekiq-unique-jobs/graphs/contributors