Back up mysql databases to... anywhere!
mysql-backup is a simple way to do MySQL database backups and restores when the database is running in a container.
It has the following features:
- dump and restore
- dump to local filesystem or to SMB server
- select database user and password
- connect to any container running on the same system
- select how often to run a dump
- select when to start the first dump, whether time of day or relative to container start time
Please see CONTRIBUTORS.md for a list of contributors.
Support is available at the Slack channel https://slack.databack.io. We accept issues here and general support questions on Slack.
To run a backup, launch mysql-backup
image as a container with the correct parameters. Everything is controlled by environment variables passed to the container.
For example:
docker run -d --restart=always -e DB_DUMP_FREQ=60 -e DB_DUMP_BEGIN=2330 -e DB_DUMP_TARGET=/db -e DB_SERVER=my-db-container -v /local/file/path:/db databack/mysql-backup
The above will run a dump every 60 minutes, beginning at the next 2330 local time, from the database accessible in the container my-db-container
.
The following are the environment variables for a backup:
You should consider the use of --env-file=
, docker secrets to keep your secrets out of your shell history
DB_SERVER
: hostname to connect to database. Required.DB_PORT
: port to use to connect to database. Optional, defaults to3306
DB_USER
: username for the databaseDB_PASS
: password for the databaseDB_NAMES
: names of databases to dump (separated by space); defaults to all databases in the database serverDB_NAMES_EXCLUDE
: names of databases (separated by space) to exclude from the dump;information_schema
.performance_schema
,sys
andmysql
are excluded by default. This only applies ifDB_DUMP_BY_SCHEMA
is set totrue
. For example, if you setDB_NAMES_EXCLUDE=database1 db2
andDB_DUMP_BY_SCHEMA=true
then these two databases will not be dumped by mysqldumpSINGLE_DATABASE
: If is set totrue
, mysqldump command will run without--databases
flag. This avoidUSE <database>;
statement which is useful for the cases in which you want to import the dumpfile into a database with a different name.DB_DUMP_FREQ
: How often to do a dump, in minutes. Defaults to 1440 minutes, or once per day.DB_DUMP_BEGIN
: What time to do the first dump. Defaults to immediate. Must be in one of two formats:- Absolute: HHMM, e.g.
2330
or0415
- Relative: +MM, i.e. how many minutes after starting the container, e.g.
+0
(immediate),+10
(in 10 minutes), or+90
in an hour and a half
- Absolute: HHMM, e.g.
DB_DUMP_CRON
: Set the dump schedule using standard crontab syntax, a single line.RUN_ONCE
: Run the backup once and exit ifRUN_ONCE
is set. Useful if you use an external scheduler (e.g. as part of an orchestration solution like Cattle or Docker Swarm or kubernetes cron jobs) and don't want the container to do the scheduling internally. If you use this option, all other scheduling options, likeDB_DUMP_FREQ
andDB_DUMP_BEGIN
andDB_DUMP_CRON
, become obsolete.DB_DUMP_DEBUG
: If set totrue
, print copious shell script messages to the container log. Otherwise only basic messages are printed.DB_DUMP_TARGET
: Where to put the dump file, should be a directory. Supports four formats:- Local: If the value of
DB_DUMP_TARGET
starts with a/
character, will dump to a local path, which should be volume-mounted. - SMB: If the value of
DB_DUMP_TARGET
is a URL of the formatsmb://hostname/share/path/
then it will connect via SMB. - S3: If the value of
DB_DUMP_TARGET
is a URL of the formats3://bucketname/path
then it will connect via awscli. - Multiple: If the value of
DB_DUMP_TARGET
contains multiple targets, the targets should be separated by a whitespace and the value surrounded by quotes, e.g."/db s3://bucketname/path"
.
- Local: If the value of
DB_DUMP_SAFECHARS
: The dump filename usually includes the character:
in the date, to comply with RFC3339. Some systems and shells don't like that character. If this environment variable is set, it will replace all:
with-
.AWS_ACCESS_KEY_ID
: AWS Key IDAWS_SECRET_ACCESS_KEY
: AWS Secret Access KeyAWS_DEFAULT_REGION
: Region in which the bucket residesAWS_ENDPOINT_URL
: Specify an alternative endpoint for s3 interopable systems e.g. DigitaloceanAWS_CLI_OPTS
: Additional arguments to be passed to theaws
part of theaws s3 cp
command, click here for a list. Be careful, as you can break something!AWS_CLI_S3_CP_OPTS
: Additional arguments to be passed to thes3 cp
part of theaws s3 cp
command, click here for a list. If you are using AWS KMS,sse
,sse-kms-key-id
, etc., may be of interest.SMB_USER
: SMB username. May also be specified inDB_DUMP_TARGET
with ansmb://
url. If both specified, this variable overrides the value in the URL.SMB_PASS
: SMB password. May also be specified inDB_DUMP_TARGET
with ansmb://
url. If both specified, this variable overrides the value in the URL.COMPRESSION
: Compression to use. Supported are:gzip
(default),bzip2
DB_DUMP_BY_SCHEMA
: Whether to use separate files per schema in the compressed file (true
), or a single dump file (false
). Defaults tofalse
.DB_DUMP_KEEP_PERMISSIONS
: Whether to keep permissions for a file target. By default,mysql-backup
copies the backup compressed file to the target withcp -a
. In certain filesystems with certain permissions, this may cause errors. You can disable the-a
flag by settingDB_DUMP_KEEP_PERMISSIONS=false
. Defaults totrue
.MYSQLDUMP_OPTS
: A string of options to pass tomysqldump
, e.g.MYSQLDUMP_OPTS="--opt abc --param def --max_allowed_packet=123455678"
will runmysqldump --opt abc --param def --max_allowed_packet=123455678
NICE
: true to perform mysqldump with ionice and nice option:- check for more information :- http://eosrei.net/articles/2013/03/forcing-mysqldump-always-be-nice-cpu-and-ioTMP_PATH
: tmp directory to be used during backup creation and other operations. Optional, defaults to/tmp
There are several options for scheduling how often a backup should run:
RUN_ONCE
: run just once and exit.DB_DUMP_FREQ
andDB_DUMP_BEGIN
: run every x minutes, and run the first one at a particular time.DB_DUMP_CRON
: run on a schedule.
If a cron-scheduled backup takes longer than the beginning of the next backup window, it will be skipped. For example, if your cron line is scheduled to backup every hour, as follows:
0 * * * *
And the backup that runs at 13:00 finishes at 14:05, the next backup will not be immediate, but rather at 15:00.
The cron algorithm is as follows: after each backup run, calculate the next time that the cron statement will be true and schedule the backup then.
The scheduling options have an order of priority:
RUN_ONCE
runs once, immediately, and exits, ignoring everything else.DB_DUMP_CRON
: runs according to the cron schedule, ignoringDB_DUMP_FREQ
andDB_DUMP_BEGIN
.DB_DUMP_FREQ
andDB_DUMP_BEGIN
: if nothing else is set.
By default, the backup/restore process does not run as root (UID O). Whenever possible, you should run processes (not just in containers) as users other than root. In this case, it runs as username appuser
with UID/GID 1005
.
In most scenarios, this will not affect your backup process negatively. However, if you are using the "Local" dump target, i.e. your DB_DUMP_TARGET
starts with /
- and, most likely, is a volume mounted into the container - you can run into permissions issues. For example, if your mounted directory is owned by root on the host, then the backup process will be unable to write to it.
In this case, you have two options:
- Run the container as root,
docker run --user 0 ...
or, in idocker-compose.yml
,user: "0"
- Ensure your mounted directory is writable as UID or GID
1005
.
In order to perform the actual dump, mysql-backup
needs to connect to the database container. You must pass the database hostname - which can be another container or any database process accessible from the backup container - by passing the environment variable DB_SERVER
with the hostname or IP address of the database. You may override the default port of 3306
by passing the environment variable DB_PORT
.
docker run -d --restart=always -e DB_USER=user123 -e DB_PASS=pass123 -e DB_DUMP_FREQ=60 -e DB_DUMP_BEGIN=2330 -e DB_DUMP_TARGET=/db -e DB_SERVER=my-db-container -v /local/file/path:/db databack/mysql-backup
The dump target is where you want the backup files to be saved. The backup file always is a compressed file the following format:
db_backup_YYYY-MM-DDTHH:mm:ssZ.<compression>
Where the date is RFC3339 date format, excluding the milliseconds portion.
- YYYY = year in 4 digits
- MM = month number from 01-12
- DD = date for 01-31
- HH = hour from 00-23
- mm = minute from 00-59
- ss = seconds from 00-59
- T = literal character
T
, indicating the separation between date and time portions - Z = literal character
Z
, indicating that the time provided is UTC, or "Zulu" - compression = appropriate file ending for selected compression, one of:
gz
(gzip, default);bz2
(bzip2)
The time used is UTC time at the moment the dump begins.
Notes on format:
- SMB does not allow for
:
in a filename (depending on server options), so they are replaced with the-
character when writing to SMB. - Some shells do not handle a
:
in the filename gracefully. Although these usually are legitimate characters as far as the filesystem is concerned, your shell may not like it. To avoid this issue, you can set the "no-colons" options with the environment variableDB_DUMP_SAFECHARS
The dump target is the location where the dump should be placed, defaults to /backup
in the container. Of course, having the backup in the container does not help very much, so we very strongly recommend you volume mount it outside somewhere. See the above example.
If you use a URL like smb://host/share/path
, you can have it save to an SMB server. If you need loging credentials, use smb://user:pass@host/share/path
.
Note that for smb, if the username includes a domain, e.g. your user is mydom\myuser
, then you should use the samb convention of replacing the '' with a ';'. In other words smb://mydom;myuser:pass@host/share/path
If you use a URL like s3://bucket/path
, you can have it save to an S3 bucket.
Note that for s3, you'll need to specify your AWS credentials and default AWS region via AWS_ACCESS_KEY_ID
, AWS_SECRET_ACCESS_KEY
and AWS_DEFAULT_REGION
Also note that if you are using an s3 interopable storage system like DigitalOcean you can use that as the target by setting AWS_ENDPOINT_URL
to ${REGION_NAME}.digitaloceanspaces.com
and setting DB_DUMP_TARGET
to s3://bucketname/path
.
There may be use-cases where you need to modify the source path of the backup file before it gets uploaded to the dump target.
An example is combining multiple compressed files into one and giving it a new name, i.e. db-other-files-combined.tar.gz
.
To do that, place an executable file called source.sh
in the following path:
/scripts.d/source.sh
Whatever your script returns to stdout will be used as the source name for the backup file.
The following exported environment variables will be available to the script above:
DUMPFILE
: full path in the container to the output fileNOW
: date of the backup, as included inDUMPFILE
and given bydate -u +"%Y-%m-%dT%H:%M:%SZ"
DUMPDIR
: path to the destination directory so for example you can copy a new tarball including some other files along with the sql dump.DB_DUMP_DEBUG
: To enable debug mode in post-backup scripts.
Example run:
NOW=20180930151304 DUMPFILE=/tmp/backups/db_backup_201809301513.gz DUMPDIR=/backup DB_DUMP_DEBUG=true /scripts.d/source.sh
Example custom source script:
#!/bin/bash
# Rename source file
echo -n "db-plus-wordpress_${NOW}.gz"
There may be use-cases where you need to modify the target upload path of the backup file before it gets uploaded.
An example is uploading a backup to a date stamped object key path in S3, i.e. s3://bucket/2018/08/23/path
.
To do that, place an executable file called target.sh
in the following path:
/scripts.d/target.sh
Whatever your script returns to stdout will be used as the name for the backup file.
The following exported environment variables will be available to the script above:
DUMPFILE
: full path in the container to the output fileNOW
: date of the backup, as included inDUMPFILE
and given bydate -u +"%Y-%m-%dT%H:%M:%SZ"
DUMPDIR
: path to the destination directory so for example you can copy a new tarball including some other files along with the sql dump.DB_DUMP_DEBUG
: To enable debug mode in post-backup scripts.
Example run:
NOW=20180930151304 DUMPFILE=/tmp/backups/db_backup_201809301513.gz DUMPDIR=/backup DB_DUMP_DEBUG=true /scripts.d/target.sh
Example custom target script:
#!/bin/bash
# Rename target file
echo -n "db-plus-wordpress-uploaded_${NOW}.gz"
Any executable script with .sh extension in /scripts.d/pre-backup/ or /scripts.d/post-backup/ directories in the container will be executed before and after the backup dump process has finished respectively, but before uploading the backup file to its ultimate target. This is useful if you need to include some files along with the database dump, for example, to backup a WordPress install.
To use them you need to add a host volume that points to the post-backup scripts in the docker host. Start the container like this:
docker run -d --restart=always -e DB_USER=user123 -e DB_PASS=pass123 -e DB_DUMP_FREQ=60 \
-e DB_DUMP_BEGIN=2330 -e DB_DUMP_TARGET=/db -e DB_SERVER=my-db-container:db \
-v /path/to/pre-backup/scripts:/scripts.d/pre-backup \
-v /path/to/post-backup/scripts:/scripts.d/post-backup \
-v /local/file/path:/db \
databack/mysql-backup
Or, if you prefer compose:
version: '2.1'
services:
backup:
image: databack/mysql-backup
restart: always
volumes:
- /local/file/path:/db
- /path/to/pre-backup/scripts:/scripts.d/pre-backup
- /path/to/post-backup/scripts:/scripts.d/post-backup
env:
- DB_DUMP_TARGET=/db
- DB_USER=user123
- DB_PASS=pass123
- DB_DUMP_FREQ=60
- DB_DUMP_BEGIN=2330
- DB_SERVER=mysql_db
mysql_db:
image: mysql
....
The scripts are executed in the entrypoint script, which means it has access to all exported environment variables. The following are available, but we are happy to export more as required (just open an issue or better yet, a pull request):
DUMPFILE
: full path in the container to the output fileNOW
: date of the backup, as included inDUMPFILE
and given bydate -u +"%Y-%m-%dT%H:%M:%SZ"
DUMPDIR
: path to the destination directory so for example you can copy a new tarball including some other files along with the sql dump.DB_DUMP_DEBUG
: To enable debug mode in post-backup scripts.
In addition, all of the environment variables set for the container will be available to the script.
For example, the following script will rename the backup file after the dump is done:
#!/bin/bash
# Rename backup file.
if [[ -n "$DB_DUMP_DEBUG" ]]; then
set -x
fi
if [ -e ${DUMPFILE} ];
then
now=$(date +"%Y-%m-%d-%H_%M")
new_name=db_backup-${now}.gz
old_name=$(basename ${DUMPFILE})
echo "Renaming backup file from ${old_name} to ${new_name}"
mv ${DUMPFILE} ${DUMPDIR}/${new_name}
else
echo "ERROR: Backup file ${DUMPFILE} does not exist!"
fi
You can think of this as a sort of basic plugin system. Look at the source of the entrypoint script for other variables that can be used.
Post-processing also give you options to encrypt the backup using openssl. The openssl binary is available to the processing scripts.
The sample examples/encrypt.sh provides a sample post-processing script that you can use to encrypt your backup with AES256.
If you wish to run a restore to an existing database, you can use mysql-backup to do a restore.
You need only the following environment variables:
You should consider the use of --env-file=
to keep your secrets out of your shell history
DB_SERVER
: hostname to connect to database. Required.DB_PORT
: port to use to connect to database. Optional, defaults to3306
DB_USER
: username for the databaseDB_PASS
: password for the databaseDB_RESTORE_TARGET
: path to the actual restore file, which should be a compressed dump file. The target can be an absolute path, which should be volume mounted, an smb or S3 URL, similar to the target.DB_DUMP_DEBUG
: iftrue
, dump copious outputs to the container logs while restoring.- To use the S3 driver
AWS_ACCESS_KEY_ID
,AWS_SECRET_ACCESS_KEY
andAWS_DEFAULT_REGION
will need to be defined.
Examples:
- Restore from a local file:
docker run -e DB_SERVER=gotodb.example.com -e DB_USER=user123 -e DB_PASS=pass123 -e DB_RESTORE_TARGET=/backup/db_backup_201509271627.gz -v /local/path:/backup databack/mysql-backup
- Restore from an SMB file:
docker run -e DB_SERVER=gotodb.example.com -e DB_USER=user123 -e DB_PASS=pass123 -e DB_RESTORE_TARGET=smb://smbserver/share1/backup/db_backup_201509271627.gz databack/mysql-backup
- Restore from an S3 file:
docker run -e DB_SERVER=gotodb.example.com -e AWS_ACCESS_KEY_ID=awskeyid -e AWS_SECRET_ACCESS_KEY=secret -e AWS_DEFAULT_REGION=eu-central-1 -e DB_USER=user123 -e DB_PASS=pass123 -e DB_RESTORE_TARGET=s3://bucket/path/db_backup_201509271627.gz databack/mysql-backup
docker-compose
automagically creates a network when started. docker run
simply attaches to the bridge network. If you are trying to communicate with a mysql container started by docker-compose, you'll need to specify the network in your command arguments. You can use docker network ls
to see what network is being used, or you can declare a network in your docker-compose.yml.
docker run -e DB_SERVER=gotodb.example.com -e DB_USER=user123 -e DB_PASS=pass123 -e DB_RESTORE_TARGET=/backup/db_backup_201509271627.gz -v /local/path:/backup --network="skynet" databack/mysql-backup
Environment variables used in this image can be passed in files as well. This is useful when you are using docker (or rancher) secrets for storing sensitive information.
As you can set environment variable with -e ENVIRONMENT_VARIABLE=value
, you can also use -e ENVIRONMENT_VARIABLE_FILE=/path/to/file
. Contents of that file will be assigned to the environment variable.
Example:
docker run -d \
-e DB_HOST_FILE=/run/secrets/DB_HOST \
-e DB_USER_FILE=/run/secrets/DB_USER \
-e DB_PASS_FILE=/run/secrets/DB_PASS \
-v /local/file/path:/db \
databack/mysql-backup
As with backups pre and post processing, you can do the same with restore operations. Any executable script with .sh extension in /scripts.d/pre-restore/ or /scripts.d/post-restore/ directories in the container will be executed before the restore process starts and after it finishes respectively. This is useful if you need to restore a backup file that includes some files along with the database dump.
For example, to restore a WordPress install, you would uncompress a tarball containing
the db backup and a second tarball with the contents of a WordPress install on
pre-restore
. Then on post-restore
, uncompress the WordPress files on the container's web server root directory.
For an example take a look at the post-backup examples, all variables defined for post-backup scripts are available for pre-processing too. Also don't forget to add the same host volumes for pre-restore
and post-restore
directories as described for post-backup processing.
This github repo is the source for the mysql-backup image. The actual image is stored on the docker hub at databack/mysql-backup
, and is triggered with each commit to the source by automated build via Webhooks.
There are 2 builds: 1 for version based on the git tag, and another for the particular version number.
The tests all run in docker containers, to avoid the need to install anything other than make
and docker
, and even can run over remote docker connections, avoiding any local bind-mounts. To run all tests:
make test
To run with debugging
make test DEBUG=debug
The above will generate copious outputs, so you might want to redirect stdout and stderr to a file.
This runs each of the several testing targets, each of which is a script in test/test_*.sh
, which sets up tests, builds containers, runs the tests, and collects the output.
Released under the MIT License. Copyright Avi Deitcher https://github.com/deitch