moredis

Warning

October 2023: moredis is no longer actively maintained

moredis is a tool to sync data from MongoDB into redis.

Motivation

MongoDB (and any database for that matter) becomes unwieldy if you have many applications using it for many different purposes. Oftentimes building out your infrastructure like this makes sense to start, but as time goes on and the number of applications increases it becomes harder to do things like diagnose database performance problems and make application-specific database optimizations.

moredis is a tool that reduces an application's direct dependency on MongoDB by syncing specific data out of MongoDB and into redis. The data is synced in a way that optimizes for the query patterns needed by the application, so that MongoDB to no longer lies in the request path of the application.

See this talk by foursquare for more detailed motivation behind breaking up MongoDB monoliths into a more service-oriented persistence layer: Service Oriented Clusters.

How it Works

In a nutshell, moredis works by taking a user specified MongoDB query, then for each returned document, mapping some some value in the document to another value in that document using a redis hash. moredis also allows you to parameterize your query with values passed in at runtime.

For more specific examples, see Examples

Usage

Usage of ./moredis:
  -m, -mongo_url    MongoDB URL, can also be set via the MONGO_URL environment variable
  -p, -params       JSON object with params used for substitution into queries and collection names in config.yml
  -r, -redis_url    Redis URL, can also be set via the REDIS_URL environment variable
  -f, -conf_file    Config file, defaults to ./config.yml
  -h, -help         Print this usage message.

Configuration

moredis cache configuration is done using yaml. You can specify a config file to use, or moredis will default to config.yml in the same folder as the moredis executable. This repo contains a sample config.yml which you can to modify to suit your needs. The sample has comments to describe the various fields and their purposes.

You also need to provide moredis with connection parameters for both your MongoDB instance and Redis instance. These settings can be set with either command line flags or environment variables (with the command line flags taking precedence). Mongo URL should be a MongoDB connection string (exact form expected can be found in the mgo docs). Note that if you want to connect to a MongoDB secondary, you will need to specify ?connect=direct in your mongo url. Redis URL should be in the form "host:port". There is experimental support for Sentinel which will attempt to resolve the master on connect, but will not attempt any failover. Sentinel will be used if the Redis URL is of the form "sentinel://host:port/set".

For each, the settings locations are:

Mongo URL
- flag: -m
- env: MONGO_URL
- default: localhost
Redis URL
- flag: -r
- env: REDIS_URL
- default: localhost

Examples

Simple case insensitive map

Lets say you have a MongoDB collection called 'users', and in this collection you have documents that look like:

{
  _id: ObjectId("507f1f77bcf86cd799431111"),
  username: "CoolDude",
  email: "[email protected]",
  group: ObjectId("507f1f77bcf86cd799432222")
}

Now imagine you were writing a service which required very fast lookups of ids by email in a case-insensitive way. You could accomplish this with the following moredis configuration:

name: 'demo-cache'
collections:
  - collection: 'users'
    query: '{}'
    maps:
      - name: 'users:email'
        key: '{{toLower .email}}'
        val: '{{toString ._id}}'

Then run moredis with:

$ ./moredis

After this runs, you will have a key in redis called 'users:email'. This value for this key will be the key for a hash that has all of your email-to-id mappings. Doing a lookup of the user id for email '[email protected]' in redis would look like the following in redis-cli:

> GET users:email
"moredis:map:1"

> HGET moredis:map:1 [email protected]
"507f1f77bcf86cd799431111"

Specifying a query

That's great if you want to create the mapping for every document in a collection, but often you only want to create the mapping for some subset of documents in a collection. The natural way is to use a query to find the set of documents to operate on (i.e. only for users who are tagged with a specific group).

With moredis you can do this by specifying a query in the above config like so:

name: 'demo-cache'
collections:
  - collection: 'users'
    query: '{"group": "507f1f77bcf86cd799432222"}'
    maps:
      - name: 'users:email'
        key: '{{toLower .email}}'
        val: '{{toString ._id}}'

With this config, we will now only do the mapping for documents in the user collection with the given group id.

Parameterizing your query

To take this example one step further, not only do you only want to create the cache for a specific group, you want to be able to specify this group at runtime without modifying your config.yml. With moredis, you can do this by taking advantage of parameterization in your config, then you can pass in the parameters you want to use on the command line.

To accomplish passing the group id in at runtime, we could modify our config to now look like:

name: demo-cache
collections:
  - collection: users
    query: '{"group": "{{.group}}"}'
    maps:
      - name: 'users:email:{{.group}}'
        key: '{{toLower .email}}'
        val: '{{toString ._id}}'

Now you can see that both our query and our map name are parameterized by this "group" parameter. You can pass that parameter in on the commandline like so:

$ ./moredis -p '{"group": "507f1f77bcf86cd799432222"}'

The result of this run will be the same as from the previous example, except the map will now contain the group id in the key name (so that caches for different groups don't overwrite each other).

Installation

You can grab the latest moredis release for your platform from the Releases page. Then, just extract, configure, and run.

Local development

You can grab moredis for local development in the usual golang way with:

$ go get github.com/Clever/moredis

You can run tests with:

$ make test

Using as a library

moredis can also be used as a library, for example:

package main

import (
  "github.com/Clever/moredis/moredis"
  "log"
)

func main() {
  config, _ := moredis.LoadConfig("./config.yml")
  err := moredis.BuildCache(config, moredis.Params{}, "", "")
  if err != nil {
    log.Fatal(err)
  }
}

Name		Name	Last commit message	Last commit date
Latest commit History 102 Commits
.circleci		.circleci
.github/workflows		.github/workflows
cmd/moredis		cmd/moredis
logger		logger
moredis		moredis
.gitignore		.gitignore
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
VERSION		VERSION
config.yml		config.yml
glide.yaml		glide.yaml
golang.mk		golang.mk

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

moredis

Motivation

How it Works

Usage

Configuration

Examples

Simple case insensitive map

Specifying a query

Parameterizing your query

Installation

Local development

Using as a library

About

Releases 8

Packages

Contributors 16

Languages

License

Clever/moredis

Folders and files

Latest commit

History

Repository files navigation

moredis

Motivation

How it Works

Usage

Configuration

Examples

Simple case insensitive map

Specifying a query

Parameterizing your query

Installation

Local development

Using as a library

About

Resources

License

Stars

Watchers

Forks

Releases 8

Packages 0

Contributors 16

Languages

Packages