Tails the blockchain's transactions and pushes them into a postgres DB
Tails the node utilizing the rest interface/client, and maintains state for each registered TransactionProcessor
. On
startup, by default, will retry any previously errored versions for each registered processor.
When developing your own, ensure each TransactionProcessor
is idempotent, and being called with the same input won't
result in an error if some or all of the processing had previously been completed.
Example invocation:
cargo run -- --pg-uri "postgresql://localhost/postgres" --node-url "https://fullnode.devnet.aptoslabs.com" --emit-every 25 --batch-size 100
Try running the indexer with --help
to get more details
Notes:
- Diesel uses the
DATABASE_URL
env var to connect to the databasediesel migration run
sets up the database and runs all available migrations.
diesel migration generate <your_migration_name>
generates a new folder containingup.sql + down.sql
for your migrationdiesel migration run
to apply the missing migrations,diesel migration redo
to rollback and apply the last migrationdiesel database reset
drops the existing database and reruns all the migrations- You can find more information in the Diesel documentation
The Tailer
is the central glue that holds all the other components together. It's responsible for the following:
- Maintaining processor state. The
Tailer
keeps a record of theResult
of eachTransactionProcessor
's output for each transaction version (eg: transaction). If aTransactionProcessor
returns aResult::Err()
for a transaction, theTailer
will mark that version as failed in the database (along with the stringified error text) and continue on. - Retry failed versions for each
TransactionProcessor
. By default, when aTailer
is started, it will re-fetch the versions for allTransactionProcessor
which have failed, and attempt to re-process them. TheResult::Ok
/Result::Err
returned from theTransactionProcessor::process_version
replace the state in the DB for the givenTransactionProcessor
/version combination. - Piping new transactions from the
Fetcher
into eachTransactionProcessor
that was registered to it. EachTransactionProcessor
gets its own copy, in its owntokio::Task
, for each version. These are done in batches, the size of which is specifiable via--batch-size
. For other tunable parameters, trycargo run -- --help
.
The Fetcher
is responsible for fetching transactions from a node in one of two ways:
- One at a time (used by the
Tailer
when retrying previously errored transactions). - In bulk, with an internal buffer. Although the
Tailer
only fetches one transaction at a time from theFetcher
, internally theFetcher
will fetch from the/transactions
endpoint, which returns potentially hundreds of transactions at a time. This is much more efficient than making hundreds of individual HTTP calls. In the future, when there is a streaming Node API, that would be the optimal source of transactions.
All the above comes free 'out of the box'. The TransactionProcessor
is where everything becomes useful for those
writing their own indexers. The trait only has one main method that needs to be implemented: process_transaction
. You
can do anything you want in a TransactionProcessor
- write data to Postgres tables like the DefaultProcessor
does,
make restful HTTP calls to some other service, submit its own transactions to the chain: anything at all. There is just
one note: transaction processing is guaranteed at least once. It's possible for a given TransactionProcessor
to
receive the same transaction more than once: and so your implementation must be idempotent.
To implement your own TransactionProcessor
, check out the documentation and source code
here: ./src/indexer/transaction_processor.rs
.