Skip to content

Commit

Permalink
edits for jupyter notebook example
Browse files Browse the repository at this point in the history
  • Loading branch information
jraviotta committed Aug 7, 2019
1 parent 6c0c77b commit 8418dfb
Show file tree
Hide file tree
Showing 5 changed files with 235 additions and 117 deletions.
4 changes: 3 additions & 1 deletion .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -81,7 +81,6 @@ target/

# Jupyter Notebook
.ipynb_checkpoints
*.ipynb

# pyenv
.python-version
Expand All @@ -93,3 +92,6 @@ target/

.pytest_cache/
.mypy_cache/

#exceptions
!user_data/noteboks/*example.ipynb
160 changes: 44 additions & 116 deletions docs/data-analysis.md
Original file line number Diff line number Diff line change
@@ -1,164 +1,92 @@
# Analyzing bot data

After performing backtests, or after running the bot for some time, it will be interesting to analyze the results your bot generated.
You can analyze the results of backtests and trading history easily using Jupyter notebooks. A sample notebook is located at `user_data/notebooks/analysis_example.ipynb`. For usage instructions, see [jupyter.org](https://jupyter.org/documentation).

A good way for this is using Jupyter (notebook or lab) - which provides an interactive environment to analyze the data.
## Strategy debugging example

The following helpers will help you loading the data into Pandas DataFrames, and may also give you some starting points in analyzing the results.
Debugging a strategy can be time-consuming. FreqTrade offers helper functions to visualize raw data.

## Strategy development problem analysis

Debugging a strategy (are there no buy signals, ...) can be very time-consuming.
FreqTrade tries to help you by exposing a few helper-functions, which can be very handy.

It's recommended using Juptyer Notebooks for analysis, since it offers a dynamic way to rerun certain parts of the code.

The following is a full code-snippet, which will be explained by both comments, and step by step below.
### Import requirements and define variables used in the script

```python
# Some necessary imports
# Imports
from pathlib import Path

import os
from freqtrade.data.history import load_pair_history
from freqtrade.resolvers import StrategyResolver
# Define some constants
ticker_interval = "5m"

# Name of the strategy class
strategyname = 'Awesomestrategy'
# Location of the strategy
strategy_location = '../xmatt/strategies'
# Location of the data
data_location = '../freqtrade/user_data/data/binance/'
# Only use one pair here
pair = "XRP_ETH"

### End constants

# Load data
bt_data = load_pair_history(datadir=Path(data_location),
ticker_interval = ticker_interval,
pair=pair)
print(len(bt_data))

### Start strategy reload
# Load strategy - best done in a new cell
# Rerun each time the strategy-file is changed.
strategy = StrategyResolver({'strategy': strategyname,
'user_data_dir': Path.cwd(),
'strategy_path': location}).strategy

# Run strategy (just like in backtesting)
df = strategy.analyze_ticker(bt_data, {'pair': pair})
print(f"Generated {df['buy'].sum()} buy signals")

# Reindex data to be "nicer" and show data
data = df.set_index('date', drop=True)
data.tail()

```

### Explanation

#### Imports and constant definition

``` python
# Some necessary imports
from pathlib import Path
from freqtrade.data.btanalysis import load_backtest_data
from freqtrade.data.btanalysis import load_trades_from_db

from freqtrade.data.history import load_pair_history
from freqtrade.resolvers import StrategyResolver
# Define some constants
ticker_interval = "5m"

ticker_interval = "1m"
# Name of the strategy class
strategyname = 'Awesomestrategy'
strategy_name = 'NewStrategy'
# Path to user data
user_data_dir = 'user_data'
# Location of the strategy
strategy_location = 'user_data/strategies'
strategy_location = os.path.join(user_data_dir, 'strategies')
# Location of the data
data_location = 'user_data/data/binance'
data_location = os.path.join(user_data_dir, 'data', 'binance')
# Pair to analyze
# Only use one pair here
pair = "XRP_ETH"
pair = "BTC_USDT"
```

This first section imports necessary modules, and defines some constants you'll probably need to adjust for your case.
### Load exchange data

#### Load candles

``` python
# Load data
```python
# Load data using values set above
bt_data = load_pair_history(datadir=Path(data_location),
ticker_interval = ticker_interval,
ticker_interval=ticker_interval,
pair=pair)
print(len(bt_data))

# Confirm success
print("Loaded " + str(len(bt_data)) + f" rows of data for {pair} from {data_location}")
```

This second section loads the historic data and prints the amount of candles in the DataFrame.
You can also inspect this dataframe by using `bt_data.head()` or `bt_data.tail()`.
### Load and run strategy

#### Run strategy and analyze results
* Rerun each time the strategy file is changed
* Display the trade details. Note that using `data.head()` would also work, however most indicators have some "startup" data at the top of the dataframe.

Now, it's time to load and run your strategy.
For this, I recommend using a new cell in your notebook, since you'll want to repeat this until you're satisfied with your strategy.
Some possible problems:

``` python
# Load strategy - best done in a new cell
# Needs to be ran each time the strategy-file is changed.
strategy = StrategyResolver({'strategy': strategyname,
'user_data_dir': Path.cwd(),
'strategy_path': location}).strategy
* Columns with NaN values at the end of the dataframe
* Columns used in `crossed*()` functions with completely different units

```python
# Load strategy using values set above
strategy = StrategyResolver({'strategy': strategy_name,
'user_data_dir': user_data_dir,
'strategy_path': strategy_location}).strategy

# Run strategy (just like in backtesting)
df = strategy.analyze_ticker(bt_data, {'pair': pair})
print(f"Generated {df['buy'].sum()} buy signals")

# Reindex data to be "nicer" and show data
# Report results
print(f"Generated {df['buy'].sum()} buy signals")
data = df.set_index('date', drop=True)
data.tail()
```

The code snippet loads and analyzes the strategy, calculates and prints the number of buy signals.

The last 2 lines serve to analyze the dataframe in detail.
This can be important if your strategy did not generate any buy signals.
Note that using `data.head()` would also work, however this is misleading since most indicators have some "startup" time at the start of a backtested dataframe.
### Load backtest results into a pandas dataframe

There can be many things wrong, some signs to look for are:

* Columns with NaN values at the end of the dataframe
* Columns used in `crossed*()` functions with completely different units

## Backtesting

To analyze your backtest results, you can [export the trades](#exporting-trades-to-file).
You can then load the trades to perform further analysis.

Freqtrade provides the `load_backtest_data()` helper function to easily load the backtest results, which takes the path to the the backtest-results file as parameter.

``` python
from freqtrade.data.btanalysis import load_backtest_data
df = load_backtest_data("user_data/backtest-result.json")
```python
# Load backtest results
df = load_backtest_data("user_data/backtest_data/backtest-result.json")

# Show value-counts per pair
df.groupby("pair")["sell_reason"].value_counts()

```

This will allow you to drill deeper into your backtest results, and perform analysis which otherwise would make the regular backtest-output very difficult to digest due to information overload.

If you have some ideas for interesting / helpful backtest data analysis ideas, please submit a Pull Request so the community can benefit from it.

## Live data

To analyze the trades your bot generated, you can load them to a DataFrame as follows:
### Load live trading results into a pandas dataframe

``` python
from freqtrade.data.btanalysis import load_trades_from_db

# Fetch trades from database
df = load_trades_from_db("sqlite:///tradesv3.sqlite")

# Display results
df.groupby("pair")["sell_reason"].value_counts()

```

Feel free to submit an issue or Pull Request enhancing this document if you would like to share ideas on how to best analyze the data.
1 change: 1 addition & 0 deletions environment.yml
Original file line number Diff line number Diff line change
Expand Up @@ -37,6 +37,7 @@ dependencies:
- coveralls
- mypy
# Useful for jupyter
- jupyter
- ipykernel
- isort
- yapf
Expand Down
Empty file added user_data/notebooks/.gitkeep
Empty file.
Loading

0 comments on commit 8418dfb

Please sign in to comment.