Yardstick Tutorial

Minecraft is one of the best-selling games of all time. It has sold more than 200 million copies,1 and has more than 126 million active monthly players.2 In contrast to traditional games, Minecraft gives players fine-grained control over the environment. Players can be creative and alter the environment to their liking. Players can decide to create buildings, mines, farms, logical circuits, and other constructions. Minecraft's success has led to the creation of hundreds of similar games, which we collectively refer to as Minecraft-like games (MLGs).

Unfortunately, it is difficult for MLG players to play together due to severe performance limitations. The modifiable and complex virtual environment is difficult to scale to a large number of players, resulting in virtual worlds that support only around 200 concurrent players under favorable conditions. This is orders of magnitude lower than today's most scalable worlds, such as EVE Online, which can scale to thousands of concurrent players in a single environment. The only way MLG operators can support their high player numbers and sustain their popularity is by splitting players across a large number of small instances, which prevents players from playing together in large groups.

In this tutorial, you make your first steps into exploring the performance of MLGs by running performance evaluation experiments with Yardstick,3,4 our MLG benchmark.

Setting Up

SSH

We run the Yardstick benchmark on the DAS-6 compute cluster at the VU. To connect to DAS-6, append the following configuration to your SSH configuration file, located at ~/.ssh/config:

Host das6
	HostName fs0.das6.cs.vu.nl
	User DAS6_USERNAME

You should now be able to connect to the DAS-6 using the command ssh das6. SSH will first request your VUnet password, and then your DAS-6 password.

Tip

If you connect to the DAS-6 regularly, it is worth switching to public-key authentication using ssh-keygen and ssh-copy-id. This is left as an exercise for the reader.

Tip

Use eduVPN to connect to the DAS-6 when you're not directly connected to the VU campus network.

VSCode

We will work with a remote Jupyter Notebook, which is easy to read and modify through VSCode. If you have not done so already, install VSCode. Next, use its "Connect to Host..." feature to connect VSCode to DAS6.

Python Environment

Now that your VSCode is connected to DAS6, open a terminal (shortcut: ctrl+~). We will proceed by installing Miniconda, which we use to manage Python and native dependencies required for Yardstick to run.

You can follow the steps outlined on their Web page, or follow the commands listed below:

Important

When asked for an installation location, make sure to use the target_dir location shown below, where whoami, including back ticks, is replaced by your DAS6 username.

The installation location is important because, by default, users on DAS6 have limited storage space available in their home directory, the default installation location, which can lead to errors during or after the installation process.

Run these commands to install Miniconda:

target_dir=/var/scratch/`whoami`/miniconda3
mkdir -p $target_dir
wget https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh -O $target_dir/miniconda.sh
bash $target_dir/miniconda.sh -b -u -p $target_dir
rm -rf $target_dir/miniconda.sh
$target_dir/bin/conda init bash

You will need to close (ctrl+D) and reopen (ctrl+~) your shell before changes take effect.

Create a new conda environment named yardstick by running:

conda create -n yardstick python=3.9
conda activate yardstick

Once activated, run the following commands to obtain Yardstick and its dependencies:

conda install jupyter pandas seaborn
pip install yardstick-benchmark

Running Experiments

You are now ready to visit the example experiment and start running experiments with Yardstick.

Create a new directory on DAS-6 for this tutorial, download the example notebook into the newly created directory, and open the directory in VSCode:

mkdir yardstick-tutorial
cd yardstick-tutorial
wget https://raw.githubusercontent.com/atlarge-research/yardstick/master/example.ipynb

Open the yardstick-tutorial folder in VSCode, open the IPython Notebook, and click the button in the top right of the VSCode window to select a Kernel. Install the necessary VSCode extensions (probably Jupyter and Python) and select Python Environments... > yardstick. This is the yardstick environment we just created and prepared with Miniconda which contains all the necessary dependencies.

In the remainder of this section, we will ask you to perform increasingly difficult experiments, which will make you increasingly adept at using Yardstick specifically, and performing experiments on a distributed system generally.

Note

Here starts the challenging part of the tutorial. Each subsection can easily take 30 minutes to complete. If you are doing this tutorial as part of a lecture or workshop, there may not be sufficient time remaining to complete all exercises. This is by design. If you are out of time but remain curious about this work, feel free to explore the remaining sections from home. Your account is likely valid for several weeks.

Jupyter Notebook Example

We recommend reading the file line by line to develop a sufficiently good understanding of what is going on. Afterwards, run your first experiment by running all cells in the notebook. The cell that runs the experiment can take a long time (~10 minutes) to complete. This is expected. While the experiment is running, you can run preserve -llist in the terminal to get an overview of node reservations on the DAS-6. You'll likely see a line similar to the one below, with your username:

id      user            start           stop            state   nhosts  hosts
351651  core2435        06/21   07:02   06/21   07:18   R       2       node001 node015

This shows that user core2435 has reserved 2 nodes: node001 and node015 from 7:02am until 7:18am.

Which nodes did you reserve? How many nodes are in use by others? How many nodes do they use?
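If you want to answer these questions programmatically, the reservation table can be parsed with a few lines of Python. The sketch below assumes that every data row has the same whitespace-separated layout as the sample line above; the `Reservation` class and `parse_reservations` function are illustrative names, not part of Yardstick or DAS-6 tooling.

```python
from dataclasses import dataclass


@dataclass
class Reservation:
    res_id: int
    user: str
    start: str
    stop: str
    state: str
    nhosts: int
    hosts: list


def parse_reservations(text):
    """Parse rows shaped like the sample `preserve -llist` line."""
    reservations = []
    for line in text.splitlines():
        fields = line.split()
        # Data rows start with a numeric reservation id; skip headers and blanks.
        if len(fields) < 8 or not fields[0].isdigit():
            continue
        res_id, user, d0, t0, d1, t1, state, nhosts = fields[:8]
        reservations.append(
            Reservation(int(res_id), user, f"{d0} {t0}", f"{d1} {t1}",
                        state, int(nhosts), fields[8:])
        )
    return reservations


sample = """\
id      user            start           stop            state   nhosts  hosts
351651  core2435        06/21   07:02   06/21   07:18   R       2       node001 node015
"""

for r in parse_reservations(sample):
    print(r.user, r.state, r.nhosts, r.hosts)
```

Summing `nhosts` per user over the parsed rows gives the number of nodes each user has reserved.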

When the experiment has completed, it is time to review the resulting plots.

Are the numbers surprising? Why (not)?

Visualizing Results

Initially, only the CPU utilization is plotted. However, there is also a table containing the server's tick duration. Add a plot that visualizes the server's tick duration over time.

Do the tick durations match your expectations? Why (not)?
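One possible shape for such a plot is sketched below. Minecraft servers target 20 ticks per second, which leaves a budget of 50 ms per tick, so the sketch draws that budget as a reference line. The column names `timestamp_s` and `tick_duration_ms` and the data frame contents are placeholders; substitute the table and columns from your notebook.

```python
import matplotlib.pyplot as plt
import pandas as pd

TICK_BUDGET_MS = 50  # 20 ticks per second leaves 50 ms per tick


def plot_tick_duration(df, time_col="timestamp_s", duration_col="tick_duration_ms"):
    """Plot tick duration over time with the 50 ms budget as a reference line.

    The default column names are hypothetical; pass the ones in your table.
    """
    fig, ax = plt.subplots(figsize=(8, 3))
    ax.plot(df[time_col], df[duration_col], label="tick duration")
    ax.axhline(TICK_BUDGET_MS, color="red", linestyle="--", label="50 ms budget")
    ax.set_xlabel("time [s]")
    ax.set_ylabel("tick duration [ms]")
    ax.set_ylim(bottom=0)
    ax.legend()
    return ax


# Synthetic stand-in data, not real measurements.
example = pd.DataFrame({
    "timestamp_s": [0, 1, 2, 3, 4],
    "tick_duration_ms": [12.0, 15.5, 48.0, 61.2, 20.3],
})
ax = plot_tick_duration(example)
overloaded = example[example["tick_duration_ms"] > TICK_BUDGET_MS]
print(f"{len(overloaded)} of {len(example)} samples exceed the budget")
```

Ticks that stay above the reference line for a sustained period indicate that the server cannot keep up with its target rate.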

Visualize Network Bandwidth Usage

The data used for the previous plots is obtained by reading csv files in the experiment's output directory. This directory contains several other files containing other metrics.
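A quick way to find out which metrics are available is to list every CSV file in the output directory together with its header row. The helper below is a minimal sketch using only the standard library; the directory layout and file names in the demonstration are invented and will differ from what your experiment produces.

```python
import csv
import tempfile
from pathlib import Path


def summarize_csv_files(output_dir):
    """Map each CSV file under output_dir to its header row."""
    headers = {}
    for path in sorted(Path(output_dir).rglob("*.csv")):
        with path.open(newline="") as f:
            # An empty file yields an empty header list.
            headers[str(path.relative_to(output_dir))] = next(csv.reader(f), [])
    return headers


# Demonstration on a throwaway directory with a made-up file.
with tempfile.TemporaryDirectory() as tmp:
    node_dir = Path(tmp) / "node001"
    node_dir.mkdir()
    (node_dir / "cpu.csv").write_text("timestamp,usage\n0,0.5\n")
    for name, columns in summarize_csv_files(tmp).items():
        print(name, columns)
```

Pointing `summarize_csv_files` at your experiment's output directory shows which files and columns you can load next.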

Load one of the output files to visualize another metric. We recommend visualizing the network bandwidth usage of the server node.
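Many system monitors record network traffic as cumulative byte counters rather than as a rate. If that is how your output file stores it (an assumption you should check against the file itself), the counters must first be converted to bytes per second before plotting. A minimal sketch of that conversion, with made-up sample values:

```python
def counter_to_rate(timestamps_s, byte_counters):
    """Convert cumulative byte counters to (timestamp, bytes per second) pairs."""
    samples = list(zip(timestamps_s, byte_counters))
    rates = []
    for (t0, b0), (t1, b1) in zip(samples, samples[1:]):
        dt = t1 - t0
        if dt <= 0 or b1 < b0:
            # Skip duplicate timestamps and counter resets.
            continue
        rates.append((t1, (b1 - b0) / dt))
    return rates


# Made-up samples: timestamps in seconds, cumulative bytes sent.
rates = counter_to_rate([0, 10, 20, 30], [1000, 6000, 16000, 16000])
print(rates)  # [(10, 500.0), (20, 1000.0), (30, 0.0)]
```

If the file already contains a rate, this step is unnecessary and the column can be plotted directly.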

Compare by Varying the Number of Players

Edit the notebook to run the experiments twice in a row with varying numbers of players.

Does changing the number of players have an impact on the game's performance?
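One way to structure this comparison is to wrap the experiment in a function that takes the number of players and returns the measured tick durations, and then summarize each run side by side. In the sketch below, `run_experiment` is a hypothetical callable standing in for the notebook cells that launch an experiment and load its results; `fake_run` merely produces synthetic numbers so that the example runs on its own.

```python
from statistics import mean, median


def compare_player_counts(run_experiment, player_counts):
    """Run one experiment per player count and summarize tick durations.

    run_experiment is a hypothetical callable: it takes a player count and
    returns a list of tick durations in milliseconds.
    """
    summary = {}
    for n in player_counts:
        durations = run_experiment(n)
        summary[n] = {
            "mean_ms": mean(durations),
            "median_ms": median(durations),
            "max_ms": max(durations),
        }
    return summary


def fake_run(num_players):
    # Synthetic stand-in, not a measurement; replace with a real experiment run.
    base = 0.5 * num_players
    return [5.0 + base, 6.0 + base, 10.0 + base]


summary = compare_player_counts(fake_run, [10, 50])
for n, stats in summary.items():
    print(n, stats)
```

Keeping every other setting fixed between runs makes it easier to attribute differences to the player count alone.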

Compare by Changing the Game's Configuration

Edit Yardstick's internals and change the game's configuration file to evaluate the impact of changing the simulation distance of the game server.

How does this distance affect the game's performance?
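If the game server reads its settings from a Java-style properties file, the setting can be changed with a small helper like the one below. This is a sketch under assumptions: vanilla Minecraft servers from version 1.18 onward use a `simulation-distance` key in `server.properties`, but the server and file that Yardstick deploys may use a different key or format, so check the configuration file in Yardstick's sources first.

```python
import tempfile
from pathlib import Path


def set_property(path, key, value):
    """Set key=value in a Java-style properties file, appending the key if absent."""
    path = Path(path)
    lines = path.read_text().splitlines() if path.exists() else []
    updated, found = [], False
    for line in lines:
        stripped = line.strip()
        is_comment = stripped.startswith("#")
        if not is_comment and stripped.split("=", 1)[0].strip() == key:
            updated.append(f"{key}={value}")
            found = True
        else:
            updated.append(line)
    if not found:
        updated.append(f"{key}={value}")
    path.write_text("\n".join(updated) + "\n")


# Demonstration on a throwaway file; the key name is an assumption.
with tempfile.TemporaryDirectory() as tmp:
    config = Path(tmp) / "server.properties"
    config.write_text("view-distance=10\nsimulation-distance=10\n")
    set_property(config, "simulation-distance", 4)
    print(config.read_text())
```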

Evaluate the Impact of Player Workloads

The example uses a player workload called WalkAround, in which a variable number of players connect to the server and walk around a predefined area. However, we suspect that the behavior of players can have a significant impact on the game's performance.

Edit Yardstick's internals and add a new player workload with different player behavior.

How does the workload affect the game's performance?

Done Before Time Runs Out?

Explore Yardstick's features freely, or ask the lecturer to come up with an ad-hoc exercise to complete.

BONUS: Connect to the Game Server during Your Experiment

While debugging your experiments, it can be useful to see what the game and its emulated players are doing. Because the DAS-6 worker nodes are not accessible from the Internet, you cannot directly connect to the game server with your local Minecraft client. However, you can easily work around this by creating an SSH tunnel.

Start by running your experiment or by launching the game server manually on a worker node. Next, use preserve -llist to identify which machine (e.g., node0XY) is running the game server.4 Now create an SSH tunnel from your local machine to the worker node that is running the game server, replacing node0XY with the correct hostname:

ssh -L 25565:node0XY:25565 das6

Working out how this command works exactly is left as an exercise for the reader.

Finally, start your Minecraft 1.12.2 client on your local machine and connect to the server at localhost:25565. You should now be connected to the game server running on the DAS-6.
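If the client cannot connect, it helps to first check whether anything is listening on the local end of the tunnel. The sketch below uses only the Python standard library; `port_is_open` is an illustrative helper name. Note that a successful check only shows that the local port accepts connections. SSH accepts the local connection before it contacts the worker node, so the check cannot tell you whether the game server itself is running.

```python
import socket


def port_is_open(host, port, timeout=2.0):
    """Return True if a TCP connection to host:port can be established."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unreachable hosts.
        return False


# Run this on your local machine while the SSH tunnel is active.
print(port_is_open("localhost", 25565))
```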


1. https://news.xbox.com/en-us/2020/05/18/minecraft-connecting-more-players-than-ever-before/

2. Ibid.

3. van der Sar, et al. Yardstick: A Benchmark for Minecraft-like Services. ICPE 2019

4. Eickhoff, et al. Meterstick: Benchmarking Performance Variability in Cloud and Self-hosted Minecraft-like Games. ICPE 2023