In this lab, you will explore how to perform Continuous Integration(CI) using Jenkins. You are highly recommended to use team virtual machine for this lab.
Jenkins has a wide range of functionalities that facilitate the software development process. In this lab, we will focus on the Continuous Integration aspect, which means updates to the code base are continuously tested to ensure the program's quality.
The testing tool we use is pytest. It is a light weight tool that collect test cases automatically.
To receive credit for this lab, show your work to the TA during recitation.
- Show correct configuration of a Jenkins pipeline on forked Lab6 Github Repo. The build process must automatically fetch Jenkinsfile and run pipeline according to it.
- Complete the
jenkinsfile
to make the Jenkins pipeline test the repo during each build. Explain your understanding of what the given Jenkinsfile does. - Complete the
test_data_split
function intest_utility.py
to test data split step.
Since we recommend you to use team server, your teammate may already finish this step on your team server.
What's more, please strictly keep your credentials of Jenkins secret from anyone outside your group. Our team server has public IP and malicious attacker can cause significant harm to your VM by breaching Jenkins.
To check if Java or Jenkins are installed in your team server, you can run in the terminal java --version
and jenkins --version
.
You can also install these on your computer to test this lab.
- Go to Jenkins Installation Page, select the Operating System you currently use. For your team server, please select linux.
- Under section Installation of Java, install java using command:
sudo apt install fontconfig openjdk-17-jre
. - Show the success of installation by running
java --version
- In the same page for Java Installation, locate the installation command for Jenkins. For linux ubuntu system, it's:
sudo wget -O /usr/share/keyrings/jenkins-keyring.asc \
https://pkg.jenkins.io/debian-stable/jenkins.io-2023.key
echo deb [signed-by=/usr/share/keyrings/jenkins-keyring.asc] \
https://pkg.jenkins.io/debian-stable binary/ | sudo tee \
/etc/apt/sources.list.d/jenkins.list > /dev/null
sudo apt-get update
sudo apt-get install jenkins
- Use command
sudo systemctl start jenkins
to start Jenkins server. - Show the success of Jenkins installation by running
sudo systemctl status jenkins
- Enter
127.0.0.1:8080
in your browser and set up jenkins according to post installation wizard. - Give Jenkins sudo permission by running
sudo visudo
and add linejenkins ALL=(ALL) NOPASSWD: ALL
to this file.
- After retrieve access code of Jenkins, please set up a strong password. You Jenkins portal will be opened to the public internet and susceptible to attack.
- If you home internet has static IP address, you could set up firewall white list to protect your Jenkins server.
- Jenkins also allows advanced security management to control each user's permitted actions. This should not be useful in current team project.
- Miniconda is the recommended for this lab and your VM. However,
venv
orpipenv
are also good choice. - For miniconda users, please run
conda create -n mlip python pytest numpy pandas scikit-learn -c conda-forge
- For
venv
users, runpython -m venv mlip
to create a virtual environment. Then runsource mlip/bin/activate
to activate your virtual environment. Finally, usepip install
to install all required package.
- Fork repository for lab 6 repository and clone it to your local machine. It is important that you must fork the repo and clone the forked repo.
- Enter Jenkins dashboard: 127.0.0.1:8080 using your web browser.
- Clikc on
+ new item
button on the left. - Use
mliplab6
as your project name and choosepipeline
project. - Under
General
section, click onGithub Project
and provide your forked repository http url. - Under
pipeline
, click onpipeline definition
and choosePipeline Script from SCM
. You need to create personal access token. Then choose git as yourSCM
. Add ausername password credential
with any username and your personal access token as the password. - Change the
branch specifier
tomain
. Then, during each build, your Jenkins will pull code from Github and build upon it.
Notice that there is file named test_utility.py
in our repo. Pytest will recursively search through the work directory to find files with pattern test_*.py or *_test.py to collect test cases. Inside test_utility.py
, pytest will search for function names with test and execute them. In this case, test_data_preparation
and test_data_split
will be collected by pytest and executed. Since it's time consuming to test the function using the whole training dataset, we create a light-weight toy dataset act like input to test functions. In this case, housing_data_sample
is a toy input with only two rows. It also provide deterministic test data that can help to reproduce error. Now, please follow the steps below to complete deliverables.
- Run
pytest
on the root directory your local cloned repo. You should observe a failed test with errorNotImplemented
. - modified the TODO part in
test_utility.py
to test data split function. - Run
pytest
again and make sure all test cases are passed. - Push the changes to Github
- Read the Jenkinsfile carefully in the github repo. Then modifies TODO section to make Jenkins run pytest during each build.
- After the modification, push your changes into your forked repository.
- Show TA the modified Jenkins file to complete a deliverable.