Artifact for OOPSLA'21: Generalizable Synthesis Through Unification.
The updates of this project can be found on GitHub.
Note: It is highly recommended to install this artifact on Ubuntu 16.04 because the baseline solvers Eusolver, Euphony and Esolver requires python3.5
as the default version of python3
. If Ubuntu 18.04 or higher versions are used, Polygen will be installed normally but Eusolver, Euphony, and Esolver will fail unless python3.5 is installed manually.
-
Install dependencies
$ apt-get install cmake git wget libgoogle-glog-dev python3.5-tk python3-pip libboost-all-dev libjsoncpp-dev libboost-all-dev libgoogle-perftools-dev $ pip install --upgrade 9.0.1 $ pip3 install pyparsing z3-solver matplotlib==2
-
Clone Polygen
$ git clone https://github.com/jiry17/PolyGen
-
Build the whole project under the root directory of the project.
$ ./install
-
Polygen takes gurobi as the underlying ILP solver. Therefore, a license of gurobi is required. You can get an academic license of gurobi via the following steps.
- Register or login at the webside of gurobi.
- Visit the Free Academic License page.
- Click I Accept These Conditions.
- Get a command like
grbgetkey x...x
at the bottom of the webpage. - Replace
grbgetkey
withgurobi912/linux64/bin/grbgetkey
and execute this command under the root directory of the project. - Test whether the license woirks normally by executing
gurobi912/linux64/bin/gurobi.sh
under the root directory of the project.
-
Test whether Euphony is successfully installed :
$ cd recommend/my-euphony $ . bin/setenv $ ./bin/run_int ../../benchmark/cross ../../benchmark/CLIA_benchmark/max3.sl
The expected output is a program that returns the maximum of three:
(define-fun max3 ((x Int) (y Int) (z Int)) Int (ite (<= z x) (ite (<= y x) x y) (ite (<= z y) y z)))
-
Test whether Eusolver is successfully installed :
$ cd recommend/my-euphony $ . bin/setenv $ ./bin/run_int_eusolver ../../benchmark/CLIA_benchmark/max3.sl
The expected output is the same as 1.
-
Test whether Esolver is successfully installed :
$ cd recommend/esolver
$ ./eusolver benchmarks/max/max_2.sl
The expected output is a program that takes two integers as the input and returns the larger one:
(define-fun max2 ((a0 Int) (a1 Int)) Int
(ite (<= a1 a0) a0 a1))
-
Test whether the project is successfully built:
$ cd build $ ./run ../benchmark/CLIA_benchmark/sum.sl res cegis
The expected output in file
res
is1 (+ Param0 Param1)
Number 1 indicates the synthesis process takes 1 example in total.
$ cd build
# Run Polygen
$ ./run INPUT OUTPUT DOMAIN
# Run Polygen for ablation test
$ ./run INPUT OUTPUT DOMAIN ABLATION
- DOMAIN is
cegis
for oracle model$O_V$ andrandom
for oracle model$O_R$ . - ABLATION is 1 for $ \text{ PolyGen }{-T}$ and 2 for $\text { PolyGen }{-U}$
Some examples are listed below:
$ cd build
# Run original Polygen with cegis
$ ./run max5.sl res cegis
$ cd run
$ ./run_exp [-exp {1,2}] [-r {R <Restart>,C <Clear>}] [-s {0,1}]
# For example, to reproduce all results:
$ ./run_exp -c R
-exp
: the id of the experiment you want to run. All experiments will be executed by default.-c
: whether to clear the cache:R
represents yes whileC
represents no, and the default value isC
.-s
: whether to skip random test when the same benchmark fails in cegis test:1
represents yes while0
represents no. The default value is1
.-d
: whether to draw the figures:1
represents yes while0
represents no. The default value is0
.
Some parameters can be set in config.py
KMemoryLimit
: the memory limit. The default value is 8 GBKTimeLimit
: the time limit. The default value is 120 seconds.KIntMin
: lower bound of the input. The default value is -20.KIntMax
: upper bound of the input. The default value is 20.KExampleLimit
: The limit of examples involves in the synthesizing process. The default value is 10000.KRepeatNum
: the number of repetitions of each execution. The default value is 1 for efficiency. Note that all the algorithms are random, the smaller this value is, the more volatile the result will be. Our experiment set this value as 5.
The result of each single execution is cached in exp/result_cache
.
The figure of each experiment will be stored in exp/figure
.
These results are expected to be consistent with results presented in run
.
$ cd run
$ ./run_exp -exp 1
For (a) ~ (d) of Figure 2, the script will redraw them respectively.
For data listed in Table 2, run_exp
will recalculate them and print them to the standard output.
$ cd run
$ ./run_exp -exp 2
For (e) ~ (h) of Figure 2, the script will redraw them respectively.
For data listed in Table 3, run_exp
will recalculate them and print them to the standard output.
Note: There may be some small differences between the results listed in our paper and the reproduced ones because there exists randomness.