From c31b68bf44a107e780b01320e83e76b3e369c2b8 Mon Sep 17 00:00:00 2001
From: KnowingNothing <zhengsizeMax@outlook.com>
Date: Sat, 16 May 2020 13:50:15 +0800
Subject: [PATCH] update

---
 README-old.md                                 | 120 +++++++
 README.md                                     | 314 +++++++++++++-----
 project1/.gitignore                           |   1 +
 project2/.gitignore                           |   1 +
 ...54\344\272\214\351\203\250\345\210\206.md" |  47 ++-
 5 files changed, 369 insertions(+), 114 deletions(-)
 create mode 100644 README-old.md
 create mode 100644 project1/.gitignore
 create mode 100644 project2/.gitignore

diff --git a/README-old.md b/README-old.md
new file mode 100644
index 0000000..8f4b6d9
--- /dev/null
+++ b/README-old.md
@@ -0,0 +1,120 @@
+![Build](https://github.com/pku-compiler-design-spring/CompilerProject-2020Spring-Part1/workflows/C/C++%20CI/badge.svg?branch=master)
+
+## Code Generation Compiler
+
+This project is designed for undergraduate students who are taking Compiler Design courses in spring.
+
+> Author: Size Zheng
+
+> Email: zhengsz@pku.edu.cn
+
+### BUG report And Bonus
+
+__Format:__ [date] "message" by **reporters** [bonus]
+
+1. [2020-4-14] "In run.cc case 4,5 golden array shape bug" by **Ye Yuan, Anjiang Wei, Yuyue Wang, Chenyang Yang** [+1]
+2. [2020-4-15] "In document, input BNF bug in AList" by **Jing Mai, Can Su, Zixuan Ling** [+1]
+3. [2020-4-16] "In CMakeLists.txt, link target to library" by **Chenqian Wang, Jiaqi Zhang, Wenqi Wang** [+1]
+
+### 1. Overview
+In this project, we provide several useful IR nodes and the corresponding IRVisitor and IRMutator. The concepts behind these structs are well studied in [Halide](https://github.com/halide/Halide) and [TVM](https://github.com/apache/incubator-tvm). Here we invent some new IR nodes and re-implement the Visitor and Mutator for them.
+
+The purpose of this project is to help students better understand how to build an IR system and implement a simple code generation tool.
+
+The IR infrastructure of this project contains four levels:
+
+```
+Program
+Group
+Stmt
+Expr
+```
+The first level `Program` is not explicitly implemented.
+Each level of IR has several different types of nodes:
+```
+Group: Kernel
+Stmt: LoopNest, IfThenElse, Move
+Expr: IntImm,
+      UIntImm,
+      FloatImm,
+      StringImm,
+      Unary,
+      Binary,
+      Select,
+      Compare,
+      Call,
+      Var,
+      Cast,
+      Ramp,
+      Index,
+      Dom
+```
+
+Using these IR nodes, we can potentially represent many kinds of programs.
+
+### 2. Build
+```sh
+mkdir build
+cd build
+cmake ..
+make -j 4
+```
+
+### 3. Example
+In the `test` directory, there are two examples, `gemm` and `conv2d`. They are good examples of how to represent computations with our IR infrastructure. If you run them:
+```sh
+cd build/test
+./gemm
+./conv2d
+```
+You can see that the results are very similar to C programs. However, the printed strings are just an intermediate representation; you can't run them for now. We hope you can improve the current system to print valid C/C++ programs and compile them with C/C++ compilers.
+
+
+### 4. Tasks
+1. Please read the source code base thoroughly; you need to understand every part of it.
+2. You need to implement your C/C++ code generation. Hint: learn how the IRPrinter works, imitate it, and try to write a new IRVisitor that can print C/C++ source code.
+3. Go to the `project1` directory, where you will find many json files in the `case` directory. They are the inputs to your questions. For example, `example.json` contains:
+```json
+{
+    "name": "kernel_example",
+    "ins": ["B", "C"],
+    "outs": ["A"],
+    "data_type": "float",
+    "kernel": "A<32, 16>[i, j] = C<32, 16>[i, j] * B<32, 16>[i, j];"
+}
+```
+It means you need to generate a `.cc` file that implements the computation `A<32, 16>[i, j] = C<32, 16>[i, j] * B<32, 16>[i, j];`. Put the computation in a function named `kernel_example`, whose inputs are `B` and `C` and whose output is `A`; the data type is `float`. From the expression, we can see that `A` has shape [32, 16], and so do `B` and `C`. So the function's signature is
+```c
+void kernel_example(float (&B)[32][16], float (&C)[32][16], float (&A)[32][16])
+```
+Please try to generate C/C++ source files for these json files and put them under the `kernels` directory.
+
+4. The source files of your code generation application should be placed in the `solution` directory. (But your code generation passes can be put in outer directories such as `include` and `src`.)
+
+
+### 5. Notice
+1. We present a silly solution in the `solution` directory; please do not follow its approach. The example is only there to show you how our framework works.
+2. The source files you put in the `solution` directory should together contain only one `main` function, as we compile all the source files in the `solution` directory into one executable.
+3. Please be careful not to delete important files, which may break the system.
+4. If you want to test your designs, enter the `build` directory and run `make -j 4`. The binaries appear in the `build` directory, in sub-directories such as `project1`; your executables are placed there automatically.
+5. You are not supposed to modify `run.h` and `run.cc`. These files will be replaced by another version that contains the full 10 test cases, so any modification is meaningless.
+6. If you are confused about what kinds of C/C++ code you are supposed to generate, see `solution/example_solution.cc`.
+
+### 6. Judge
+1. We provide an automatic test: after building the project, enter `build/project1` and run `./test1` to see the results.
+2. We only show you 6 cases; 4 cases are hidden. The TAs will test all 10 cases and decide scores according to how many you pass. Don't worry: the hidden cases are no more complex than the open ones. If you can handle the open cases, you should pass all the hidden cases.
+3. Do not copy code from others; we will check! Any attempt to break this rule will result in a score of 0.
+
+
+### 7. How it works?
+When you build the project, we will actually build four parts:
+- the files in `include` and `src`
+- the files in `test`
+- the files in `project1` are compiled to one executable
+- the files in `project1/solution` are compiled to one executable
+
+We also automatically clean the files matching `kernels/*.cc`, so do not expect manual modifications to them to persist.
+
+Then we will call the executable from `project1/solution` automatically, which is expected to generate all the functions and put them in `kernels/*.cc`.
+
+Finally, we will run `./test1` manually to see your results and decide your scores according to the results.
diff --git a/README.md b/README.md
index 8f4b6d9..3c9f722 100644
--- a/README.md
+++ b/README.md
@@ -1,120 +1,258 @@
-![Build](https://github.com/pku-compiler-design-spring/CompilerProject-2020Spring-Part1/workflows/C/C++%20CI/badge.svg?branch=master)
+![Build](https://github.com/pku-compiler-design-spring/CompilerProject-2020Spring/workflows/C/C++%20CI/badge.svg?branch=master)
 
-## Code Generation Compiler
+# Compiler Course Project
+## Part 2: A Compiler for Automatic Differentiation
 
-This project is designed for undergraduate students who are taking Compiler Design courses in spring.
 
-> Author: Size Zheng
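+### 1. Introduction
+In Part 1, we generated C/C++ code from input expressions and tested correctness on 10 cases (6 public, 4 hidden). By now every student should have a working code generator. Recall the motivation for this project: we want to build a code generation tool for an important current application, deep learning, using what we learn in the compiler course. In Part 1 we worked through lexical analysis, parsing, the intermediate representation (IR nodes), syntax tree construction, syntax tree traversal (via IRVisitor), and code generation, and may also have used a little of SDD and SDT. The second project continues in this direction and uses compiler techniques to build more interesting functionality. This time the focus is on syntax tree transformations (individual passes); designing these transformations may draw on more of the textbook material (though not strictly limited to the textbook examples; adapt it to your actual situation). This project may be difficult for some students; we hope that through teamwork everyone can master the knowledge and techniques it requires.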
 
-> Email: zhengsz@pku.edu.cn
 
-### BUG report And Bonus
+### 2. Problem Description
 
-__Format:__ [date] "message" by **reporters** [bonus]
-
-1. [2020-4-14] "In run.cc case 4,5 golden array shape bug" by **Ye Yuan, Anjiang Wei, Yuyue Wang, Chenyang Yang** [+1]
-2. [2020-4-15] "In document, input BNF bug in AList" by **Jing Mai, Can Su, Zixuan Ling** [+1]
-3. [2020-4-16] "In CMakeLists.txt, link target to library" by **Chenqian Wang, Jiaqi Zhang, Wenqi Wang** [+1]
-
-### 1. Overview
-In this project, we provide several useful IR nodes and corresponding IRVisitor and IRMutator. The concept behind these structs are well studied in [Halide](https://github.com/halide/Halide) and [TVM](https://github.com/apache/incubator-tvm). Here we invent some new IR nodes and re-implement the Visitor and Mutator for them.
-
-The purpose of this project is to help students to better understand how to build a IR system and implement a simple code generate tool.
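+#### 2.1 Differentiation in Traditional Deep Learning Frameworks
+Automatic differentiation is currently indispensable in deep learning (no gradient-based optimization algorithm can avoid computing derivatives). In deep learning frameworks (such as TensorFlow and PyTorch), automatic differentiation is done by the framework. Their approach is to first build a computation graph and then construct the gradient graph by the chain rule. As an example, consider a simple computation: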
+```py
+X is a tensor of shape [4, 3, 28, 28]
+T is a label of shape [4, 8 * 28 * 28]
+Y1 = Conv2d(X, kernel=(8, 3, 3, 3), padding=1, stride=1) # result shape is [4, 8, 28, 28]
+Y2 = flatten(Y1)  # result shape is [4, 8 * 28 * 28]
+loss = mse_loss(Y2, T)  # loss is scalar
+```
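+To differentiate with respect to X (the common case is differentiating with respect to network parameters rather than the network input, but this is only an example), we start from loss. The derivative of loss with respect to itself is 1. Next comes the derivative of Y2: the framework sees that loss was computed from Y2 with the mse_loss function, so it looks up the derivative function of mse_loss, grad_mse_loss, and uses it to compute the derivative of Y2. Then, for Y1, the framework sees that Y2 was computed by the flatten function, so it looks up grad_flatten, the derivative function of flatten, and uses it to compute the derivative of Y1. Moving further up to the derivative of X, the framework finds the Conv2d layer and looks up the corresponding convolution gradient function. So besides applying the chain rule, the framework keeps identifying the names of the functions/layers used in forward propagation and looking up the matching derivative functions in its own library. The framework knows which derivative function to look for only because the people who wrote it understood the math and prepared the needed functions in the framework's library.
+This is the traditional way of computing derivatives, and its granularity is the operator (addition, subtraction, multiplication, and division also count as operators). In this project we instead use compiler techniques to automatically generate the backward derivative function from the definition of the forward operator. The whole process needs no knowledge of the operator's name; it only needs the mathematical expression. One advantage is better extensibility for deep learning applications: someone who designs a new operator no longer has to derive the derivative by hand, implement it, and register it in the framework. Providing the forward expression is enough to obtain the definition of the corresponding derivative computation.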
 
-The IR infrastructure of this project contains four levels:
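+#### 2.2 Problem Definition
+__We now state the problem:__
+Given an expression $Output = expr(Input_1, Input_2, ..., Input_n)$ (where $Output$ and $Input_i$ are tensors or scalars, and $expr()$ denotes an expression built from its arguments), and given the derivative of the final loss with respect to $Output$, $dOutput = \frac{\partial loss}{\partial Output}$, we want the derivative of the loss with respect to some input, that is, $dInput_i = \frac{\partial loss}{\partial Input_i}$. **The requirements are:**
+- The derived gradient expression must take the form of one or more assignment statements. The index expressions on the left-hand side of each statement must not contain arithmetic such as addition, subtraction, multiplication, or division; that is, forms like `A[i+1] = B[i]` are not allowed.
+- The gradient expression must be synthesized by compiling and analyzing the input expression, and code must be generated from it. You may not derive the gradient expression by checking the case name (that would be no different from a traditional framework), and you may not print a hard-coded string from a lookup table.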
 
+#### 2.3 An Example
+To aid understanding, here is an example:
+```py
+C<M, N>[i, j] = A<M, K>[i, k] * B<K, N>[k, j]
+```
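+From the first project, we know this expression describes a matrix multiplication.
+Suppose the derivative $dC$ of some $loss$ with respect to $C$ is known (it is a tensor of the same size as $C$; note that $dC$ here is a name, not an operator applied to $C$) and we want $dA$. By the chain rule,
+$$dA[i, k] = \frac{\partial loss}{\partial A[i, k]} = \sum_{j}{\frac{\partial loss}{\partial C[i, j]} \times \frac{\partial C[i, j]}{\partial A[i, k]}} = \sum_{j}{dC[i, j] \times B[k, j]}$$
+So the derivative with respect to $A$ is computed as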
+```py
+dA<M, K>[i, k] = dC<M, N>[i, j] * B<K, N>[k, j]
+```
+Translated into C code, this is
+```c
+for (int i = 0; i < M; ++i) {
+  for (int k = 0; k < K; ++k) {
+    dA[i][k] = 0.0;
+    for (int j = 0; j < N; ++j) {
+      dA[i][k] += dC[i][j] * B[k][j];
+    }
+  }
+}
+```
+The derivative with respect to $B$ can be written similarly:
+```py
+dB<K, N>[k, j] = dC<M, N>[i, j] * A<M, K>[i, k]
 ```
-Program
-Group
-Stmt
-Expr
+
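+### 3. Project Input and Output
+Following the test method of the previous project, we again provide 10 cases. Unlike the test cases of the first project, the json files now contain an additional "grad_to" key, which specifies which input or inputs (possibly one or several) to differentiate with respect to.
+For example, the json file for case1 is: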
+```json
+{
+  "name": "grad_case1",
+  "ins": ["A", "B"],
+  "outs": ["C"],
+  "data_type": "float",
+  "kernel": "C<4, 16>[i, j] = A<4, 16>[i, j] * B<4, 16>[i, j] + 1.0;",
+  "grad_to": ["A"]
+}
 ```
-The first level `Program` is not explicitly implemented.
-Each level of IR has several different type of nodes:
+This specifies differentiation with respect to $A$, so the resulting expression should be
+```py
+dA<4, 16>[i, j] = dC<4, 16>[i, j] * B<4, 16>[i, j]
 ```
-Group: Kernel
-Stmt: LoopNest, IfThenElse, Move
-Expr: IntImm,
-      UIntImm,
-      FloatImm,
-      StringImm,
-      Unary,
-      Binary,
-      Select,
-      Compare,
-      Call,
-      Var,
-      Cast,
-      Ramp,
-      Index,
-      Dom
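+Here $dC$ is the derivative of C. We assume the derivative tensors of all outputs are known, and they are named by prefixing the original name with $d$. In addition, we only consider forward expressions with exactly one output.
+You read in the json file, analyze the forward expression, derive the backward expression using compiler techniques, generate C/C++ code for it, and put the code into the corresponding file under kernels/. Each time you run cmake on this project, the code under solution is run automatically first and then run2.cc is run; run2.cc contains the test logic that checks the correctness of the generated backward code.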
+
+
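+#### 3.1 Test Cases
+There are 10 test cases this time, all public, for the following reasons:
+- Automatic differentiation itself involves NP problems; it is tractable only when the index transformations satisfy certain conditions (such as linearity), and even then the details are fairly involved. Giving 10 concrete cases means you need not spend much effort worrying about unpredictable inputs.
+- Hidden test cases (used in the first project) are meant to stop students from solving the task with lookup tables and trivial solutions (such as printing strings directly). This can instead be prevented by code review (the review methods are described later).
+- Not every student has prior knowledge of differentiation, so all cases are given to help you understand the problem and complete the task.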
+
+These 10 cases all come from real deep learning applications, including:
+1. element-wise multiplication
+2. matrix multiplication
+3. dense MTTKRP
+4. ordinary 2D convolution
+5. transpose
+6. flatten
+7. broadcast
+8. blur
+
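+Since not every student has worked with differentiation before, we provide the ground truth. Each test case has a corresponding test function in run2.cc, and the correct gradient can be read from the function body and its comments. For example, for case1 the function test_case1 is
+```cpp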
+bool test_case1(std::mt19937 &gen, std::uniform_real_distribution<float> &dis) {
+    // "C<4, 16>[i, j] = A<4, 16>[i, j] * B<4, 16>[i, j] + 1.0;"
+    // "dA<4, 16>[i, j] = dC<4, 16>[i, j] * B<4, 16>[i, j];"
+    float B[4][16] = {{0}};
+    float dA[4][16] = {{0}};
+    float dC[4][16] = {{0}};
+    float golden[4][16] = {{0}};
+    // initialize
+    for (int i = 0; i < 4; ++i) {
+        for (int j = 0; j < 16; ++j) {
+            B[i][j] = dis(gen);
+            dC[i][j] = dis(gen);
+        }
+    }
+    // compute golden
+    for (int i = 0; i < 4; ++i) {
+        for (int j = 0; j < 16; ++j) {
+            golden[i][j] = dC[i][j] * B[i][j];
+        }
+    }
+    try {
+        grad_case1(B, dC, dA);
+    } catch (...) {
+        std::cout << "Failed because of runtime error\n";
+        return false;
+    }
+
+    // check
+    for (int i = 0; i < 4; ++i) {
+        for (int j = 0; j < 16; ++j) {
+            if (std::abs(golden[i][j] - dA[i][j]) >= 1e-5) {
+                std::cout << "Wrong answer\n";
+                return false;
+            }
+        }
+    }
+    // correct
+    return true;
+}
 ```
+See the comments, or how golden is computed, to learn the correct gradient.
+
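+### 4. Grading and Requirements
+#### 4.1 Key Dates
+project2 starts: 23:59 on May 16, 2020
+project2 deadline: 23:59 on June 21, 2020
+**Late submissions are not accepted. Submit on time and check that the submission succeeded.**
+#### 4.2 Policy for Graduating Students and Team Formation
+Graduating students have two options:
+1. Complete project2 on time as usual and be graded on it.
+2. Skip project2 and use the final exam score, scaled to 20%, as the project2 score.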
 
-Use these IR nodes we can potentially represent many kinds of programs.
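+If you choose the second option, simply do not submit project2; the TAs will assume you chose the final exam conversion.
+Some graduating students previously teamed up with non-graduating students, so teams may be re-formed for the second project. Graduating students who plan to skip project2 should not join a team. Send new team information to compiler2020spring@163.com by 23:59 on June 1. Teams with no changes need not send an email.
+#### 4.3 Submission Method and Requirements
+##### 4.3.1 Method
+Submit as a team.
+How to submit code: send the github **link** to compiler2020spring@163.com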
 
-### 2. Build
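+##### 4.3.2 Requirements
+1. A report in pdf format must be included in the project2 directory. It must cover the division of work within the team, the design of the automatic differentiation technique, the implementation process, and the experimental results. Other content may be added as you like.
+2. Do not modify or copy the contents of run2.h, run2.cc, or clean2.cc; everything else may be changed freely.
+3. Do not read from stdin; read the input from the json files.
+4. The compiler used must support the C++11 standard (gcc 4.8.5 or later should all qualify).
+
+Before sending the github link, make sure the repository will be public after the submission deadline so that the TAs can download the code (but do not make it public too early, in case someone copies your code). The TAs will test with these commands: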
 ```sh
+git clone --recursive <submitted github link> CompilerProject
+cd CompilerProject
 mkdir build
 cd build
 cmake ..
 make -j 4
+cd project2
+./test2
 ```
 
-### 3. Example
-In `test` directory, thre are two examples of `gemm` and `conv2d`, they are good examples of how to represent computations by our IR infrastructure. If you run them:
-```sh
-cd build/test
-./gemm
-./conv2d
-```
-You can see the results are very similar to C programs, however, the printed strings are just intermediate representation, you can't run the printed strings for now. We hope you can improve current system to print exactly C/C++ programs and compile them using C/C++ compilers
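+#### 4.4 Grading
+This project counts for 20% of the final grade; of these 20 points, 5 come from the pdf report and 15 from the submitted code. To ensure every team member contributes, the division of work stated in the pdf report will be taken into account in the final grade. In addition, if team members report that a member made no contribution and this is verified (by auditing github commit contributions), **that member will receive no points**. Please cooperate closely within the team.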
 
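+Report (pdf) grading criteria:
+- Includes the division of work and the design of the automatic differentiation technique (2 points)
+- Includes the implementation process and experimental results (1 point)
+- Explains the feasibility and correctness of the designed technique through a concrete example (1 point)
+- Summarizes the compiler knowledge used and explains how it was implemented (1 point)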
 
-### 4. Tasks
-1. Please read the source code base throughly, you need to understand every parts of it.
-2. You need to implment you C/C++ code genreation. Hints: learn how the IRPrinter works, imitate it and try to write a new IRVisitor which can print C/C++ source codes.
-2. Go to `project1` directory, you will find many json files in `case` directory. The are inputs to your questions. For example, `example.json` contains:
-```json
-{
-    "name": "kernel_example",
-    "ins": ["B", "C"],
-    "outs": ["A"],
-    "data_type": "float",
-    "kernel": "A<32, 16>[i, j] = C<32, 16>[i, j] * B<32, 16>[i, j];"
-}
-```
-It means you need to generate a `.cc` file which implements the computation of `A<32, 16>[i, j] = C<32, 16>[i, j] * B<32, 16>[i, j];`. Put the computation in a function named `kernel_example`, whose inputs are `B` and `C`, and output is `A`, the data type is `float`. In the expression, we can see `A` has shape of [32, 16], and also `B` and `C`. So the function's signature is 
-```c
-void kernel_example(float (&B)[32][16], float (&C)[32][16], float (&A)[32][16])
-```
-Please try to generate C/C++ source files for these json files and put them under directory `kernels`.
+Code grading criteria:
+- The score (the actual score) is computed from the number of cases passed, according to the table below
 
-4. Your code genration application source files should be placed in `solution` directory. (But your code genration passes can be put in outer directories such as `include` and `src`)
+| Cases passed | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 |
+| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
+| Score | 0 | 2.25 | 4.5 | 6.75 | 9 | 10.5 | 12 | 12.75 | 13.5 | 14.25 | 15 |
 
+- After running ./test2, the score is printed automatically, consistent with the table above
 
-### 5. Notice
-1. We present a silly solution in `solution` directory, please do not follow such silly manner. The example is just used to tell you how our framework works.
-2. All the source files you put in `solution` directory should only contain one `main` function, as we will compile all the source files in `solution` directory into one executable file.
-3. Please be careful and do not delete important files, which may break down the system.
-4. If you want to test your designs, just enter the `build` directory, run `make -j 4`, you will see the binaries in `build` directory, there are sub-directories such as `project1`, your executable files should be placed there automatically.
-5. You are not supposed to modify `run.h` and `run.cc`. These files will be changed to another version which contains the full 10 test cases, so any modification is meaningless.
-6. If you are confused about what kinds of C/C++ code you are supposed to generate, see `solution/example_solution.cc`.
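+#### 4.5 Code Review
+The code is reviewed to prevent cheating. Cheating includes:
+- Copying or modifying the contents of run2.h/run2.cc/clean2.cc
+- Excessive overlap, or complete identity, between the code of any two teams
+- Solving the problem entirely with a third-party project
+- A report that is inconsistent with the actual implementation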
 
-### 6. Judge
-1. We provide auto-test file, after building the project, enter `build/project1`, and run `./test1`, you can see the results.
-2. We only show you 6 cases and 4 cases are hidden. The TAs will test all the 10 files and decide scores according to how many cases you can pass. Don't be worried, the hidden cases are no more complex than the open cases. If you can handle the open cases, you should pass all hidden cases.
-3. Do not copy the codes from others, we will do the check! Any intends to break this rule will result a 0 score to you.
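+Review methods that may be used include:
+1. Full review: the 3 TAs each review the code of a share of the teams
+2. Sampled review: a number of teams are selected at random for code review
+3. Input testing: the TAs randomly modify the json input cases; if meaningful code is still written to kernels/ (the TAs will inspect it), there was no cheating (this does not test correctness, only that code can be generated)
+4. Report-based review: checking that the techniques in the report are consistent with those implemented.
+5. Focused review: a full code review for teams scoring above 12 points.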
 
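+The specific review method will not be disclosed. Please solve the problem using what you learned in the compiler course. Our scoring is nonlinear: any team that passes 4 cases gets at least 9 points (60% of the score), and passing 6 cases gets 12 points (80% of the score). The aim is to encourage everyone to focus on designing their own compiler rather than chasing the score.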
 
-### 7. How it works?
-When you build the project, we will actually build four parts:
-- the files in `include` and `src`
-- the files in `test`
-- the files in `project1` are compiled to one executable
-- the files in `project1/solution` are compiled to one executable
+### 5. References and Code
+1. A paper on automatic differentiation in Halide
+https://people.csail.mit.edu/tzumao/gradient_halide/gradient_halide.pdf
+2. Halide automatic differentiation code
+https://github.com/halide/Halide/blob/master/src/Derivative.cpp
+3. TVM automatic differentiation code
+https://github.com/apache/incubator-tvm/pull/2498
+4. Derivation of backpropagation for 2D convolution
+https://zhuanlan.zhihu.com/p/61898234
+5. Differentiation under linear index transformations
+https://arxiv.org/abs/1711.01348
 
-And we will automatically clean files under `kernels/*.cc`, so you can't expect to modify them manually.
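+### 6. Discussion
+Potential bugs in the project can be reported in the WeChat group or as github issues. Valuable issues earn bonus points for the whole team: 1 point per bug.
+Discussion and collaboration within a team are encouraged, as is a reasonable amount of exchange between teams, via github issues or the WeChat group. The TAs will also take part and answer technical questions.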
 
-Then we will call the executable from `project1/solution` automatically, which is expected to generate all the functions and put them in `kernels/*.cc`.
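+### Appendix
+#### 1. Using IRMutator
+IRMutator traverses the IR and returns a new IR node at each node it visits. The default behavior of IRMutator is to return a new node identical to the original. In practice, you inherit from IRMutator and override specific visit functions to customize the traversal and modification. Every modification of the AST through IRMutator creates a new AST, so the contents of the original AST are not affected.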
 
-At last, we will run `./test1` manually to see your results and decide your scores according to the results.
+In the test/ directory, the file ir_mutator.cc shows a simple customized Mutator:
+```cpp
+class MyMutator : public IRMutator {
+ public:
+  Expr visit(Ref<const Var> op) override {
+    if (op->name == "A") {
+      return Var::make(op->type(), "modified_A", op->args, op->shape);
+    }
+    return IRMutator::visit(op);
+  }
+};
+```
+With this Mutator, Var nodes named "A" in an expression are replaced by Var nodes named "modified_A".
+```cpp
+MyMutator mutator;
+kernel = mutator.mutate(kernel);
+```
+The modified kernel prints as follows:
+```py
+<CPU> simple_gemm(modified_A<1024, 256>, B<256, 512>, C<1024, 512>) {
+  for i<spatial> in dom[((int32_t <1>) 0), ((int32_t <1>) 1024)){
+    for j<spatial> in dom[((int32_t <1>) 0), ((int32_t <1>) 512)){
+      for k<reduce> in dom[((int32_t <1>) 0), ((int32_t <1>) 256)){
+        C[i, j] =<mem_to_mem> C[i, j] + modified_A[i, k] * B[k, j]
+      }
+    }
+  }
+}
+```
+As you can see, the name has indeed changed. In this way, IRMutator can be used to implement many different passes.
\ No newline at end of file
diff --git a/project1/.gitignore b/project1/.gitignore
new file mode 100644
index 0000000..310ba7d
--- /dev/null
+++ b/project1/.gitignore
@@ -0,0 +1 @@
+kernels
\ No newline at end of file
diff --git a/project2/.gitignore b/project2/.gitignore
new file mode 100644
index 0000000..310ba7d
--- /dev/null
+++ b/project2/.gitignore
@@ -0,0 +1 @@
+kernels
\ No newline at end of file
diff --git "a/\347\274\226\350\257\221\345\244\247\344\275\234\344\270\232-\347\254\254\344\272\214\351\203\250\345\210\206.md" "b/\347\274\226\350\257\221\345\244\247\344\275\234\344\270\232-\347\254\254\344\272\214\351\203\250\345\210\206.md"
index 7a38f58..17e1cb5 100644
--- "a/\347\274\226\350\257\221\345\244\247\344\275\234\344\270\232-\347\254\254\344\272\214\351\203\250\345\210\206.md"
+++ "b/\347\274\226\350\257\221\345\244\247\344\275\234\344\270\232-\347\254\254\344\272\214\351\203\250\345\210\206.md"
@@ -137,34 +137,28 @@ bool test_case1(std::mt19937 &gen, std::uniform_real_distribution<float> &dis) {
 
 ### 4. 评分与要求
 #### 4.1 关键日期
-project2 开始：2020年5月16日中午12:00
-project2 截至：2020年6月21日中午12:00
+project2 开始：2020年5月16日晚23:59
+project2 截止：2020年6月21日晚23:59
 **不接受补交，请及时提交文件，并检查是否提交成功**
-#### 4.2 提交途径与要求
-##### 4.2.1 途径
-以小组为单位，分组沿袭第一次project的分组。
-提交代码有两种途径：
-- 发送压缩包到邮箱compiler2020spring@163.com
-- 发送github链接到邮箱compiler2020spring@163.com（推荐）
+#### 4.2 毕业班政策及组队
+毕业班同学有两个选择：
+1. 正常按时完成project2并计分。
+2. 不做project2，使用期末成绩折合20%作为project2的分数。
 
-##### 4.2.2 要求
+如果选择了第二种，只需要不提交project2即可，助教会自动认为选择了使用期末成绩折合的方式。
+考虑到有毕业班同学之前和非毕业班的同学组队。第二次project允许重新组队，请计划不做project2的毕业班同学不要再组队。新的组队信息在6月1日23:59前发送至compiler2020spring@163.com。没有变更的小组不用发邮件。
+#### 4.3 提交途径与要求
+##### 4.3.1 途径
+以小组为单位提交。
+提交代码途径：发送github**链接**到邮箱compiler2020spring@163.com
+
+##### 4.3.2 要求
 1. 必须包含一个pdf版本的报告在project2目录下，报告内容必须涵盖小组分工，自动求导技术设计，实现流程，实验结果。其余内容可根据个人爱好添加。
 2. 不可以更改/拷贝run2.h, run2.cc, clean2.cc的内容，其余内容均可自由改动
 3. 不要从stdin读取内容，请从json文件读取输入
 4. 使用编译器版本需要兼容C++11标准（gcc 4.8.5以上应该都满足）
 
-如果选择发送源代码压缩包，请把必要的源文件都放在压缩包中，不要包含build文件，要保证运行代码仅仅需要以下命令：
-```sh
-mkdir build
-cd build
-cmake ..
-make -j 4
-cd project2
-./test2
-```
-如果依赖第三方项目（以及非C++11标准自带的功能），要想办法满足这个要求（比如把第三方项目代码作为project2的子项目一起打包进压缩包）。
-
-如果选择发送github链接，请一定保证在提交截止日期后代码仓是public的，这样助教有权限下载代码（但也要注意不要提前public了，以防有人抄代码）。助教的测试命令为：
+发送github链接前，请一定保证在提交截止日期后代码仓是public的，这样助教有权限下载代码（但也要注意不要提前public了，以防有人抄代码）。助教的测试命令为：
 ```sh
 git clone --recursive <提交的github链接> CompilerProject
 cd CompilerProject
@@ -176,8 +170,9 @@ cd project2
 ./test2
 ```
 
-#### 4.3 评分法
-本次project占总成绩20%，这20分中5分来自pdf报告，15分来自提交代码。为了保证每个小组成员都要给出有效贡献，pdf报告中的分工将被参考到最终评分中，同时，如果小组其他成员一致举报某一成员并未做出任何贡献，将**不予该成员给分**，请小组内部紧密合作。
+#### 4.4 评分法
+本次project占总成绩20%，这20分中5分来自pdf报告，15分来自提交代码。为了保证每个小组成员都要给出有效贡献，pdf报告中的分工将被参考到最终评分中，同时，如果小组成员举报某一成员并未做出任何贡献，一经查实（查实方法为审核github提交贡献量），将**不予该成员给分**，请小组内部紧密合作。
+
 pdf评分标准：
 - 包含小组分工，自动求导技术设计（2分）
 - 包含实现流程，实验结果内容（1分）
@@ -193,7 +188,7 @@ pdf评分标准：
 
 - 在运行./test2后会自动打出分数，符合上表的定义
 
-#### 4.4 审查法
+#### 4.5 审查法
 审查代码是为了防止同学作弊，作弊的定义包含：
 - 拷贝或修改run2.h/run2.cc/clean2.cc的内容
 - 任何两组的代码重合度过高甚至完全一致
@@ -201,8 +196,8 @@ pdf评分标准：
 - 报告内容与实际实现不一致
 
 可能使用的审查法包括
-1. 全面审查法，共39个队伍，3名助教平均每人审查13支队伍的代码
-2. 抽样审查法，随机挑选x只队伍进行代码审查
+1. 全面审查法，3名助教平均每人审查若干支队伍的代码
+2. 抽样审查法，随机挑选若干队伍进行代码审查
 3. 输入测试法，助教随机修改输入case的json文件，如果仍然能输出有意义的代码到kernels/下（助教会看），说明没有作弊（不测试代码正确性，只考察能生成代码）
 4. 结合报告内容审查法，检查报告中的技术和实现的技术一致性。
 5. 重点审查法，对于得分高于12分的组进行全面代码审查。