|
| 1 | +# Step Logs |
| 2 | + |
| 3 | +The step log contains the agent's response to the user's request and additional information at every step. The step log is stored in the `response.log` file. The log fields are different for `HostAgent` and `AppAgent`. The step log is at the `info` level. |
| 4 | +## HostAgent Logs |
| 5 | + |
| 6 | +The `HostAgent` logs contain the following fields: |
| 7 | + |
| 8 | + |
| 9 | +### LLM Output |
| 10 | + |
| 11 | +| Field | Description | Type | |
| 12 | +| --- | --- | --- | |
| 13 | +| Observation | The observation of current desktop screenshots. | String | |
| 14 | +| Thought | The logical reasoning process of the `HostAgent`. | String | |
| 15 | +| Current Sub-Task | The current sub-task to be executed by the `AppAgent`. | String | |
| 16 | +| Message | The message to be sent to the `AppAgent` for the completion of the sub-task. | String | |
| 17 | +| ControlLabel | The index of the selected application to execute the sub-task. | String | |
| 18 | +| ControlText | The name of the selected application to execute the sub-task. | String | |
| 19 | +| Plan | The plan for the following sub-tasks after the current sub-task. | List of Strings | |
| 20 | +| Status | The status of the agent, mapped to the `AgentState`. | String | |
| 21 | +| Comment | Additional comments or information provided to the user. | String | |
| 22 | +| Questions | The questions to be asked to the user for additional information. | List of Strings | |
| 23 | +| AppsToOpen | The application to be opened to execute the sub-task if it is not already open. | Dictionary | |
| 24 | + |
| 25 | + |
| 26 | +### Additional Information |
| 27 | + |
| 28 | +| Field | Description | Type | |
| 29 | +| --- | --- | --- | |
| 30 | +| Step | The step number of the session. | Integer | |
| 31 | +| RoundStep | The step number of the current round. | Integer | |
| 32 | +| AgentStep | The step number of the `HostAgent`. | Integer | |
| 33 | +| Round | The round number of the session. | Integer | |
| 34 | +| ControlLabel | The index of the selected application to execute the sub-task. | Integer | |
| 35 | +| ControlText | The name of the selected application to execute the sub-task. | String | |
| 36 | +| Request | The user request. | String | |
| 37 | +| Agent | The agent that executed the step, set to `HostAgent`. | String | |
| 38 | +| AgentName | The name of the agent. | String | |
| 39 | +| Application | The application process name. | String | |
| 40 | +| Cost | The cost of the step. | Float | |
| 41 | +| Results | The results of the step, set to an empty string. | String | |
| 42 | +| CleanScreenshot | The image path of the desktop screenshot. | String | |
| 43 | + |
| 44 | + |
| 45 | + |
| 46 | +## AppAgent Logs |
| 47 | + |
| 48 | +The `AppAgent` logs contain the following fields: |
| 49 | + |
| 50 | +### LLM Output |
| 51 | + |
| 52 | +| Field | Description | Type | |
| 53 | +| --- | --- | --- | |
| 54 | +| Observation | The observation of the current application screenshots. | String | |
| 55 | +| Thought | The logical reasoning process of the `AppAgent`. | String | |
| 56 | +| ControlLabel | The index of the selected control to interact with. | String | |
| 57 | +| ControlText | The name of the selected control to interact with. | String | |
| 58 | +| Function | The function to be executed on the selected control. | String | |
| 59 | +| Args | The arguments required for the function execution. | List of Strings | |
| 60 | +| Status | The status of the agent, mapped to the `AgentState`. | String | |
| 61 | +| Plan | The plan for the following steps after the current action. | List of Strings | |
| 62 | +| Comment | Additional comments or information provided to the user. | String | |
| 63 | +| SaveScreenshot | The flag to save the screenshot of the application to the `blackboard` for future reference. | Boolean | |
| 64 | + |
| 65 | +### Additional Information |
| 66 | + |
| 67 | +| Field | Description | Type | |
| 68 | +| --- | --- | --- | |
| 69 | +| Step | The step number of the session. | Integer | |
| 70 | +| RoundStep | The step number of the current round. | Integer | |
| 71 | +| AgentStep | The step number of the `AppAgent`. | Integer | |
| 72 | +| Round | The round number of the session. | Integer | |
| 73 | +| Subtask | The sub-task to be executed by the `AppAgent`. | String | |
| 74 | +| SubtaskIndex | The index of the sub-task in the current round. | Integer | |
| 75 | +| Action | The action to be executed by the `AppAgent`. | String | |
| 76 | +| ActionType | The type of the action to be executed. | String | |
| 77 | +| Request | The user request. | String | |
| 78 | +| Agent | The agent that executed the step, set to `AppAgent`. | String | |
| 79 | +| AgentName | The name of the agent. | String | |
| 80 | +| Application | The application process name. | String | |
| 81 | +| Cost | The cost of the step. | Float | |
| 82 | +| Results | The results of the step. | String | |
| 83 | +| CleanScreenshot | The image path of the desktop screenshot. | String | |
| 84 | +| AnnotatedScreenshot | The image path of the annotated application screenshot. | String | |
| 85 | +| ConcatScreenshot | The image path of the concatenated application screenshot. | String | |
| 86 | + |
| 87 | +!!! tip |
| 88 | + You can use the following python code to read the request log: |
| 89 | + |
| 90 | + import json |
| 91 | + |
| 92 | + with open('logs/{task_name}/request.log', 'r') as f: |
| 93 | + for line in f: |
| 94 | + log = json.loads(line) |
| 95 | + |
| 96 | +!!! info |
| 97 | + The `FollowerAgent` logs share the same fields as the `AppAgent` logs. |
0 commit comments