273 lines
14 KiB
Plaintext
273 lines
14 KiB
Plaintext
# Instruction
|
||
- You are an expert in cleaning process data descriptions. Given a task, you are provided with a set of annotation description
|
||
data for a certain visual LLM related to human user operation videos. Plus, You are provided with full trace of playwright action,
|
||
whic includes action and url before and after the action.
|
||
- You need to analyze all the descriptive data and ultimately summarize a complete and reasonable user operation description that can accomplish the given task.
|
||
- For each strategy, give a clear list of the low level action sequence.
|
||
|
||
# Task
|
||
Where is the nearest pharmacy from Carnegie Mellon I can walk within 20mins
|
||
|
||
# Annotation description
|
||
## Part 1
|
||
### Step-by-Step Actions in the Video Segment
|
||
|
||
#### 1. **Initial State**
|
||
- **Action:** The video starts with the OpenStreetMap homepage displayed.
|
||
- **Page Changes:** The page shows a welcome message, a brief description of OpenStreetMap, and two buttons: "Learn More" and "Start Mapping."
|
||
- **Possible Purpose:** The initial state sets the context for the user's interaction with the website.
|
||
|
||
#### 2. **Click on the Search Box**
|
||
- **Action:** I click on the search box located at the top left corner of the page.
|
||
- **Page Changes:** The search box becomes active, allowing text input.
|
||
- **Possible Purpose:** To initiate a search for a specific location or feature on the map.
|
||
|
||
#### 3. **Type "Carnegie Mellon University" into the Search Box**
|
||
- **Action:** I type "Carnegie Mellon University" into the active search box.
|
||
- **Page Changes:** As I type, the search box displays suggestions based on the input text.
|
||
- **Possible Purpose:** To find and locate Carnegie Mellon University on the map.
|
||
|
||
#### 4. **Press the "Go" Button**
|
||
- **Action:** I press the "Go" button next to the search box.
|
||
- **Page Changes:** The map updates to center on the location of Carnegie Mellon University. A sidebar appears on the left side of the screen with detailed information about the selected location.
|
||
- **Possible Purpose:** To confirm the search query and view the specific location on the map.
|
||
|
||
#### 5. **Review the Sidebar Information**
|
||
- **Action:** I review the information displayed in the sidebar.
|
||
- **Page Changes:** The sidebar shows details such as the name, address, and other attributes of Carnegie Mellon University.
|
||
- **Possible Purpose:** To gather more information about the selected location.
|
||
|
||
#### 6. **Zoom In on the Map**
|
||
- **Action:** I use the zoom controls on the right side of the map to zoom in.
|
||
- **Page Changes:** The map zooms in, providing a more detailed view of the area around Carnegie Mellon University.
|
||
- **Possible Purpose:** To get a closer look at the specific area and surrounding features.
|
||
|
||
### Summary
|
||
In this video segment, I interact with the OpenStreetMap website by searching for "Carnegie Mellon University," confirming the search, and then reviewing the detailed information and map view of the location. The actions are focused on locating and examining a specific place using the map's search and zoom functionalities.
|
||
|
||
---
|
||
|
||
## Part 2
|
||
### Step-by-Step Actions in the Video Segment
|
||
|
||
#### 1. **Action:** I click on the search bar located at the top-left corner of the webpage.
|
||
- **Page Changes:** The cursor is now active inside the search bar, indicating it is ready for text input.
|
||
- **Possible Purpose:** The likely intent is to initiate a search query to find specific information or a location on the OpenStreetMap website.
|
||
|
||
#### 2. **Action:** I type "pharmacy near Carnegie Mellon University" into the search bar.
|
||
- **Page Changes:** As I type, the text appears in the search bar. No immediate page changes occur until the search is executed.
|
||
- **Possible Purpose:** The purpose is to locate pharmacies in the vicinity of Carnegie Mellon University using the search functionality provided by the website.
|
||
|
||
#### 3. **Action:** I click the "Go" button next to the search bar.
|
||
- **Page Changes:** After clicking "Go," a new section titled "Search Results from OpenStreetMap Nominatim" appears below the search bar. This section lists various pharmacy locations near Carnegie Mellon University, along with their addresses.
|
||
- **Possible Purpose:** The intent is to execute the search query and retrieve relevant results pertaining to pharmacies near the specified location.
|
||
|
||
#### 4. **Action:** I hover over the listed search results under "Search Results from OpenStreetMap Nominatim."
|
||
- **Page Changes:** Hovering over the results highlights each entry sequentially, but no other significant changes occur on the page.
|
||
- **Possible Purpose:** The likely intent is to review the search results and possibly select one of the listed pharmacy locations for further details or navigation.
|
||
|
||
### Summary
|
||
In this video segment, I interact with the OpenStreetMap website by using the search functionality to find pharmacies near Carnegie Mellon University. I activate the search bar, input the desired query, execute the search, and then review the resulting list of nearby pharmacies. Each action is performed methodically to achieve the goal of locating relevant pharmacy information.
|
||
|
||
---
|
||
|
||
## Part 3
|
||
### Part 1: Observing the Actions in the Provided Video Segment
|
||
|
||
#### Step 1:
|
||
- **Action:** I click on the search bar located at the top left corner of the OpenStreetMap webpage.
|
||
- **Page Changes:** The cursor is now active inside the search bar, indicating it is ready for text input.
|
||
- **Possible Purpose:** The likely intent is to enter a search query to find a specific location or point of interest on the map.
|
||
|
||
#### Step 2:
|
||
- **Action:** I type "pharmacy near Carnegie Mellon U" into the search bar.
|
||
- **Page Changes:** As I type, the search bar displays the entered text. No immediate page changes occur until the search is executed.
|
||
- **Possible Purpose:** The purpose is to search for pharmacies located near Carnegie Mellon University.
|
||
|
||
#### Step 3:
|
||
- **Action:** I click the blue "Go" button next to the search bar.
|
||
- **Page Changes:** After clicking "Go," a list of search results appears on the left side of the screen. The map on the right updates to show the general area around Carnegie Mellon University with markers for the found pharmacies.
|
||
- **Possible Purpose:** The action aims to execute the search query and display relevant pharmacy locations on the map and in the search results list.
|
||
|
||
#### Step 4:
|
||
- **Action:** I hover over the first search result, "Pharmacy: University Pharmacy, 4200, Fifth Avenue, North Oakland, Pittsburgh, Allegheny County, 15260, United States."
|
||
- **Page Changes:** Hovering over the result highlights it, but no other immediate changes occur.
|
||
- **Possible Purpose:** The intent is likely to select or view more details about this specific pharmacy.
|
||
|
||
#### Step 5:
|
||
- **Action:** I click on the highlighted search result for "University Pharmacy."
|
||
- **Page Changes:** Upon clicking, the map centers on the location of University Pharmacy, and a marker with an information bubble appears at that spot. The information bubble provides the name and address of the pharmacy.
|
||
- **Possible Purpose:** The goal is to focus on the exact location of University Pharmacy and view its details on the map.
|
||
|
||
### Summary of Observed Actions:
|
||
- I initiated a search for pharmacies near Carnegie Mellon University.
|
||
- I executed the search and reviewed the results.
|
||
- I selected a specific pharmacy from the results to view its location and details on the map.
|
||
Each step was performed methodically to locate and pinpoint a pharmacy of interest using the OpenStreetMap interface.
|
||
|
||
---
|
||
|
||
## Part 4
|
||
### Step-by-Step Actions in the Video Segment
|
||
|
||
#### 1. **Action**: I click on the "Export" tab.
|
||
- **Page Changes**: The content under the "Export" tab becomes visible, displaying options related to exporting map data.
|
||
- **Possible Purpose**: The likely intent is to access features that allow for the downloading or exporting of map data for further use or analysis.
|
||
|
||
#### 2. **Action**: I hover over the map area near the "University Pharmacy" label.
|
||
- **Page Changes**: No immediate changes occur on the page, but the cursor changes to a hand icon, indicating an interactive element.
|
||
- **Possible Purpose**: This action suggests an intention to interact with the map, possibly to select or highlight the specific location of the University Pharmacy.
|
||
|
||
#### 3. **Action**: I right-click on the map near the "University Pharmacy" label.
|
||
- **Page Changes**: A context menu appears with various options such as "Inspect," "Search for nodes," etc.
|
||
- **Possible Purpose**: The purpose is likely to access additional options or information related to the selected map area, such as inspecting the underlying code or searching for specific nodes.
|
||
|
||
#### 4. **Action**: I select the "Inspect" option from the context menu.
|
||
- **Page Changes**: The browser's developer tools panel opens, highlighting the HTML code associated with the clicked map area.
|
||
- **Possible Purpose**: The intent is to examine the HTML structure and properties of the map element, possibly to understand its attributes or to debug.
|
||
|
||
#### 5. **Action**: I click on a specific line in the developer tools panel that reads `get_by_role("link", name="Export")`.
|
||
- **Page Changes**: The corresponding element on the webpage is highlighted, and the focus shifts to the "Export" link in the interface.
|
||
- **Possible Purpose**: This action aims to locate and possibly interact with the "Export" link by using the developer tools to navigate the webpage's structure efficiently.
|
||
|
||
### Summary
|
||
In this video segment, my actions are focused on navigating the OpenStreetMap website, specifically interacting with the "Export" tab and using developer tools to inspect and locate elements related to exporting map data. Each step is methodical, suggesting a detailed exploration or troubleshooting process related to the map's export functionality.
|
||
|
||
# Playwright action
|
||
[
|
||
{
|
||
"action_uid": "textbox_Search",
|
||
"idx": 4,
|
||
"action_repr": "frame.clickget_by_role(\"textbox\", name=\"Search\")",
|
||
"before": {
|
||
"url": "http://miniserver1875.asuscomm.com:3000/#map=7/42.896/-75.108"
|
||
},
|
||
"after": {
|
||
"url": "http://miniserver1875.asuscomm.com:3000/#map=7/42.896/-75.108"
|
||
}
|
||
},
|
||
{
|
||
"action_uid": "button_Go",
|
||
"idx": 5,
|
||
"action_repr": "frame.clickget_by_role(\"button\", name=\"Go\")",
|
||
"before": {
|
||
"url": "http://miniserver1875.asuscomm.com:3000/#map=7/42.896/-75.108"
|
||
},
|
||
"after": {
|
||
"url": "http://miniserver1875.asuscomm.com:3000/#map=7/42.896/-75.108"
|
||
}
|
||
},
|
||
{
|
||
"action_uid": "link_Carnegie Mellon University, Schenley Drive Extension, North Oakland, Pittsburgh, Allegheny County, 15213, United States",
|
||
"idx": 2,
|
||
"action_repr": "frame.clickget_by_role(\"link\", name=\"Carnegie Mellon University, Schenley Drive Extension, North Oakland, Pittsburgh, Allegheny County, 15213, United States\")",
|
||
"before": {
|
||
"url": "http://miniserver1875.asuscomm.com:3000/#map=7/42.896/-75.108"
|
||
},
|
||
"after": {
|
||
"url": "http://miniserver1875.asuscomm.com:3000/#map=7/42.896/-75.108"
|
||
}
|
||
},
|
||
{
|
||
"action_uid": "action_3",
|
||
"idx": 3,
|
||
"action_repr": "frame.clicklocator(\".leaflet-interactive\").first",
|
||
"before": {
|
||
"url": "http://miniserver1875.asuscomm.com:3000/relation/2279034"
|
||
},
|
||
"after": {
|
||
"url": "http://miniserver1875.asuscomm.com:3000/relation/2279034"
|
||
}
|
||
},
|
||
{
|
||
"action_uid": "link_University Pharmacy, 4200, Fifth Avenue, North Oakland, Pittsburgh, Allegheny County, 15260, United States",
|
||
"idx": 6,
|
||
"action_repr": "frame.clickget_by_role(\"link\", name=\"University Pharmacy, 4200, Fifth Avenue, North Oakland, Pittsburgh, Allegheny County, 15260, United States\")",
|
||
"before": {
|
||
"url": "http://miniserver1875.asuscomm.com:3000/node/8010585574"
|
||
},
|
||
"after": {
|
||
"url": "http://miniserver1875.asuscomm.com:3000/search?query=pharmacy%20near%20Carnegie%20Mellon%20University#map=16/40.4429/-79.9522"
|
||
}
|
||
}
|
||
]
|
||
|
||
# Output format
|
||
- 先总结整个任务的Objective,然后按照Strategy-SubStrategy-action三级层次来给出整个过程,
|
||
- 接着给出整个操作流程后的观察和有趣的发现,最后严格按照json格式输出三级层次的过程描述。
|
||
- 最后的输出json应该是包在```{json}```之间,最底层动作需要包含描述、对应的playwright动作指令顺序编号,以及具体指令内容。
|
||
|
||
# Example
|
||
### Complete User Operation Description to Display Labeled Issues in kkroening/ffmpeg-python
|
||
|
||
**Objective:** Filter and display all issues labeled as "question" in the kkroening/ffmpeg-python repository.
|
||
|
||
---
|
||
|
||
#### **Strategy 1: Navigate to the Repository**
|
||
**Low-Level Action Sequence:**
|
||
1. **Search for the user "kkroening"**
|
||
- Click the global search bar (placeholder: "Search GitLab").
|
||
- Type "kkroening" and press `Enter`.
|
||
2. **Select the user from results**
|
||
- Click the "Users" tab in search results.
|
||
- Click on "Karl Kroening @kkroening" in the user list.
|
||
3. **Access the repository**
|
||
- Navigate to the "Personal projects" section.
|
||
- Click on the "ffmpeg-python" project.
|
||
|
||
---
|
||
|
||
#### **Strategy 2: Filter Issues by Label**
|
||
**Low-Level Action Sequence:**
|
||
1. **Open the Issues tab**
|
||
- Scroll to the left sidebar menu.
|
||
- Click the "Issues" tab (displaying the count, e.g., "Issues 402").
|
||
2. **Apply label filtering**
|
||
- Click the search/filter bar in the issues list.
|
||
- Select the "Label" dropdown from the filter options.
|
||
- Type or select "question" from the label dropdown.
|
||
- Click the search/apply button to confirm the filter.
|
||
|
||
---
|
||
|
||
#### **Final Oberservation**
|
||
The issues list will refresh to show only issues with the "question" label. The URL will reflect the filter:
|
||
`.../ffmpeg-python/-/issues/?label_name[]=question`.
|
||
|
||
---
|
||
|
||
### Key Observations from Playwright Trace
|
||
- The final URL after filtering:
|
||
`http://ec2-3-135-39-80.../ffmpeg-python/-/issues/?label_name%5B%5D=question`
|
||
confirms the "question" label filter is applied.
|
||
- Critical interactions include selecting the "Label" dropdown and explicitly choosing "question" to refine results.
|
||
|
||
### Final output
|
||
```json
|
||
[{
|
||
"strategy" : "Navigate to the Repository",
|
||
"substrategies": [
|
||
{
|
||
"substrategy": "Search for the user \"kkroening\"",
|
||
"actions" : [
|
||
{
|
||
"description": "Click the global search bar (placeholder: \"Search GitLab\"). ",
|
||
"playwright_idx" : 18,
|
||
"playwright_instruction" : "frame.pressget_by_placeholder(\"Search GitLab\")Enter"
|
||
}
|
||
]
|
||
},
|
||
{
|
||
"substrategy": "Select the user from results",
|
||
"actions" : [
|
||
]
|
||
}
|
||
]
|
||
},
|
||
{
|
||
"strategy" : "Filter Issues by Label",
|
||
"substrategies" : [
|
||
]
|
||
}]
|
||
``` |