desktop
Y YouTube

Model

Tasks

Update video metadata Moderate recent comments Search and add to playlist Export analytics snapshot Schedule a community post
1 2 3 4 5
RL env

YouTube environment

5 tasks with a model prompt, seeded environment state, and grader contract.

Task 1

Update video metadata

Prompt
Open the seeded YouTube Studio video, update the title and description from the brief, and save changes.
Environment
A draft video is available in YouTube Studio with outdated metadata.
Grader
Checks title, description, and that visibility is not changed.
Task 2

Moderate recent comments

Prompt
Filter comments to held-for-review, approve the helpful comment, and hide the spam comment.
Environment
The comments page has held comments with labels in the task brief.
Grader
Checks moderation actions on the two target comments only.
Task 3

Search and add to playlist

Prompt
Search YouTube for the reference video title in the brief and add the matching result to the research playlist.
Environment
The YouTube search page and a research playlist are available in the mock account.
Grader
Checks that the correct video ID is added to the specified playlist.
Task 4

Export analytics snapshot

Prompt
Open analytics for the last 28 days, switch to traffic source, and export the CSV.
Environment
Analytics defaults to the last 7 days and overview metrics.
Grader
Checks date range, traffic source tab, and CSV export.
Task 5

Schedule a community post

Prompt
Create a community post with the provided copy and schedule it for the target date.
Environment
The community composer is empty and the schedule date is in the task brief.
Grader
Checks post copy, schedule date, and that it is not published immediately.
UseDesktop Evals

Computer-use agent evals.

RL envs Main site Docs Blog