AI Agents & Automation

Acting on a Screen

Not every AI agent lives in a robot body. Many agents live entirely inside a computer or phone. They cannot roll around or pick things up. Instead, they act by doing things on a screen — clicking buttons, typing words, searching for information, and sending messages. You do these things all the time yourself! When you search for a video online or send a message to a friend, you are doing digital actions. AI agents can do many of the same digital actions, often much faster than a person would.

What Digital Actions Look Like

Digital actions happen inside a computer system instead of in the physical world. Here are some of the most common ones.

Typing: An agent can type words into a text box, like filling in a form, writing an email, or composing a message. A customer service agent might type a reply to your question automatically.

Clicking: An agent can click on buttons, links, and menus, just as if an invisible finger were pressing on the screen. A shopping agent might click the Add to Cart button after deciding what to buy.

Searching: An agent can enter words into a search engine and get back results, just like you do. A research agent might search for the answer to a question and then read through many results to find the best one.

Sending messages: An agent can send emails, text messages, or chat notifications. An agent in charge of reminders might send you a message when it is time for dinner or a meeting.

The Big Idea

Digital agents act by controlling what happens on a screen: typing, clicking, searching, and sending. They sense digital information — text, images on screen, data — and their actions change what is on the screen or what messages are sent.

Here is a fun story about a digital agent at work. Sophia runs a small lemonade stand and uses an AI agent to help manage orders. When a customer fills out an online form asking for two cups of lemonade, the agent senses the new order. It thinks: I need to add this to the order list and send a confirmation. It acts: it clicks the button to record the order, types the total price into the receipt form, and clicks send to email the customer their receipt. Sophia did not have to do any of that clicking and typing herself. The agent handled all the digital actions in seconds.
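For readers curious what Sophia's agent might look like as a program, here is a minimal Python sketch of the same sense-think-act steps. Everything in it is an assumption made for illustration: the price per cup, the function names, and the use of plain lists to stand in for the order form, the order list, and the email outbox. The story does not say how the real agent is built.

```python
# Hypothetical sketch of the lemonade-stand agent's sense-think-act loop.
PRICE_PER_CUP = 1.50  # assumed price; the story does not give one


def sense(inbox):
    """Sense: take the next new order from the form inbox, if there is one."""
    return inbox.pop(0) if inbox else None


def think(order):
    """Think: work out the total and decide what to record and send."""
    total = order["cups"] * PRICE_PER_CUP
    return {"customer": order["email"], "cups": order["cups"], "total": total}


def act(decision, order_list, outbox):
    """Act: record the order, then 'send' a receipt by adding it to the outbox."""
    order_list.append(decision)
    outbox.append(
        f"To {decision['customer']}: {decision['cups']} cups, "
        f"total ${decision['total']:.2f}"
    )


def run_agent(inbox):
    """Repeat sense, think, act until there are no new orders left."""
    order_list, outbox = [], []
    while True:
        order = sense(inbox)
        if order is None:
            break
        act(think(order), order_list, outbox)
    return order_list, outbox


orders, receipts = run_agent([{"email": "customer@example.com", "cups": 2}])
print(receipts[0])  # To customer@example.com: 2 cups, total $3.00
```

Keeping sense, think, and act as three separate functions mirrors the three steps in the story, so each step can be read or changed on its own.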

Match each digital action to what it actually does.

Terms

Typing into a search bar
Clicking a Submit button
Sending an email message
Clicking a link on a webpage

Definitions

Delivering a written note to someone's inbox automatically
Navigating to a new page to find more information
Entering words to find information on the internet
Telling the computer to process and save a completed form


Some digital agents can see the screen. They use special software to take a screenshot and look at it the way you look at a photo. They sense what is on the screen right now, think about what to do, and then act by clicking or typing. Other digital agents work behind the scenes without seeing the screen at all. They are connected directly to databases and other systems, so they can add records, send messages, and update information without looking at a screen. Both kinds are agents running the sense-think-act loop. They just sense and act in the digital world instead of the physical one.

Speed Is a Superpower

A digital agent can click, type, search, and send in fractions of a second. It can handle hundreds of tasks at the same time without getting tired or making the kinds of small mistakes a person makes after sitting at a computer for hours. Speed and tirelessness are big advantages for digital agents!

Flashcards

A homework-helper AI agent reads a student's essay question and types a response. Which step of the sense-think-act loop is the typing?

Which of these is a digital action an AI agent might take?

Map a Digital Agent's Day

  1. Imagine you are a digital AI agent working as a homework reminder assistant.
  2. Your job: remind students about upcoming homework due dates.
  3. On paper, map out three moments in the agent's day. For each moment, write:
     a. What does the agent SENSE? (What digital information does it see?)
     b. What does the agent THINK? (What does it decide to do?)
     c. What digital ACTION does it take? (What does it type, click, or send?)
  4. For example: Sense: it is 3 PM and Maya has a math assignment due tomorrow. Think: she should be reminded now. Act: send Maya a message saying her math homework is due tomorrow.
  5. Create at least three different sense-think-act moments for your agent.
  6. Share your map and talk about: would this helper actually be useful for real students?
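If you want to see how one moment from your map could become a program, here is a small Python sketch of the Maya example. It is only an illustration. The dates, the second student (Leo), the function names, and the list standing in for sent messages are all made up for this sketch.

```python
from datetime import date, timedelta


def sense(assignments, today):
    """Sense: look through the homework list for anything due tomorrow."""
    tomorrow = today + timedelta(days=1)
    return [a for a in assignments if a["due"] == tomorrow]


def think(due_soon):
    """Think: decide on one reminder message for each assignment found."""
    return [
        (a["student"], f"Your {a['subject']} homework is due tomorrow.")
        for a in due_soon
    ]


def act(reminders, outbox):
    """Act: 'send' each reminder by adding it to the outbox list."""
    for student, text in reminders:
        outbox.append(f"To {student}: {text}")


# Made-up homework list: Maya's math is due the next day, Leo's science is not.
assignments = [
    {"student": "Maya", "subject": "math", "due": date(2025, 3, 4)},
    {"student": "Leo", "subject": "science", "due": date(2025, 3, 10)},
]

outbox = []
act(think(sense(assignments, today=date(2025, 3, 3))), outbox)
print(outbox)  # ['To Maya: Your math homework is due tomorrow.']
```

Only Maya gets a message, because the agent sensed that her assignment is the only one due the next day.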