Actions: Enabling the Agent to Engage with Its Environment

Source: Agents Course (Unit 1) - Hugging Face

Summary

Actions are the concrete steps an AI agent takes to interact with its environment (web browsing, tool usage, API calls).

JSON Agent: Emits a structured action payload that an external system parses and executes.
Code Agent: Generates executable code blocks (for example in python) instead of a narrow JSON action schema.
Function-calling Agent: Uses native structured tool-calling behavior exposed by the model or framework. It is adjacent to JSON-style agents, but not identical to "a fine-tuned JSON agent."
Stop and Parse: In many tool-use loops, the model is configured to stop after emitting an action so the environment can parse it, execute it, and return an observation before generation resumes.

This pattern is the control boundary between reasoning and action:

JSON-agent loops depend on this most explicitly, while native function-calling systems can hide more of the parsing machinery behind the model API.

JSON-style action: {"tool": "search", "args": {"query": "latest MCP spec"}}
Code-agent action: a short script that calls an API, transforms the response, and prints a final result