Exposing an AI Agent as an MCP Server with MAF — Part 5 : Driving AI Agents from YAML: Config-First Agent Servers in MAF

The Problem with Hardcoded Config#

In Part 4 we built a server that gives a MAF ChatAgent three simultaneous interfaces — MCP, REST, and a service registry — all from a single tools list. But the agent’s name, ports, instructions, and tool list were hardcoded as a Python dict inside server.py:

1
AGENT_CONFIG = {
2
    "name":         "ServiceNowAgent",
3
    "display_name": "ServiceNow Incident Manager",
4
    "description":  "Manages ServiceNow incidents — create, update, search.",
5
    "version":      "1.0.0",
6
    "tags":         ["servicenow", "itsm"],
7
    "module":       "tools",
8
    "tools":        ["create_incident", "update_incident", "search_incident"],
9
    "instructions": "You are a ServiceNow assistant...",
10
}

This works for a single agent. But the moment you need a second agent — a Jira agent, a GitHub agent, a Slack agent — you have to edit server.py again. The file that should be infrastructure starts accumulating business logic. Deployments become fragile. Non-engineers can’t configure agents without touching Python.

The fix is to externalise configuration into YAML files — one file per agent — and have server.py discover and load them at startup. No Python changes needed to add, remove, or reconfigure an agent.

The YAML Schema#

Each agent lives in its own file in an agents/ directory. The schema is designed to be self-documenting and cover everything server.py previously hardcoded:

1
# ──────────────────────────────────────────────────────────────────
2
# One file per agent. server.py discovers all *.yaml files in the
3
# agents/ directory on startup.
4
# ──────────────────────────────────────────────────────────────────
5

6
agent:
7
  # ── Identity ───────────────────────────────────────────────────
8
  name: ServiceNowAgent              # unique key — used in registry and MCP
9
  display_name: ServiceNow Incident Manager
10
  description: Manages ServiceNow incidents — create, update, search.
11
  version: "1.0.0"
12
  tags:
13
    - servicenow
14
    - itsm
15

16
  # ── Ports ──────────────────────────────────────────────────────
17
  ports:
18
    rest: 8000
19
    mcp: 8001
20

21
  # ── Tools ──────────────────────────────────────────────────────
22
  module: tools                      # Python module to import tools from
23
  tools:
24
    - create_incident
25
    - update_incident
26
    - search_incident
27

28
  # ── Behaviour ──────────────────────────────────────────────────
29
  instructions: |
30
    You are a ServiceNow assistant. Use your tools to manage incidents.
31
    Always confirm the incident number in your response.
32
    When urgency is not specified, default to Medium (2).

A few deliberate choices here. The instructions field uses YAML’s block scalar (|) so multi-line system prompts stay readable without escaping. The ports block is its own section rather than flat fields, making it clear these are deployment concerns separate from identity. The name field is the unique key used in the registry and as the MCP server identifier — it cannot contain spaces.

Adding a second agent is just a new file:

1
agent:
2
  name: JiraAgent
3
  display_name: Jira Issue Manager
4
  description: Creates and tracks Jira issues across projects.
5
  version: "1.0.0"
6
  tags:
7
    - jira
8
    - project-management
9
  ports:
10
    rest: 8010
11
    mcp: 8011
12
  module: jira_tools
13
  tools:
14
    - create_issue
15
    - update_issue
16
    - search_issues
17
  instructions: |
18
    You are a Jira assistant. Use your tools to manage issues.
19
    Always include the issue key in your response.

No changes to server.py. No Python at all.

Validating the Config with Pydantic#

Loading YAML is trivial — two lines of Python. The harder problem is catching errors early. A typo in a port number, a missing name field, or two agents sharing the same port should fail loudly at startup with a clear message, not silently misbehave at runtime.

Pydantic makes this straightforward. Two model classes mirror the YAML structure exactly:

1
from pydantic import BaseModel, Field, ValidationError, field_validator
2

3
class PortsConfig(BaseModel):
4
    rest: int = Field(..., gt=1024, lt=65535, description="REST API port")
5
    mcp:  int = Field(..., gt=1024, lt=65535, description="MCP server port")
6

7
    @field_validator("mcp")
8
    @classmethod
9
    def ports_must_differ(cls, mcp, info):
10
        rest = info.data.get("rest")
11
        if rest and mcp == rest:
12
            raise ValueError("mcp port must differ from rest port")
13
        return mcp
14

15

16
class AgentConfig(BaseModel):
17
    name:         str         = Field(..., min_length=1, pattern=r"^[A-Za-z][A-Za-z0-9_-]*$")
18
    display_name: str         = Field(..., min_length=1)
19
    description:  str         = Field(..., min_length=1)
20
    version:      str         = Field("1.0.0")
21
    tags:         list[str]   = Field(default_factory=list)
22
    ports:        PortsConfig
23
    module:       str         = Field(..., min_length=1)
24
    tools:        list[str]   = Field(..., min_length=1)
25
    instructions: str         = Field(..., min_length=1)

The name field uses a regex pattern to enforce that it starts with a letter and contains only alphanumerics, hyphens, and underscores — no spaces, no special characters that would break URL paths or registry keys. The PortsConfig validator catches the case where rest and mcp are accidentally set to the same value, which would cause a silent port collision at startup.

Loading a file then becomes a single function:

1
def load_config(yaml_path: str) -> AgentConfig:
2
    with open(yaml_path) as f:
3
        raw = yaml.safe_load(f)
4
    try:
5
        return AgentConfig(**raw["agent"])
6
    except (KeyError, ValidationError) as e:
7
        raise ValueError(f"Invalid config in '{yaml_path}': {e}") from e

If the agent: key is missing entirely, or if any field fails validation, the error message includes the file path and Pydantic’s field-level explanation. No guessing about which file caused the problem.

Discovering and Loading Multiple Agents#

discover_configs() handles the multi-agent case. It either loads explicit paths passed as command-line arguments, or globs all *.yaml files from the agents/ directory. After loading, it checks for port collisions across agents — catching the case where two YAML files both claim port 8000:

1
def discover_configs(agents_dir: str, explicit: list[str]) -> list[AgentConfig]:
2
    paths = explicit if explicit else sorted(glob.glob(f"{agents_dir}/*.yaml"))
3
    if not paths:
4
        raise RuntimeError(f"No agent YAML files found in '{agents_dir}/'.")
5

6
    configs = [load_config(p) for p in paths]
7

8
    # Guard against port collisions across agents
9
    seen_ports: set[int] = set()
10
    for cfg in configs:
11
        for port in (cfg.ports.rest, cfg.ports.mcp):
12
            if port in seen_ports:
13
                raise ValueError(f"Port {port} is used by more than one agent.")
14
            seen_ports.add(port)
15

16
    return configs

The port collision check happens before any servers start, so you get a clean error at launch rather than a cryptic address already in use failure mid-startup.

Running Multiple Agents Concurrently#

With configs validated, run_agent() does the same work as before for each agent — loading tools, building the REST API and MCP app, starting uvicorn servers, and self-registering — but now it takes an AgentConfig instead of a raw dict, and ports come from cfg.ports.rest and cfg.ports.mcp rather than module-level constants:

1
async def run_agent(cfg: AgentConfig, all_tasks: list):
2
    tools    = load_tools(cfg.module, cfg.tools)
3
    rest_app = build_rest_api(tools, cfg.display_name)
4
    agent    = ChatAgent(
5
        chat_client=make_client(),
6
        name=cfg.name,
7
        description=cfg.description,
8
        instructions=cfg.instructions,
9
        tools=tools,
10
    )
11
    mcp_app = build_mcp_app(agent)
12

13
    servers = {
14
        "rest": uvicorn.Server(uvicorn.Config(rest_app, host="0.0.0.0", port=cfg.ports.rest, log_level="warning")),
15
        "mcp":  uvicorn.Server(uvicorn.Config(mcp_app,  host="0.0.0.0", port=cfg.ports.mcp,  log_level="warning")),
16
    }
17
    tasks = {k: asyncio.create_task(s.serve()) for k, s in servers.items()}
18
    all_tasks.extend(tasks.values())
19

20
    await asyncio.sleep(0.3)
21

22
    async with httpx.AsyncClient() as client:
23
        await client.post(
24
            f"http://localhost:{REGISTRY_PORT}/registry/register",
25
            json=build_registration(cfg, tools),
26
        )
27

28
    print(f"  [{cfg.name}] REST → http://localhost:{cfg.ports.rest}/docs")
29
    print(f"  [{cfg.name}] MCP  → http://localhost:{cfg.ports.mcp}/sse")
30

31
    return servers

main() starts the registry first, then loops over all configs and calls run_agent() for each. Because each agent’s servers are started with asyncio.create_task(), they all run concurrently in the same event loop — no threads, no subprocesses:

1
async def main():
2
    explicit = [a for a in sys.argv[1:] if a.endswith(".yaml")]
3
    configs  = discover_configs(AGENTS_DIR, explicit)
4

5
    # Start the shared registry first
6
    registry_server = uvicorn.Server(
7
        uvicorn.Config(registry_app, host="0.0.0.0", port=REGISTRY_PORT, log_level="warning")
8
    )
9
    registry_task = asyncio.create_task(registry_server.serve())
10
    await asyncio.sleep(0.3)
11
    print(f"  [Registry] http://localhost:{REGISTRY_PORT}/registry\n")
12

13
    # Start each agent
14
    all_agent_tasks: list = []
15
    all_servers: list = []
16
    for cfg in configs:
17
        servers = await run_agent(cfg, all_agent_tasks)
18
        all_servers.append(servers)
19

20
    try:
21
        await asyncio.gather(registry_task, *all_agent_tasks)
22
    finally:
23
        # Deregister all agents on shutdown
24
        async with httpx.AsyncClient() as client:
25
            for cfg in configs:
26
                await client.delete(
27
                    f"http://localhost:{REGISTRY_PORT}/registry/{cfg.name}"
28
                )
29
        for servers in all_servers:
30
            for s in servers.values():
31
                s.should_exit = True
32
        registry_server.should_exit = True
33
        await asyncio.gather(registry_task, *all_agent_tasks, return_exceptions=True)
34
        print("\n  All agents deregistered. Servers stopped.")

Running It#

With two YAML files in the agents/ directory, a single command starts everything:

1
python server.py

1
  [Registry] http://localhost:8002/registry
2

3
  [ServiceNowAgent] REST → http://localhost:8000/docs
4
  [ServiceNowAgent] MCP  → http://localhost:8001/sse
5
  [JiraAgent] REST → http://localhost:8010/docs
6
  [JiraAgent] MCP  → http://localhost:8011/sse

To start only one specific agent without touching the others:

1
python server.py agents/servicenow_agent.yaml

What the Registry Now Knows#

With two agents running, GET /registry returns the full catalogue:

1
{
2
  "total": 2,
3
  "agents": [
4
    {
5
      "name": "ServiceNowAgent",
6
      "display_name": "ServiceNow Incident Manager",
7
      "description": "Manages ServiceNow incidents — create, update, search.",
8
      "version": "1.0.0",
9
      "tags": ["servicenow", "itsm"],
10
      "interfaces": {
11
        "rest": "http://localhost:8000/docs",
12
        "mcp":  "http://localhost:8001/sse"
13
      },
14
      "tools": [ ... ],
15
      "registered_at": "2026-02-22T09:00:00Z"
16
    },
17
    {
18
      "name": "JiraAgent",
19
      ...
20
    }
21
  ]
22
}

An orchestrator agent could query this registry at startup to discover what agents are available and where they are, rather than having their URLs hardcoded. That is the next natural step — a registry-aware orchestrator that wires agents together dynamically.

The Final Project Structure#

1
project/
2
├── agents/
3
│   ├── servicenow_agent.yaml   ← one file per agent
4
│   └── jira_agent.yaml
5
├── tools.py                    ← @ai_function ServiceNow tools
6
├── jira_tools.py               ← @ai_function Jira tools
7
├── server.py                   ← discovers and runs everything
8
└── .env                        ← Azure OpenAI credentials

Adding a new agent is now entirely a YAML and tools file concern. server.py is infrastructure — it never needs to change.

What Changed and Why It Matters#

	Part 4 (hardcoded)	Part 5 (YAML-driven)
Agent config lives in	Python dict in `server.py`	`agents/*.yaml` files
Adding a new agent requires	Editing `server.py`	Adding a new `.yaml` file
Config validation	None — errors at runtime	Pydantic at startup with clear messages
Port collision detection	None	Caught before any server starts
Multi-agent support	One agent only	Unlimited — one YAML per agent
Selective startup	Not supported	`python server.py agents/servicenow.yaml`
Non-engineer friendly	No	Yes — YAML only

The core architecture from Part 4 is unchanged — the same three surfaces, the same as_mcp_server() call, the same registry design. YAML is just the configuration layer on top.

This post is part of a series on building multi-agent systems with Microsoft Agent Framework.

← Part 1: Stdio | ← Part 2: HTTP/SSE | ← Part 3: MAF vs Raw MCP | ← Part 4: Three Interfaces