Asynchronous Streaming Agent

Code

1import asyncio
2from typing import Iterator  # noqa
3
4from kern.agent import Agent, RunOutput  # noqa
5from kern.models.meta import Llama
6
7agent = Agent(model=Llama(id="Llama-4-Maverick-17B-128E-Instruct-FP8"), markdown=True)
8
9# Get the response in a variable
10# run_response: Iterator[RunOutputEvent] = asyncio.run(agent.arun("Share a 2 sentence horror story", stream=True))
11# for chunk in run_response:
12#     print(chunk.content)
13
14# Print the response in the terminal
15asyncio.run(agent.aprint_response("Share a 2 sentence horror story", stream=True))

Usage

Set up your virtual environment

1uv venv --python 3.12
2source .venv/bin/activate

1uv venv --python 3.12
2.venv\Scripts\activate

Set your LLAMA API key

1export LLAMA_API_KEY=YOUR_API_KEY

Install dependencies

1uv pip install llama-api-client kern-ai

Run Agent

1python cookbook/11_models/meta/async_stream.py