Image Agent

Code

1from kern.agent import Agent
2from kern.media import Image
3from kern.models.dashscope import DashScope
4from kern.tools.hackernews import HackerNewsTools
5
6agent = Agent(
7 model=DashScope(id="qwen-vl-plus"),
8 tools=[HackerNewsTools()],
9 markdown=True,
10)
11
12agent.print_response(
13 "Analyze this image in detail and tell me what you see. Also search for more information about the subject.",
14 images=[
15 Image(
16 url="https://upload.wikimedia.org/wikipedia/commons/0/0c/GoldenGateBridge-001.jpg"
17 )
18 ],
19 stream=True,
20)

Usage

Set up your virtual environment

1uv venv --python 3.12
2source .venv/bin/activate
1uv venv --python 3.12
2.venv\Scripts\activate

Set your API key

1export DASHSCOPE_API_KEY=xxx

Install dependencies

1uv pip install -U kern-ai

Run Agent

1python cookbook/11_models/dashscope/image_agent.py