Agent with PDF Input (URL)

Code

from kern.agent import Agent
from kern.media import File
from kern.models.google import Gemini

# Create an agent backed by a Gemini model.
agent = Agent(
    model=Gemini(id="gemini-2.0-flash-exp"),
    markdown=True,
)

# Ask for a summary of a PDF referenced by URL.
agent.print_response(
    "Summarize the contents of the attached file.",
    files=[File(url="https://kern-public.s3.amazonaws.com/recipes/ThaiRecipes.pdf")],
)
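
Before passing a URL to `File(url=...)`, it can be useful to confirm that it at least looks like a PDF link. The following is a minimal sketch using only the standard library. `looks_like_pdf_url` is a hypothetical helper, not part of kern, and it inspects only the shape of the URL, not the remote content.

```python
from urllib.parse import urlparse


def looks_like_pdf_url(url: str) -> bool:
    """Return True if url is an http(s) URL whose path ends in .pdf.

    Hypothetical helper for illustration; it does not fetch the URL.
    """
    parsed = urlparse(url)
    return (
        parsed.scheme in ("http", "https")
        and bool(parsed.netloc)
        and parsed.path.lower().endswith(".pdf")
    )
```

A caller could run this check on the recipe URL above and skip the agent call when it returns False.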

Usage

Set up your virtual environment

Mac/Linux:

uv venv --python 3.12
source .venv/bin/activate

Windows:

uv venv --python 3.12
.venv\Scripts\activate

Set your API key

export GOOGLE_API_KEY=xxx
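
If the key is missing, the failure may only surface once the model is called. As a sketch, a script could check the variable up front. `require_env` is a hypothetical helper, not part of kern, and it assumes the key is supplied through the `GOOGLE_API_KEY` environment variable as shown above.

```python
import os


def require_env(name: str) -> str:
    """Return the value of environment variable `name`, or raise if unset/blank.

    Hypothetical helper for illustration only.
    """
    value = os.environ.get(name, "").strip()
    if not value:
        raise RuntimeError(f"{name} is not set; export it before running the agent.")
    return value
```

Calling `require_env("GOOGLE_API_KEY")` at the top of the script would stop early with a readable message when the key has not been exported.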

Install dependencies

uv pip install -U google-genai kern-ai

Run Agent

python cookbook/11_models/google/gemini/pdf_input_url.py