GenAI Python SDK

These details have not been verified by PyPI

Project links

Homepage

Project description

Google Gen AI SDK

Documentation: https://googleapis.github.io/python-genai/

Installation

pip install google-genai

Imports

from google import genai
from google.genai import types

Create a client

Please run one of the following code blocks to create a client for different services (Google AI or Vertex). Feel free to switch the client and run all the examples to see how it behaves under different APIs.

# Only run this block for Google AI API
client = genai.Client(api_key='YOUR_API_KEY')

# Only run this block for Vertex AI API
client = genai.Client(
    vertexai=True, project='your-project-id', location='us-central1'
)

Types

Parameter types can be specified as either dictionaries(TypedDict) or pydantic Models. Pydantic model types are available in the types module.

Models

The client.models modules exposes model inferencing and model getters.

Generate Content

response = client.models.generate_content(
    model='gemini-2.0-flash-exp', contents='What is your name?'
)
print(response.text)

System Instructions and Other Configs

response = client.models.generate_content(
    model='gemini-2.0-flash-exp',
    contents='high',
    config=types.GenerateContentConfig(
        system_instruction='I say high, you say low',
        temperature= 0.3,
    ),
)
print(response.text)

Typed Config

All API methods support pydantic types for parameters as well as dictionaries. You can get the type from google.genai.types.

response = client.models.generate_content(
    model='gemini-2.0-flash-exp',
    contents=types.Part.from_text('Why is sky blue?'),
    config=types.GenerateContentConfig(
        temperature=0,
        top_p=0.95,
        top_k=20,
        candidate_count=1,
        seed=5,
        max_output_tokens=100,
        stop_sequences=["STOP!"],
        presence_penalty=0.0,
        frequency_penalty=0.0,
    )
)

response

Safety Settings

response = client.models.generate_content(
    model='gemini-2.0-flash-exp',
    contents='Say something bad.',
    config=types.GenerateContentConfig(
        safety_settings= [types.SafetySetting(
            category='HARM_CATEGORY_HATE_SPEECH',
            threshold='BLOCK_ONLY_HIGH',
        )]
    ),
)
print(response.text)

Function Calling

Automatic Python function Support

You can pass a python function directly and it will be automatically called and responded.

def get_current_weather(location: str,) -> int:
  """Returns the current weather.

  Args:
    location: The city and state, e.g. San Francisco, CA
  """
  return 'sunny'

response = client.models.generate_content(
    model='gemini-2.0-flash-exp',
    contents="What is the weather like in Boston?",
    config=types.GenerateContentConfig(tools=[get_current_weather],)
)

response.text

Manually declare and invoke a function for function calling

If you don't want to use the automatic function support, you can manually declare the function and invoke it.

The following example shows how to declare a function and pass it as a tool. Then you will receive a function call part in the response.

function = dict(
    name="get_current_weather",
    description="Get the current weather in a given location",
    parameters={
      "type": "OBJECT",
      "properties": {
          "location": {
              "type": "STRING",
              "description": "The city and state, e.g. San Francisco, CA",
          },
      },
      "required": ["location"],
    }
)

tool = types.Tool(function_declarations=[function])


response = client.models.generate_content(
    model='gemini-2.0-flash-exp',
    contents="What is the weather like in Boston?",
    config=types.GenerateContentConfig(tools=[tool],)
)

response.candidates[0].content.parts[0].function_call

After you receive the function call part from model, you can invoke the function and get the function response. And then you can pass the function response to the model. The following example shows how to do it for a simple function invocation.

function_call_part = response.candidates[0].content.parts[0]

try:
  function_result = get_current_weather(**function_call_part.function_call.args)
  function_response = {'result': function_result}
except Exception as e:  # instead of raising the exception, you can let the model handle it
  function_response = {'error': str(e)}


function_response_part = types.Part.from_function_response(
    name=function_call_part.function_call.name,
    response=function_response,
)

response = client.models.generate_content(
    model='gemini-2.0-flash-exp',
    contents=[
        types.Part.from_text("What is the weather like in Boston?"),
        function_call_part,
        function_response_part,
    ])

response

JSON Response Schema

Pydantic Model Schema support

Schemas can be provided as Pydantic Models.

from pydantic import BaseModel

class CountryInfo(BaseModel):
  name: str
  population: int
  capital: str
  continent: str
  gdp: int
  official_language: str
  total_area_sq_mi: int


response = client.models.generate_content(
    model='gemini-2.0-flash-exp',
    contents='Give me information of the United States.',
    config=types.GenerateContentConfig(
        response_mime_type= 'application/json',
        response_schema= CountryInfo,
    ),
)
print(response.text)

response = client.models.generate_content(
    model='gemini-2.0-flash-exp',
    contents='Give me information of the United States.',
    config={
        'response_mime_type': 'application/json',
        'response_schema': {
            'required': [
                'name',
                'population',
                'capital',
                'continent',
                'gdp',
                'official_language',
                'total_area_sq_mi',
            ],
            'properties': {
                'name': {'type': 'STRING'},
                'population': {'type': 'INTEGER'},
                'capital': {'type': 'STRING'},
                'continent': {'type': 'STRING'},
                'gdp': {'type': 'INTEGER'},
                'official_language': {'type': 'STRING'},
                'total_area_sq_mi': {'type': 'INTEGER'},
            },
            'type': 'OBJECT',
        },
    },
)
print(response.text)

Streaming

Streaming for text content

for chunk in client.models.generate_content_stream(
    model='gemini-2.0-flash-exp', contents='Tell me a story in 300 words.'
):
  print(chunk.text)

Streaming for image content

If your image is stored in Google Cloud Storage, you can use the from_uri class method to create a Part object.

for chunk in client.models.generate_content_stream(
    model='gemini-1.5-flash',
    contents=[
      'What is this image about?',
      types.Part.from_uri(
        file_uri='gs://generativeai-downloads/images/scones.jpg',
        mime_type='image/jpeg'
      )
    ],
):
  print(chunk.text)

If your image is stored in your local file system, you can read it in as bytes data and use the from_bytes class method to create a Part object.

YOUR_IMAGE_PATH = 'your_image_path'
YOUR_IMAGE_MIME_TYPE = 'your_image_mime_type'
with open(YOUR_IMAGE_PATH, 'rb') as f:
  image_bytes = f.read()

for chunk in client.models.generate_content_stream(
    model='gemini-1.5-flash',
    contents=[
      'What is this image about?',
      types.Part.from_bytes(
        data=image_bytes,
        mime_type=YOUR_IMAGE_MIME_TYPE
      )
    ],
):
  print(chunk.text)

Async

client.aio exposes all the analogous async methods that are available on client

For example, client.aio.models.generate_content is the async version of client.models.generate_content

request = await client.aio.models.generate_content(
    model='gemini-2.0-flash-exp', contents='Tell me a story in 300 words.'
)

print(response.text)

Streaming

async for response in client.aio.models.generate_content_stream(
    model='gemini-2.0-flash-exp', contents='Tell me a story in 300 words.'
):
  print(response.text)

Count Tokens and Compute Tokens

response = client.models.count_tokens(
    model='gemini-2.0-flash-exp',
    contents='What is your name?',
)
print(response)

Compute Tokens

Compute tokens is not supported by Google AI.

response = client.models.compute_tokens(
    model='gemini-2.0-flash-exp',
    contents='What is your name?',
)
print(response)

Async

response = await client.aio.models.count_tokens(
    model='gemini-2.0-flash-exp',
    contents='What is your name?',
)
print(response)

Embed Content

response = client.models.embed_content(
    model='text-embedding-004',
    contents='What is your name?',
)
response

# multiple contents with config
response = client.models.embed_content(
    model='text-embedding-004',
    contents=['What is your name?', 'What is your age?'],
    config=types.EmbedContentConfig(output_dimensionality= 10)
)

response

Imagen

Generate Image

Support for generate image in Google AI is behind an allowlist

# Generate Image
response1 = client.models.generate_image(
    model='imagen-3.0-generate-001',
    prompt='An umbrella in the foreground, and a rainy night sky in the background',
    config=types.GenerateImageConfig(
        negative_prompt= 'human',
        number_of_images= 1,
        include_rai_reason= True,
        output_mime_type= 'image/jpeg'
    )
)
response1.generated_images[0].image.show()

Upscale Image

Upscale image is not supported in Google AI.

# Upscale the generated image from above
response2 = client.models.upscale_image(
    model='imagen-3.0-generate-001',
    image=response1.generated_images[0].image,
    upscale_factor='x2',
    config=types.UpscaleImageConfig(
        include_rai_reason= True,
        output_mime_type= 'image/jpeg',
    ),
)
response2.generated_images[0].image.show()

Edit Image

Edit image uses a separate model from generate and upscale.

Edit image is not supported in Google AI.

# Edit the generated image from above
from google.genai.types import RawReferenceImage, MaskReferenceImage
raw_ref_image = RawReferenceImage(
    reference_id=1,
    reference_image=response1.generated_images[0].image,
)

# Model computes a mask of the background
mask_ref_image = MaskReferenceImage(
    reference_id=2,
    config=types.MaskReferenceConfig(
        mask_mode='MASK_MODE_BACKGROUND',
        mask_dilation=0,
    ),
)

response3 = client.models.edit_image(
    model='imagen-3.0-capability-001',
    prompt='Sunlight and clear sky',
    reference_images=[raw_ref_image, mask_ref_image],
    config=types.EditImageConfig(
        edit_mode= 'EDIT_MODE_INPAINT_INSERTION',
        number_of_images= 1,
        negative_prompt= 'human',
        include_rai_reason= True,
        output_mime_type= 'image/jpeg',
    ),
)
response3.generated_images[0].image.show()

Chats

Create a chat session to start a multi-turn conversations with the model.

Send Message

chat = client.chats.create(model='gemini-2.0-flash-exp')
response = chat.send_message('tell me a story')
print(response.text)

Streaming

chat = client.chats.create(model='gemini-2.0-flash-exp')
for chunk in chat.send_message_stream('tell me a story'):
  print(chunk.text)

Async

chat = client.aio.chats.create(model='gemini-2.0-flash-exp')
response = await chat.send_message('tell me a story')
print(response.text)

Async Streaming

chat = client.aio.chats.create(model='gemini-2.0-flash-exp')
async for chunk in chat.send_message_stream('tell me a story'):
  print(chunk.text)

Files (Only Google AI)

!gsutil cp gs://cloud-samples-data/generative-ai/pdf/2312.11805v3.pdf .
!gsutil cp gs://cloud-samples-data/generative-ai/pdf/2403.05530.pdf .

Upload

file1 = client.files.upload(path='2312.11805v3.pdf')
file2 = client.files.upload(path='2403.05530.pdf')

print(file1)
print(file2)

Delete

file3 = client.files.upload(path='2312.11805v3.pdf')

client.files.delete(name=file3.name)

Caches

client.caches contains the control plane APIs for cached content

Create

if client.vertexai:
  file_uris = [
      'gs://cloud-samples-data/generative-ai/pdf/2312.11805v3.pdf',
      'gs://cloud-samples-data/generative-ai/pdf/2403.05530.pdf'
  ]
else:
  file_uris = [file1.uri, file2.uri]

cached_content = client.caches.create(
      model='gemini-1.5-pro-002',
      config=types.CreateCachedContentConfig(
          contents=[
              types.Content(
                  role='user',
                  parts=[
                    types.Part.from_uri(
                        file_uri=file_uris[0],
                        mime_type='application/pdf'),
                    types.Part.from_uri(
                        file_uri=file_uris[1],
                        mime_type='application/pdf',)])
          ],
          system_instruction='What is the sum of the two pdfs?',
          display_name='test cache',
          ttl='3600s',
      ),
  )

Get

client.caches.get(name=cached_content.name)

Generate Content

client.models.generate_content(
    model='gemini-1.5-pro-002',
    contents='Summarize the pdfs',
    config=types.GenerateContentConfig(
        cached_content=cached_content.name,
    )
)

Tunings

client.tunings contains tuning job APIs and supports supervised fine tuning through tune and distillation through distill

Tune

Vertex supports tuning from GCS source
Google AI supports tuning from inline examples

if client.vertexai:
  model = 'gemini-1.5-pro-002'
  training_dataset=types.TuningDataset(
        gcs_uri='gs://cloud-samples-data/ai-platform/generative_ai/gemini-1_5/text/sft_train_data.jsonl',
  )
else:
  model = 'models/gemini-1.0-pro-001'
  training_dataset=types.TuningDataset(
        examples=[
            types.TuningExample(
                text_input=f"Input text {i}",
                output=f"Output text {i}",
            )
            for i in range(5)
        ],
    )

tuning_job = client.tunings.tune(
    base_model=model,
    training_dataset=training_dataset,
    config=types.CreateTuningJobConfig(
        epoch_count= 1,
        tuned_model_display_name="test_dataset_examples model"
    )
)
tuning_job

Get Tuning Job

tuning_job = client.tunings.get(name=tuning_job.name)
tuning_job

import time

running_states = set([
    "JOB_STATE_PENDING",
    "JOB_STATE_RUNNING",
])

while tuning_job.state in running_states:
    print(tuning_job.state)
    tuning_job = client.tunings.get(name=tuning_job.name)
    time.sleep(10)

Use Tuned Model

response = client.models.generate_content(
    model=tuning_job.tuned_model.endpoint,
    contents='What is your name?',
)

response.text

Get Tuned Model

tuned_model = client.models.get(model=tuning_job.tuned_model.model)
tuned_model

List Tuned Models

for model in client.models.list(config={'page_size': 10}):
  print(model)

pager = client.models.list(config={'page_size': 10})
print(pager.page_size)
print(pager[0])
pager.next_page()
print(pager[0])

Async

async for job in await client.aio.models.list(config={'page_size': 10}):
  print(job)

async_pager = await client.aio.models.list(config={'page_size': 10})
print(async_pager.page_size)
print(async_pager[0])
await async_pager.next_page()
print(async_pager[0])

Update Tuned Model

model = pager[0]

model = client.models.update(
    model=model.name,
    config=types.UpdateModelConfig(
        display_name='my tuned model',
        description='my tuned model description'))

model

Distillation

Only supported on Vertex. Requires allowlist.

distillation_job = client.tunings.distill(
    student_model="gemma-2b-1.1-it",
    teacher_model="gemini-1.5-pro-002",
    training_dataset=genai.types.DistillationDataset(
        gcs_uri="gs://cloud-samples-data/ai-platform/generative_ai/gemini-1_5/text/sft_train_data.jsonl",
    ),
    config=genai.types.CreateDistillationJobConfig(
        epoch_count=1,
        pipeline_root_directory=(
            "gs://my-bucket"
        ),
    ),
)
distillation_job

tcompleted_states = set([
    "JOB_STATE_SUCCEEDED",
    "JOB_STATE_FAILED",
    "JOB_STATE_CANCELLED",
    "JOB_STATE_PAUSED"
])

while distillation_job.state not in completed_states:
    print(distillation_job.state)
    distillation_job = client.tunings.get(name=distillation_job.name)
    time.sleep(10)

distillation_job

List Tuning Jobs

for job in client.tunings.list(config={'page_size': 10}):
  print(job)

pager = client.tunings.list(config={'page_size': 10})
print(pager.page_size)
print(pager[0])
pager.next_page()
print(pager[0])

Async

async for job in await client.aio.tunings.list(config={'page_size': 10}):
  print(job)

async_pager = await client.aio.tunings.list(config={'page_size': 10})
print(async_pager.page_size)
print(async_pager[0])
await async_pager.next_page()
print(async_pager[0])

Batch Prediction

Only supported in Vertex AI.

Create

# Specify model and source file only, destination and job display name will be auto-populated
job = client.batches.create(
    model='gemini-1.5-flash-002',
    src='bq://my-project.my-dataset.my-table',
)

job

# Get a job by name
job = client.batches.get(name=job.name)

job.state

completed_states = set([
    "JOB_STATE_SUCCEEDED",
    "JOB_STATE_FAILED",
    "JOB_STATE_CANCELLED",
    "JOB_STATE_PAUSED"
])

while job.state not in completed_states:
    print(job.state)
    job = client.batches.get(name=job.name)
    time.sleep(30)

job

List

for job in client.batches.list(config={'page_size': 10}):
  print(job)

pager = client.batches.list(config={'page_size': 10})
print(pager.page_size)
print(pager[0])
pager.next_page()
print(pager[0])

Async

async for job in await client.aio.batches.list(config={'page_size': 10}):
  print(job)

async_pager = await client.aio.batches.list(config={'page_size': 10})
print(async_pager.page_size)
print(async_pager[0])
await async_pager.next_page()
print(async_pager[0])

Delete

# Delete the job resource
delete_job = client.batches.delete(name=job.name)

delete_job

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

1.73.1

Apr 14, 2026

1.73.0

Apr 13, 2026

1.72.0

Apr 9, 2026

1.71.0

Apr 8, 2026

1.70.0

Apr 1, 2026

1.69.0

Mar 28, 2026

1.68.0

Mar 18, 2026

1.67.0

Mar 12, 2026

1.66.0

Mar 4, 2026

1.65.0

Feb 26, 2026

1.64.0

Feb 19, 2026

1.63.0

Feb 11, 2026

1.62.0

Feb 4, 2026

1.61.0

Jan 30, 2026

1.60.0

Jan 21, 2026

1.59.0

Jan 15, 2026

1.58.0

Jan 15, 2026

1.57.0

Jan 7, 2026

1.56.0

Dec 17, 2025

1.55.0

Dec 11, 2025

1.54.0

Dec 8, 2025

1.53.0

Dec 3, 2025

1.52.0

Nov 21, 2025

1.51.0

Nov 18, 2025

1.50.1

Nov 13, 2025

1.50.0

Nov 12, 2025

1.49.0

Nov 5, 2025

1.48.0

Nov 3, 2025

1.47.0

Oct 29, 2025

1.46.0

Oct 21, 2025

1.45.0

Oct 15, 2025

1.44.0

Oct 15, 2025

1.43.0

Oct 10, 2025

1.42.0

Oct 8, 2025

1.41.0

Oct 2, 2025

1.40.0

Oct 1, 2025

1.39.1

Sep 26, 2025

1.39.0

Sep 25, 2025

1.38.0

Sep 16, 2025

1.37.0

Sep 16, 2025

1.36.0

Sep 10, 2025

1.35.0

Sep 9, 2025

1.34.0

Sep 9, 2025

1.33.0

Sep 3, 2025

1.32.0

Aug 27, 2025

1.31.0

Aug 18, 2025

1.30.0

Aug 14, 2025

1.29.0

Aug 6, 2025

1.28.0

Jul 30, 2025

1.27.0

Jul 23, 2025

1.26.0

Jul 16, 2025

1.25.0

Jul 9, 2025

1.24.0

Jul 1, 2025

1.23.0

Jun 27, 2025

1.22.0

Jun 26, 2025

1.21.1

Jun 19, 2025

1.21.0

Jun 18, 2025

1.20.0

Jun 11, 2025

1.19.0

Jun 4, 2025

1.18.0

May 30, 2025

1.17.0

May 28, 2025

1.16.1

May 20, 2025

1.16.0 yanked

May 19, 2025

1.15.0

May 13, 2025

1.14.0

May 7, 2025

1.13.0

Apr 30, 2025

1.12.1

Apr 24, 2025

1.12.0 yanked

Apr 24, 2025

1.11.0

Apr 16, 2025

1.10.0

Apr 9, 2025

1.9.0

Apr 1, 2025

1.8.0

Mar 26, 2025

1.7.0

Mar 18, 2025

1.6.0 yanked

Mar 13, 2025

1.5.0

Mar 7, 2025

1.4.0

Mar 5, 2025

1.3.0

Feb 24, 2025

1.2.0

Feb 12, 2025

1.1.0

Feb 10, 2025

1.0.0

Feb 5, 2025

1.0.0rc0 pre-release

Feb 4, 2025

0.8.0

Jan 30, 2025

0.7.0

Jan 28, 2025

0.6.0

Jan 21, 2025

0.5.0

Jan 13, 2025

This version

0.4.0

Jan 8, 2025

0.3.0

Dec 17, 2024

0.2.2

Dec 13, 2024

0.2.1

Dec 12, 2024

0.2.0

Dec 12, 2024

0.1.0

Dec 11, 2024

0.0.1

Dec 10, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

google_genai-0.4.0.tar.gz (107.6 kB view details)

Uploaded Jan 8, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

google_genai-0.4.0-py3-none-any.whl (113.6 kB view details)

Uploaded Jan 8, 2025 Python 3

File details

Details for the file google_genai-0.4.0.tar.gz.

File metadata

Download URL: google_genai-0.4.0.tar.gz
Upload date: Jan 8, 2025
Size: 107.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.0.1 CPython/3.11.9

File hashes

Hashes for google_genai-0.4.0.tar.gz
Algorithm	Hash digest
SHA256	`d14ce2e941063092cfc98726aeabcae44f179456e3a4906ee5f28dc91b0663fb`
MD5	`07eb6def414bb8a6d80798aba945eebd`
BLAKE2b-256	`8ffae8c81d37ffe7d8aa05573494735cdc432a97b77f641a08caa959de19523d`

See more details on using hashes here.

File details

Details for the file google_genai-0.4.0-py3-none-any.whl.

File metadata

Download URL: google_genai-0.4.0-py3-none-any.whl
Upload date: Jan 8, 2025
Size: 113.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.0.1 CPython/3.11.9

File hashes

Hashes for google_genai-0.4.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`2cbfea3cb47d4ac54ee3d3f9ecd79ff72298cac13e150828afdc5ed62768ed00`
MD5	`33598cce0ea7bbd73b711cb3ada18d2f`
BLAKE2b-256	`9daccf91960fc842f8c3387be8abeaa01deb0e6b20a72a028b70107f58e13150`

See more details on using hashes here.

google-genai 0.4.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Google Gen AI SDK

Installation

Imports

Create a client

Types

Models

Generate Content

System Instructions and Other Configs

Typed Config

Safety Settings

Function Calling

Automatic Python function Support

Manually declare and invoke a function for function calling

JSON Response Schema

Pydantic Model Schema support

Streaming

Streaming for text content

Streaming for image content

Async

Streaming

Count Tokens and Compute Tokens

Compute Tokens

Async

Embed Content

Imagen

Generate Image

Upscale Image

Edit Image

Chats

Send Message

Streaming

Async

Async Streaming

Files (Only Google AI)

Upload

Delete

Caches

Create

Get

Generate Content

Tunings

Tune

Get Tuning Job

Use Tuned Model

Get Tuned Model

List Tuned Models

Async

Update Tuned Model

Distillation

List Tuning Jobs

Async

Batch Prediction

Create

List

Async

Delete

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details