quickstarts

Gemini API Tutorials

This folder contains guides to help you explore all Gemini API features using complete end-to-end code examples.

When you're confident in your Gemini capabilities, the examples folder will be an endless source of inspiration on how to mix those capabilities together.

Gemini 3: For the most recent updates on our latest generation, please check the Get Started and the thinking guides who include migration guides.

🍌 Nano-Banana 2: Go bananas with our latest image generation model: Nano-Banana 2. Get started here with 512px, thinking, search and image grounding, and a ton of examples!

🎶 Lyria 3: Channel your inner composer with Lyria 3. Get started here with 30s and full song generation, image-to-music, and a ton of examples!

Quickstart	Description	Open
Lyria 3	Convert your ideas into songs with full control over the music structure and more!.
Nano-Banana 2 & Pro aka. Gemini Image	Get to know how the Gemini model can directly output images and edit them through multi-turn discussion. Now featuring Gemini 3.1 Flash!
Veo	Get started with video generation using the Veo models.
Imagen	Get started with our image generation model.
Imagen (REST)	Get started with our image generation model (REST version).
Thinking models	The thinking models are, as their names imply, capable of deeper chains of thoughts than the classical models, this guide will show you how to use those thinking capabilities to solve complex problems.
Lyria RealTime	The Lyria RealTime model let's your generate music and prompt the model in real-time to have it mis it for you live.
Text-to-speech	The TTS models let you generate speeches with one or even two speakers!

There're multiple ways to call the models using the Gemini API, these other Get Started guides will then show you the other ways to call the model:

Quickstart	Description	Open
Get started with Live API	Get started with the live API with this comprehensive overview of its capabilities
OpenAI compatibility	Did you know that you could use Gemini using the OpenAI SDK?

Finally, these guides will deep-dive into specific capabilities of the Gemini models and API:

Quickstart	Description	Open
Grounding	Learn how to use diffrent ways (Google Search, Google Maps, Youtube, url context) to ground your answers with external sources.
File Search	Upload and search your data using the hosted RAG capability of the File Search tool.
Search Grounding	Deep-dive into the Google search grounding capabilities.
Gemini Robotics-ER 1.5 Preview	Learn how to use the Gemini Robotics Embodied Reasoning model for spatial understanding and reasoning
Batch-mode	Use Batch-mode to send large volume of non-time-sensitive requests to the model and get a 50% discount.
Inference tiers	Learn how to use the different inference tiers to help you balance speed, cost, and reliability.
Function Calling	Discover how to have Gemini call you own function and enhaced its capabilites.
Spatial understanding	Learn how to use Gemini's spatial understanding capabilities to detect what's in your images, and reason about them
Video understanding	Learn how to use Gemini's video understanding capabilities to analyze what's in your videos
Get started with Live API tools	Now you know everything about the Live API, go to the next level and learn how to use tools with it!

These guides will walk you through the various use cases of the Gemini API:

Quickstart	Description	Open
Asynchronous requests	Learn how to use Python's async/await API with the Gemini SDK to parallelize calls.
Counting Tokens	Tokens are the basic inputs to the Gemini models. Through this
Models	Learn about the different models and parameters available in the Gemini API.
Working with files	Use the Gemini API to upload files (audio, video, images, code, text) and perform actions with them through the Gemini models.
Audio	Learn how to use the Gemini API with audio files.
JSON mode	Discover how to use JSON mode.
PDF files	Learn how to work with PDF files, and upload text and images.
System Instructions	Give models additional context on how to respond by setting system instructions.
Streaming	Learn how to use streaming for single interactions, and for chat.
Embeddings	Create high quality and task-specific embeddings.
Video	Upload a video to the Gemini API and use it in your prompt.
AI Tutors with LearnLM	Demonstrates how to craft AI tutoring experiences using system instructions aligned with learning science principles.

Name		Name	Last commit message	Last commit date
parent directory ..
file-api		file-api
rest		rest
websockets		websockets
Asynchronous_requests.ipynb		Asynchronous_requests.ipynb
Audio.ipynb		Audio.ipynb
Authentication.ipynb		Authentication.ipynb
Authentication_with_OAuth.ipynb		Authentication_with_OAuth.ipynb
Batch_mode.ipynb		Batch_mode.ipynb
Caching.ipynb		Caching.ipynb
Code_Execution.ipynb		Code_Execution.ipynb
Counting_Tokens.ipynb		Counting_Tokens.ipynb
Embeddings.ipynb		Embeddings.ipynb
Enum.ipynb		Enum.ipynb
Error_handling.ipynb		Error_handling.ipynb
File_API.ipynb		File_API.ipynb
File_Search.ipynb		File_Search.ipynb
Function_calling.ipynb		Function_calling.ipynb
Get_Started_Nano_Banana.ipynb		Get_Started_Nano_Banana.ipynb
Get_started.ipynb		Get_started.ipynb
Get_started_Deep_Research.ipynb		Get_started_Deep_Research.ipynb
Get_started_LearnLM.ipynb		Get_started_LearnLM.ipynb
Get_started_LiveAPI.ipynb		Get_started_LiveAPI.ipynb
Get_started_LiveAPI.py		Get_started_LiveAPI.py
Get_started_LiveAPI_NativeAudio.py		Get_started_LiveAPI_NativeAudio.py
Get_started_LiveAPI_tools.ipynb		Get_started_LiveAPI_tools.ipynb
Get_started_Lyria.ipynb		Get_started_Lyria.ipynb
Get_started_LyriaRealTime.ipynb		Get_started_LyriaRealTime.ipynb
Get_started_LyriaRealTime.py		Get_started_LyriaRealTime.py
Get_started_OpenAI_Compatibility.ipynb		Get_started_OpenAI_Compatibility.ipynb
Get_started_TTS.ipynb		Get_started_TTS.ipynb
Get_started_Veo.ipynb		Get_started_Veo.ipynb
Get_started_Veo_REST.ipynb		Get_started_Veo_REST.ipynb
Get_started_imagen.ipynb		Get_started_imagen.ipynb
Get_started_imagen_rest.ipynb		Get_started_imagen_rest.ipynb
Get_started_interactions_api.ipynb		Get_started_interactions_api.ipynb
Get_started_thinking.ipynb		Get_started_thinking.ipynb
Get_started_thinking_REST.ipynb		Get_started_thinking_REST.ipynb
Grounding.ipynb		Grounding.ipynb
Image_out.ipynb		Image_out.ipynb
Inference_tiers.ipynb		Inference_tiers.ipynb
JSON_mode.ipynb		JSON_mode.ipynb
Models.ipynb		Models.ipynb
New_in_002.ipynb		New_in_002.ipynb
PDF_Files.ipynb		PDF_Files.ipynb
Prompting.ipynb		Prompting.ipynb
README.md		README.md
Safety.ipynb		Safety.ipynb
Search_Grounding.ipynb		Search_Grounding.ipynb
Spatial_understanding.ipynb		Spatial_understanding.ipynb
Streaming.ipynb		Streaming.ipynb
System_instructions.ipynb		System_instructions.ipynb
Template.ipynb		Template.ipynb
Video_understanding.ipynb		Video_understanding.ipynb
gemini-robotics-er.ipynb		gemini-robotics-er.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

Gemini API Tutorials

Table of contents

Quickstart	Description	Open
Authentication	Start here to learn how you can set up your API key so you can get access to the Gemini API.
Get Started	Learn how to make your first calls to the Gemini API and get a quick overview of everything it can do.

FilesExpand file tree

quickstarts

Directory actions

More options

Directory actions

More options

Latest commit

History

quickstarts

Folders and files

parent directory

README.md

Gemini API Tutorials

Table of contents