Migrate from OpenAI SDK to Gen AI SDK

This page explains how to migrate code written for the OpenAI SDK to the Google Gen AI SDK so that you can use Gemini models on Vertex AI.

Migration Overview

A companion notebook demonstrates a practical migration from the openai library to the google-genai library.

API & Syntax Mapping

The following table compares the core components, methods, and parameters of the OpenAI SDK with the Gen AI SDK.

| Feature | OpenAI SDK (openai) | Gen AI SDK (google-genai) |
| --- | --- | --- |
| Client initialization | client = OpenAI(api_key=...) | client = genai.Client(vertexai=True, ...) |
| Generation method | client.chat.completions.create | client.models.generate_content |
| Streaming | stream=True (parameter) | client.models.generate_content_stream (method) |
| User input | messages=[{"role": "user", "content": "..."}] | contents="..." (str) or contents=[...] (list) |
| System instructions | messages=[{"role": "system", "content": "..."}] | config=types.GenerateContentConfig(system_instruction=...) |
| Response access | response.choices[0].message.content | response.text |
| Chat history | Manual list management (messages.append) | client.chats.create() (stateful object) |
| Max tokens | max_tokens | max_output_tokens (inside config) |
| Temperature | temperature | temperature (inside config) |
| JSON mode | response_format={"type": "json_object"} | response_mime_type="application/json" (inside config) |

Installation and Setup

Uninstall the OpenAI library and install the Gen AI SDK.

pip uninstall openai
pip install google-genai

Authentication & Initialization

While the OpenAI SDK authenticates with an API key, Vertex AI uses Identity and Access Management (IAM) credentials through Application Default Credentials (ADC). You must also explicitly specify your project ID and location.
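
If Application Default Credentials are not yet configured in your environment, you can typically set them up with the Google Cloud CLI (assuming gcloud is installed and you have access to the project):

gcloud auth application-default login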

OpenAI SDK:

from openai import OpenAI

# Relies on the OPENAI_API_KEY environment variable
client = OpenAI()

Gen AI SDK:

from google import genai

# Use vertexai=True to target the Vertex AI platform
client = genai.Client(
    vertexai=True,
    project='your-project-id',
    location='us-central1'
)
Tip: You can also set environment variables to initialize the client without arguments, similar to how the OpenAI client reads the API key from the environment.

Set GOOGLE_GENAI_USE_VERTEXAI, GOOGLE_CLOUD_PROJECT, and GOOGLE_CLOUD_LOCATION, as shown:

export GOOGLE_GENAI_USE_VERTEXAI=true
export GOOGLE_CLOUD_PROJECT='your-project-id'
export GOOGLE_CLOUD_LOCATION='global'

Once configured, you can initialize the client without passing parameters:

from google import genai

client = genai.Client()

Code Examples

The following code samples show the differences between the OpenAI SDK and Google Gen AI SDK for common tasks.

Single-turn text generation

The following code samples show how to generate text. Note that in the Google Gen AI SDK, system instructions are handled as a configuration parameter rather than a message role in the input list.

OpenAI SDK:

response = client.chat.completions.create(
    model="gpt-4",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Explain quantum physics."}
    ]
)
print(response.choices[0].message.content)

Gen AI SDK:

from google.genai import types

response = client.models.generate_content(
    model="gemini-2.5-flash",
    contents="Explain quantum physics.",
    config=types.GenerateContentConfig(
        system_instruction="You are a helpful assistant."
    )
)
print(response.text)
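
The contents argument also accepts a list, which is useful when migrating multi-part user input. A minimal sketch, assuming each string in the list becomes a separate part of the same user turn:

# A list of strings is combined into one multi-part user message
response = client.models.generate_content(
    model="gemini-2.5-flash",
    contents=[
        "Summarize the following text:",
        "Quantum physics studies matter and energy at the smallest scales."
    ]
)
print(response.text)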

Text generation with parameters

The following code samples show the differences in defining configuration parameters. In the Google Gen AI SDK, parameters such as temperature, max_output_tokens (called max_tokens in the OpenAI SDK), and JSON output formatting are grouped into a single GenerateContentConfig object.

OpenAI SDK:

response = client.chat.completions.create(
    model="gpt-4",
    messages=[
        {"role": "user", "content": "List 3 types of apples in JSON."}
    ],
    temperature=0.7,
    max_tokens=1000,
    response_format={"type": "json_object"}
)

print(response.choices[0].message.content)

Gen AI SDK:

from google.genai import types

config = types.GenerateContentConfig(
    temperature=0.7,
    max_output_tokens=1000,
    response_mime_type="application/json"
)

response = client.models.generate_content(
    model="gemini-2.5-flash",
    contents="List 3 types of apples in JSON.",
    config=config
)

print(response.text)
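
If the JSON output must conform to a specific structure, the Gen AI SDK also accepts a response_schema in the same config object. A minimal sketch using a Pydantic model as the schema (the Apple class here is a hypothetical example):

from pydantic import BaseModel
from google.genai import types

class Apple(BaseModel):
    name: str
    color: str

config = types.GenerateContentConfig(
    response_mime_type="application/json",
    # Constrain the output to a JSON list of Apple objects
    response_schema=list[Apple]
)

response = client.models.generate_content(
    model="gemini-2.5-flash",
    contents="List 3 types of apples.",
    config=config
)

# response.parsed returns the output deserialized into the schema
print(response.parsed)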

Chat (Multi-turn)

The following code samples show the differences in managing chat history. The Google Gen AI SDK simplifies this by providing a stateful chat object, whereas the OpenAI SDK requires manually appending messages to a list.

OpenAI SDK:

# You must manually manage the list state
messages = [{"role": "user", "content": "Hi"}]

response = client.chat.completions.create(
    model="gpt-4",
    messages=messages
)

# Append the response to the history manually
messages.append(response.choices[0].message)
messages.append({"role": "user", "content": "Next question"})

response2 = client.chat.completions.create(
    model="gpt-4",
    messages=messages
)
print(response2.choices[0].message.content)

Gen AI SDK:

from google.genai import types

# The SDK manages history for you
chat = client.chats.create(
    model="gemini-2.5-flash",
    config=types.GenerateContentConfig(
        system_instruction="You are a helpful assistant."
    )
)

response1 = chat.send_message("Hi")
print(response1.text)

# History is retained automatically in the chat object
response2 = chat.send_message("Next question")
print(response2.text)
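
If you are migrating an existing OpenAI-style message list, you can seed a new chat with prior turns through the history parameter. A minimal sketch; note that the Gen AI SDK uses the role "model" where the OpenAI SDK uses "assistant":

from google.genai import types

# Seed the chat with previously exchanged turns
chat = client.chats.create(
    model="gemini-2.5-flash",
    history=[
        types.Content(role="user", parts=[types.Part.from_text(text="Hi")]),
        types.Content(role="model", parts=[types.Part.from_text(text="Hello! How can I help?")])
    ]
)

response = chat.send_message("Next question")
print(response.text)

You can inspect the accumulated turns at any point with chat.get_history().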

Streaming

The following code samples show the differences in streaming responses. The Google Gen AI SDK uses a dedicated method (generate_content_stream) rather than a boolean flag.

OpenAI SDK:

stream = client.chat.completions.create(
    model="gpt-4",
    messages=[{"role": "user", "content": "Write a story."}],
    stream=True
)

for chunk in stream:
    if chunk.choices[0].delta.content:
        print(chunk.choices[0].delta.content, end="")

Gen AI SDK:

stream = client.models.generate_content_stream(
    model="gemini-2.5-flash",
    contents="Write a story."
)

for chunk in stream:
    print(chunk.text, end="")
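
If your OpenAI code used AsyncOpenAI, the Gen AI SDK exposes an equivalent asynchronous surface under client.aio with the same method names. A minimal streaming sketch:

import asyncio

async def main():
    # The async variant is awaited and returns an async iterator
    stream = await client.aio.models.generate_content_stream(
        model="gemini-2.5-flash",
        contents="Write a story."
    )
    async for chunk in stream:
        print(chunk.text, end="")

asyncio.run(main())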
