Vertex AI 기반 Imagen을 사용하여 텍스트 프롬프트로 이미지 생성

주의: 다음 Imagen 4 프리뷰 모델은 2025년 11월 30일에 삭제됩니다. imagen-4.0-generate-preview-06-06, imagen-4.0-ultra-generate-preview-06-06, imagen-4.0-fast-generate-preview-06-06 서비스 중단을 방지하려면 2025년 11월 30일 전에 Imagen 4 프리뷰 모델을 사용하는 모든 워크플로를 다음의 정식 버전 Imagen 4 모델로 마이그레이션하세요. imagen-4.0-generate-001, imagen-4.0-ultra-generate-001, imagen-4.0-fast-generate-001

Vertex AI 기반 Imagen을 사용하여 텍스트 프롬프트에서 새 이미지를 생성할 수 있습니다. 지원되는 인터페이스에는 Google Cloud 콘솔과 Vertex AI API가 있습니다.

이미지 생성 및 수정을 위한 텍스트 프롬프트 작성에 대한 상세 내용은 프롬프트 가이드를 참조하세요.

생성용 Imagen 모델 카드 보기

이미지 생성 사용해 보기(Vertex AI Studio)

Colab에서 Imagen 사용해 보기

시작하기 전에

Sign in to your Google Cloud account. If you're new to Google Cloud, create an account to evaluate how our products perform in real-world scenarios. New customers also get $300 in free credits to run, test, and deploy workloads.

In the Google Cloud console, on the project selector page, select or create a Google Cloud project.

Roles required to select or create a project

Select a project: Selecting a project doesn't require a specific IAM role—you can select any project that you've been granted a role on.
Create a project: To create a project, you need the Project Creator role (roles/resourcemanager.projectCreator), which contains the resourcemanager.projects.create permission. Learn how to grant roles.

Go to project selector

Verify that billing is enabled for your Google Cloud project.

Enable the Vertex AI API.

Roles required to enable APIs

To enable APIs, you need the Service Usage Admin IAM role (roles/serviceusage.serviceUsageAdmin), which contains the serviceusage.services.enable permission. Learn how to grant roles.

Enable the API

In the Google Cloud console, on the project selector page, select or create a Google Cloud project.

Roles required to select or create a project

Select a project: Selecting a project doesn't require a specific IAM role—you can select any project that you've been granted a role on.
Create a project: To create a project, you need the Project Creator role (roles/resourcemanager.projectCreator), which contains the resourcemanager.projects.create permission. Learn how to grant roles.

Go to project selector

Verify that billing is enabled for your Google Cloud project.

Enable the Vertex AI API.

Roles required to enable APIs

To enable APIs, you need the Service Usage Admin IAM role (roles/serviceusage.serviceUsageAdmin), which contains the serviceusage.services.enable permission. Learn how to grant roles.

Enable the API

환경에 대한 인증을 설정하세요.

Select the tab for how you plan to use the samples on this page:
Console

When you use the Google Cloud console to access Google Cloud services and APIs, you don't need to set up authentication.
Python

로컬 개발 환경에서 이 페이지의 Python 샘플을 사용하려면 gcloud CLI를 설치하고 초기화한 후 사용자 인증 정보로 애플리케이션 기본 사용자 인증 정보를 설정합니다.
자세한 내용은 Google Cloud 인증 문서의 로컬 개발 환경의 ADC 설정을 참고하세요.
REST

로컬 개발 환경에서 이 페이지의 REST API 샘플을 사용하려면 gcloud CLI에 제공한 사용자 인증 정보를 사용합니다.
자세한 내용은 Google Cloud 인증 문서의 REST 사용을 위한 인증을 참조하세요.
텍스트로 이미지 생성

설명 텍스트만 입력으로 사용하여 새로운 이미지를 생성할 수 있습니다. 다음 샘플에서는 이미지 생성에 대한 기본 안내를 보여줍니다.

중요: 프롬프트가 복잡하고 향상된 프롬프트를 사용하는 경우 imagen-4.0-fast-generate-001에서 원치 않는 결과가 생성될 수 있습니다. 이 문제를 해결하려면 Google Cloud 콘솔에서 글쓰기 도우미를 사용하지 않거나 enhancePrompt를 false로 설정하세요.
콘솔
1. Google Cloud 콘솔에서 Vertex AI > Media Studio 페이지로 이동합니다.
<a href="https://console.cloud.google.com/vertex-ai/studio/media/generate;tab=image" class="button button-primary" target="console" track-name="consoleLink" track-type="task">Go to Media Studio</a>
1. Imagen을 클릭합니다. Imagen Media Studio 이미지 생성 페이지가 표시됩니다.
2. 선택사항: 설정 창에서 다음 설정을 구성합니다.
  
  모델: 사용 가능한 옵션 중에서 모델을 선택합니다.
  
  사용 가능한 모델에 대한 자세한 내용은 Imagen 모델을 참조하세요.
  
  가로세로 비율: 사용 가능한 옵션 중에서 가로세로 비율을 선택합니다.
  
  결과 수: 슬라이더를 조정하거나 1~4 사이의 값을 입력합니다.
  
  출력 해상도: 사용 가능한 옵션 중에서 해상도를 선택합니다.
3. 선택사항: 고급 옵션 섹션에서 이미지를 생성할 리전을 선택합니다.
4. 프롬프트 작성 상자에 생성할 이미지를 설명하는 텍스트 프롬프트를 입력합니다. 예를 들면 아침 물 위에 떠 있는 작은 배 수채화 이미지입니다.
  
  효과적인 프롬프트를 작성하는 방법에 대한 자세한 내용은 프롬프트 및 이미지 속성 가이드를 참조하세요.
5. 생성을 클릭합니다.
디지털 워터마크가 생성된 이미지에 자동으로 추가됩니다. Google Cloud 콘솔을 사용하여 이미지 생성 시에 디지털 워터마크를 중지할 수 없습니다.

이미지를 선택하여 이미지 세부정보 창에서 볼 수 있습니다. 워터마크가 추가된 이미지에는 디지털 워터마크 배지가 포함됩니다. 명시적으로 이미지 워터마크를 확인할 수도 있습니다.

^{프롬프트에서 Imagen 2로 생성된 워터마크가 추가된 이미지의 이미지 세부정보 뷰: 저채도 색상의 아침 물 위에 떠 있는 빨간색 작은 배 수채화 이미지.}
Python

설치
pip install --upgrade google-genai
자세한 내용은 SDK 참고 문서를 참고하세요.

Vertex AI에서 Gen AI SDK를 사용하도록 환경 변수를 설정합니다.
# Replace the `GOOGLE_CLOUD_PROJECT` and `GOOGLE_CLOUD_LOCATION` values # with appropriate values for your project. export GOOGLE_CLOUD_PROJECT=GOOGLE_CLOUD_PROJECT export GOOGLE_CLOUD_LOCATION=global export GOOGLE_GENAI_USE_VERTEXAI=True
이 샘플에서는 ImageGenerationModel에서 generate_images 메서드를 호출하고 생성된 이미지를 로컬로 저장합니다. 그런 다음 노트북에서 show() 메서드를 선택적으로 사용하여 생성된 이미지를 표시할 수 있습니다. 모델 버전과 기능에 대한 자세한 내용은 Imagen 모델을 참조하세요.
from google import genai from google.genai.types import GenerateImagesConfig client = genai.Client() # TODO(developer): Update and un-comment below line # output_file = "output-image.png" image = client.models.generate_images( model="imagen-4.0-generate-001", prompt="A dog reading a newspaper", config=GenerateImagesConfig( image_size="2K", ), ) image.generated_images[0].image.save(output_file) print(f"Created output image using {len(image.generated_images[0].image.image_bytes)} bytes") # Example response: # Created output image using 1234567 bytes
REST

요청 데이터를 사용하기 전에 다음을 바꿉니다.
- REGION: 프로젝트가 있는 리전. 지원되는 리전에 대한 자세한 내용은 Vertex AI의 생성형 AI 위치를 참고하세요.
- PROJECT_ID: Google Cloud 프로젝트 ID
- MODEL_VERSION: 사용할 Imagen 모델 버전. 사용 가능한 모델에 대한 자세한 내용은 Imagen 모델을 참조하세요.
- TEXT_PROMPT: 모델이 생성하는 이미지를 안내하는 텍스트 프롬프트. 이 필드는 생성 및 수정 모두에서 필요합니다.
- IMAGE_COUNT: 생성할 이미지 수. 허용되는 값의 범위는 1~4입니다.
HTTP 메서드 및 URL:
POST https://REGION-aiplatform.googleapis.com/v1/projects/PROJECT_ID/locations/REGION/publishers/google/models/MODEL_VERSION:predict
JSON 요청 본문:
{ "instances": [ { "prompt": "TEXT_PROMPT" } ], "parameters": { "sampleCount": IMAGE_COUNT } }
요청을 보내려면 다음 옵션 중 하나를 선택합니다.
curl

참고: 다음 명령어는 gcloud init 또는 gcloud auth login을 실행하거나 gcloud CLI에 자동으로 로그인하는 Cloud Shell을 사용하여 사용자 계정으로 gcloud CLI에 로그인했다고 가정합니다. gcloud auth list를 실행하면 현재 활성 계정을 확인할 수 있습니다.

요청 본문을 request.json 파일에 저장하고 다음 명령어를 실행합니다.

curl -X POST \
-H "Authorization: Bearer $(gcloud auth print-access-token)" \
-H "Content-Type: application/json; charset=utf-8" \
-d @request.json \
"https://REGION-aiplatform.googleapis.com/v1/projects/PROJECT_ID/locations/REGION/publishers/google/models/MODEL_VERSION:predict"

PowerShell

참고: 다음 명령어는 gcloud init 또는 gcloud auth login을 실행하여 사용자 계정으로 gcloud CLI에 로그인했다고 가정합니다. gcloud auth list를 실행하면 현재 활성 계정을 확인할 수 있습니다.

요청 본문을 request.json 파일에 저장하고 다음 명령어를 실행합니다.

$cred = gcloud auth print-access-token
$headers = @{ "Authorization" = "Bearer $cred" }

Invoke-WebRequest `
-Method POST `
-Headers $headers `
-ContentType: "application/json; charset=utf-8" `
-InFile request.json `
-Uri "https://REGION-aiplatform.googleapis.com/v1/projects/PROJECT_ID/locations/REGION/publishers/google/models/MODEL_VERSION:predict" | Select-Object -Expand Content
다음은 "sampleCount": 2 요청에 대한 샘플 응답입니다. 응답은 생성된 이미지 바이트를 base64로 인코딩한 예측 객체 2개를 반환합니다.
{ "predictions": [ { "bytesBase64Encoded": "BASE64_IMG_BYTES", "mimeType": "image/png" }, { "mimeType": "image/png", "bytesBase64Encoded": "BASE64_IMG_BYTES" } ] }
프롬프트를 개선할 수 있는 모델을 사용하는 경우 생성에 사용된 개선된 프롬프트가 포함된 추가 prompt 필드가 응답에 포함됩니다.
{ "predictions": [ { "mimeType": "MIME_TYPE", "prompt": "ENHANCED_PROMPT_1", "bytesBase64Encoded": "BASE64_IMG_BYTES_1" }, { "mimeType": "MIME_TYPE", "prompt": "ENHANCED_PROMPT_2", "bytesBase64Encoded": "BASE64_IMG_BYTES_2" } ] }
다음 단계

Imagen 및 Vertex AI의 기타 생성형 AI 제품 관련 문서 읽기:

Vertex AI 기반 Imagen을 사용하여 텍스트 프롬프트로 이미지 생성 컬렉션을 사용해 정리하기 내 환경설정을 기준으로 콘텐츠를 저장하고 분류하세요.

시작하기 전에

Console

Python

REST

텍스트로 이미지 생성

콘솔

Python

설치

REST

curl

PowerShell

다음 단계

Vertex AI 기반 Imagen을 사용하여 텍스트 프롬프트로 이미지 생성