Note: Vertex AI Search is being renamed to Agent Search. We are in the process of updating content to reflect the new branding.

Google uses AI technology to translate content into your preferred language. AI translations can contain errors.

יצירת תשובות שמבוססות על מקורות באמצעות RAG

במסגרת חוויית השימוש ב-Retrieval Augmented Generation (יצירה משולבת-אחזור, RAG) בחיפוש מבוסס סוכנים, אתם יכולים ליצור תשובות מבוססות-מידע להנחיות על סמך מקורות המידע הבאים:

חיפוש Google: כדאי להשתמש בעיגון באמצעות חיפוש Google אם רוצים לקשר את המודל לידע על העולם, למגוון רחב של נושאים או למידע עדכני באינטרנט. עיגון באמצעות חיפוש Google תומך באחזור דינמי שמאפשר לכם ליצור תוצאות שמבוססות על נתונים מחיפוש Google רק כשצריך. לכן, הגדרת השליפה הדינמית בודקת אם ההנחיה דורשת ידע על אירועים מהזמן האחרון, ומפעילה את עיגון באמצעות חיפוש Google. מידע נוסף זמין במאמר בנושא אחזור דינמי.
חשוב: אם אתם מקבלים הצעות לחיפוש Google עם תשובה, התשובה הזו היא "תוצאה מבוססת" בכפוף לתנאים של עיגון באמצעות חיפוש Google בקטע תנאי השירות בתנאים הספציפיים לשירות. כדי להשתמש בהצעות לחיפוש ב-Google, אפשר לעיין במאמר בנושא .
טקסט מוטבע: אפשר להשתמש בהארקה עם טקסט מוטבע כדי להארק את התשובה בחלקים של טקסט שנקראים טקסט עובדתי ומסופקים בבקשה. עובדה היא הצהרה שסופקה על ידי המשתמש ונחשבת לעובדתית עבור בקשה נתונה. המודל לא בודק את האותנטיות של הטקסט של העובדה.
מאגרי נתונים של חיפוש מבוסס סוכנים: אם רוצים לקשר את המודל למסמכים של הארגון ממאגרי נתונים של חיפוש מבוסס סוכנים, צריך להשתמש בהארקה עם חיפוש מבוסס סוכנים.

בדף הזה נסביר איך ליצור תשובות מבוססות-הקשר על סמך מקורות ההקשר האלה, באמצעות הגישות הבאות:

יצירת תשובה בשיחה אחת
- מאגר נתונים של טקסט מוטבע וחיפוש מבוסס סוכנים
- חיפוש Google
יצירת תשובות רב-שלבית

בנוסף, אתם יכולים לבחור להזרים את התשובות מהמודל. יצירת תשובה שמבוססת על מקורות באמצעות סטרימינג היא תכונה ניסיונית.

אתם יכולים להשתמש בשיטות אחרות כדי ליצור תשובות מבוססות-קרקע, בהתאם לאפליקציה שלכם. מידע נוסף מופיע במאמר Vertex AI APIs for building search and RAG experiences.

הסברים על המונחים

לפני שמשתמשים בשיטה ליצירת תשובות מבוססות-קרקע, כדאי להבין את הקלט והפלט, איך לבנות את הבקשה ואת המינוח שקשור ל-RAG.

מונחים שקשורים ל-RAG

RAG היא מתודולוגיה שמאפשרת למודלים גדולים של שפה (LLMs) ליצור תשובות שמבוססות על מקור הנתונים שתבחרו. יש שני שלבים ב-RAG:

אחזור: קבלת העובדות הכי רלוונטיות במהירות יכולה להיות בעיה נפוצה בחיפוש. בעזרת RAG, אפשר לאחזר במהירות את העובדות שחשובות ליצירת תשובה.
יצירה: העובדות שאוחזרו משמשות את ה-LLM ליצירת תשובה מבוססת.

לכן, שיטת יצירת התשובות שמבוססות על מקורות מאחזרת את העובדות ממקור ההצמדה ויוצרת תשובה שמבוססת על מקורות.

נתוני קלט

שיטת יצירת התשובות המבוססת על מידע מהימן דורשת את הקלט הבא בבקשה:

תפקיד: השולח של טקסט מסוים, שהוא משתמש (user) או מודל (model).
טקסט: כשהתפקיד הוא user, הטקסט הוא הנחיה, וכשהתפקיד הוא model, הטקסט הוא תשובה מבוססת. האופן שבו מציינים את התפקיד ואת הטקסט בבקשה נקבע באופן הבא:
- כדי ליצור תשובה בשיחה אחת, המשתמש שולח את הטקסט של ההנחיה בבקשה והמודל שולח את הטקסט של התשובה בתגובה.
- כדי ליצור תשובה שמתבססת על כמה תגובות קודמות, הבקשה מכילה את צמד ההנחיה-תשובה של כל התגובות הקודמות ואת טקסט ההנחיה מהמשתמש לתגובה הנוכחית. לכן, בבקשה כזו, התפקיד הוא user עבור טקסט ההנחיה ו-model עבור טקסט התשובה.
הוראת מערכת: מבוא להנחיה שקובע את ההתנהגות של המודל ומשנה את הפלט בהתאם. לדוגמה, אפשר להוסיף פרסונה לתשובה שנוצרה או להנחות את המודל לעצב את טקסט הפלט בדרך מסוימת. כדי ליצור תשובה רב-שלבית, צריך לספק את הוראות המערכת לכל תפנית. מידע נוסף זמין במאמר בנושא שימוש בהוראות מערכת.

מקור העיגון: המקור שבו התשובה מעוגנת, ויכול להיות אחד או יותר מהמקורות הבאים:

חיפוש Google: ביסוס התשובות על תוצאות חיפוש Google. כשמקור ההארקה הוא חיפוש Google, אפשר לציין הגדרה של אחזור דינמי עם סף אחזור דינמי. מידע נוסף זמין במאמר בנושא אחזור דינמי.
חשוב: אם אתם מקבלים הצעות לחיפוש Google עם תשובה, התשובה הזו היא "תוצאה מבוססת" בכפוף לתנאים של עיגון באמצעות חיפוש Google בקטע תנאי השירות בתנאים הספציפיים לשירות. כדי להשתמש בהצעות לחיפוש ב-Google, אפשר לעיין במאמר בנושא .
טקסט מוטבע: מעגנים את התשובה בטקסט עובדתי שמופיע בבקשה. עובדה היא הצהרה שסופקה על ידי המשתמש ונחשבת לעובדתית עבור בקשה נתונה. המודל לא בודק את האותנטיות של הטקסט של העובדה. אפשר לספק עד 100 טקסטים של עובדות בכל מקור טקסט מוטבע. אפשר לתמוך בטקסטים של העובדות באמצעות מאפייני מטא, כמו title, author ו-URI. מאפייני המטא האלה מוחזרים בתשובה כשמצטטים את החלקים שתומכים בתשובה.
מאגרי נתונים של חיפוש מבוסס סוכנים: התשובה מבוססת על המסמכים ממאגרי הנתונים של חיפוש מבוסס סוכנים. אי אפשר לציין מאגר נתונים של חיפושים באתר כמקור לביסוס.

בבקשה נתונה, אפשר לספק גם מקור טקסט מוטבע וגם מקור של מאגר נתונים של חיפוש מבוסס סוכנים. אי אפשר לשלב את חיפוש Google עם אף אחד מהמקורות האלה. לכן, אם רוצים להשתמש בתוצאות חיפוש Google כדי להצדיק את התשובות, צריך לשלוח בקשה נפרדת שבה מציינים שחיפוש Google הוא המקור היחיד להצדקה.

אפשר לספק עד 10 מקורות להארקה בכל סדר. לדוגמה, נניח שאתם מספקים את מקורות ההארקה עם המספר הבא, בסדר הבא, כדי לקבל סך של 10 מקורות הארקה:

שלושה מקורות טקסט מוטבעים, שכל אחד מהם יכול להכיל עד 100 טקסטים של עובדות
שישה מאגרי נתונים של חיפוש מבוסס סוכנים
מקור טקסט מוטבע אחד, שמכיל עד 100 טקסטים של עובדות

לכל מקור מוקצה אינדקס לפי הסדר שבו הוא מצוין בבקשה. לדוגמה, אם ציינתם שילוב של מקורות בבקשה, המערכת תקצה את אינדקס המקור כמו שמוצג בטבלה הבאה:

מקור העיגון	אינדקס
טקסט מוטבע מספר 1	0
טקסט מוטבע #2	1
מאגר נתונים של חיפוש מבוסס-סוכנים מס' 1	2
טקסט מוטבע #3	3
מאגר נתונים של חיפוש מבוסס סוכנים מס' 2	4

האינדקס הזה מצוין בתגובה ועוזר לעקוב אחרי המקור.

מפרטי יצירה: המפרטים להגדרת המודל, שכוללים את המידע הבא:
- מספר המודל: מציין את מודל Gemini של Vertex AI שבו יש להשתמש ליצירת תשובות. רשימת מודלים של Gemini שבהם אפשר להשתמש כדי ליצור תשובות מבוססות-קרקע זמינה במאמר מודלים של Gemini.
- פרמטרים של המודל: מציינים את הפרמטרים שאפשר להגדיר למודל שבוחרים להשתמש בו. הפרמטרים האלה הם: שפה, רמת אקראיות, Top-P ו-Top-K. פרטים על הפרמטרים האלה מופיעים במאמר פרמטרים של מודלים של Gemini.
קוד שפה: השפה של התשובה שנוצרה בדרך כלל תואמת לשפה של ההנחיה. אם אין שפה אחת בהנחיה (לדוגמה, אם ההנחיה קצרה מאוד ויכולה להיות תקפה בכמה שפות), שדה קוד השפה קובע את שפת התשובה.

רשימת קודי השפות מופיעה במאמר שפות.
‫Latitude and longitude: מציינים את קו הרוחב וקו האורך של המשתמש. אם השאילתה מכילה שאלות שקשורות למיקום, כמו 'תמצא בית קפה בקרבת מקום', נעשה שימוש בשדות האלה. אם אי אפשר לקבוע את שפת השאילתה וקוד השפה לא מוגדר, המערכת משתמשת בקו הרוחב ובקו האורך כדי לקבוע את שפת התשובה.

נתוני פלט

התשובה שהמודל יוצר נקראת מועמד והיא מכילה את הנתונים הבאים. יכול להיות שלא כל השדות יופיעו בפלט.

תפקיד: השולח של התשובה המבוססת. התשובה תמיד מכילה את הטקסט של התשובה המבוססת על מידע. לכן, התפקיד בתשובה הוא תמיד מודל.
טקסט: תשובה מבוססת.
ציון ההצמדה לקרקע: ערך מסוג float בטווח [0, 1] שמציין עד כמה התשובה מבוססת על המקורות שצוינו.
מטא-נתונים של ביסוס: מטא-נתונים על מקור הביסוס. מטא-נתונים של ביסוס כוללים את הפרטים הבאים:
- קטעי תמיכה: רשימה של קטעים שתומכים בתשובה. לכל מקטע תמיכה מוקצה אינדקס של מקטע התמיכה, שעוזר לעקוב אחרי מקור המידע. כל מקטע תמיכה מכיל את הפרטים הבאים:
  - קטע טקסט: חלק מהטקסט שצוטט מילה במילה מהמקור שממנו נלקחה התשובה או חלק מהתשובה (שנקרא טקסט הטענה). יכול להיות שהמידע הזה לא יופיע תמיד בתשובה.
  - מקור: אינדקס שמוקצה למקור בבקשה.
  - מטא-נתונים של המקור: מטא-נתונים על החלק. בהתאם למקור, המטא-נתונים של המקור יכולים להיות כל אחד מהבאים:
    - במקרה של מקור מוטבע, המטא-נתונים יכולים להיות הפרטים הנוספים שצוינו בבקשה, כמו שם, מחבר או URI.
    - במאגר הנתונים של חיפוש מבוסס סוכנים, המטא-נתונים יכולים להיות מזהה המסמך, שם המסמך, ה-URI (המיקום ב-Cloud Storage) או מספר הדף.
    - ב-Grounding with Google Search, כשנוצרת תוצאה מבוססת, המטא-נתונים מכילים מזהה URI שמפנה מחדש לבעל התוכן ששימש ליצירת התוצאה המבוססת. המטא-נתונים כוללים גם את הדומיין של המוציא לאור. כתובות ה-URI שסופקו נשארות נגישות למשך 30 יום לכל היותר אחרי יצירת התוצאה המבוססת.
    חשוב: משתמשי הקצה צריכים לגשת ישירות ל-URI שצוין, ולא לבצע שאילתות באופן פרוגרמטי באמצעות אמצעים אוטומטיים. אם מזוהה גישה אוטומטית, יכול להיות ששירות עיגון באמצעות חיפוש Google יפסיק לספק את מזהי ה-URI של ההפניה האוטומטית. כדי להפעיל מחדש את כתובות ה-URI להפניה אוטומטית, צריך לפנות למהנדס הלקוחות.
- תמיכה בהצגת מקורות מידע: הצגת מקורות מידע לתמיכה בטענה בתשובה. התמיכה בהארקה כוללת את המידע הבא:
  - טקסט הטענה: התשובה או חלק מהתשובה שמגובות בטקסט של קטע התמיכה.
  - Support chunk index: אינדקס שמוקצה לחלק התומך בסדר שבו החלק מופיע ברשימת החלקים התומכים.
  - שאילתות חיפוש באינטרנט: הצעות לשאילתות חיפוש בחיפוש Google.
  - הצעות לחיפוש: אם אתם מקבלים הצעות לחיפוש Google עם תשובה, התשובה הזו היא "תוצאה עם ביסוס" בכפוף לתנאי השירות של עיגון באמצעות חיפוש Google. מידע נוסף זמין במאמר בנושא תנאי השירות. השדה renderedContent בתוך השדה searchEntryPoint הוא הקוד שסופק להטמעה של הצעות לחיפוש Google. כדי להשתמש בהצעות לחיפוש ב-Google, אפשר לעיין במאמר בנושא שימוש בהצעות לחיפוש ב-Google.

יצירת תשובה שמבוססת על מקורות בשיחה אחת

בקטע הזה מוסבר איך ליצור תשובות שמבוססות על המקורות הבאים:

מאגר נתונים של טקסט מוטבע וחיפוש מבוסס סוכנים
חיפוש Google

התשובה תהיה מבוססת על טקסט מוטבע ומאגר נתונים של חיפוש מבוסס סוכנים

בדוגמה הבאה מוצג איך לשלוח טקסט של הנחיה על ידי ציון טקסט מוטבע ומאגר נתונים של חיפוש מבוסס סוכנים כמקור העיגון. אי אפשר לציין מאגר נתונים של חיפוש באתר כמקור ההארקה. בדוגמה הזו נעשה שימוש בשיטה generateGroundedContent.

REST

שולחים את ההנחיה בבקשת curl הבאה.

curl -X POST \
-H "Authorization: Bearer $(gcloud auth print-access-token)" \
-H "Content-Type: application/json" \
"https://discoveryengine.googleapis.com/v1/projects/PROJECT_NUMBER/locations/global:generateGroundedContent" \
-d '
{
"contents": [
 {
   "role": "user",
   "parts": [
     {
       "text": "PROMPT_TEXT"
     }
   ]
 }
],
"systemInstruction": {
   "parts": {
       "text": "SYSTEM_INSTRUCTION"
   }
},
"groundingSpec": {
 "groundingSources": [
   {
     "inlineSource": {
       "groundingFacts": [
         {
           "factText": "FACT_TEXT_1",
           "attributes": {
             "title": "TITLE_1",
             "uri": "URI_1",
             "author": "AUTHOR_1"
           }
         }
       ]
     }
   },
   {
     "inlineSource": {
       "groundingFacts": [
         {
           "factText": "FACT_TEXT_2",
           "attributes": {
             "title": "TITLE_2",
             "uri": "URI_2"
           }
         },
         {
           "factText": "FACT_TEXT_3",
           "attributes": {
             "title": "TITLE_3",
             "uri": "URI_3"
           }
         }
       ]
     }
   },
   {
     "searchSource": {
       "servingConfig": "projects/PROJECT_NUMBER/locations/global/collections/default_collection/engines/APP_ID_1/servingConfigs/default_search"
     }
   },
   {
     "searchSource": {
       "servingConfig": "projects/PROJECT_NUMBER/locations/global/collections/default_collection/engines/APP_ID_2/servingConfigs/default_search"
     }
   }
  ]
},
"generationSpec": {
  "modelId": "MODEL_ID",
  "temperature": TEMPERATURE,
  "topP": TOP_P,
  "topK": TOP_K
},
"user_context": {
  "languageCode: "LANGUAGE_CODE",
  "latLng": {
    "latitude": LATITUDE,
    "longitude": LONGITUDE
 },
}
}'

מחליפים את מה שכתוב בשדות הבאים:

‫PROJECT_NUMBER: מספר הפרויקט ב- Google Cloud .
‫PROMPT_TEXT: ההנחיה מהמשתמש.
‫SYSTEM_INSTRUCTION: שדה אופציונלי שבו אפשר לספק הקדמה או הקשר נוסף.
‫FACT_TEXT_N: הטקסט שמופיע בשורה כדי להצדיק את התשובה. אפשר לספק עד 100 טקסטים של עובדות.
‫TITLE_N: שדה אופציונלי להגדרת מאפיין המטא של הכותרת לטקסט בתוך התג.
‫URI_N: שדה אופציונלי להגדרת מאפיין המטא של ה-URI עבור הטקסט בתוך התג.
‫AUTHOR_N: שדה אופציונלי להגדרת מאפיין מטא של מחבר לטקסט בתוך התג.
‫APP_ID_N: המזהה של אפליקציית חיפוש מבוסס סוכנים.
‫MODEL_ID: שדה אופציונלי להגדרת מזהה המודל של מודל Gemini שבו רוצים להשתמש כדי ליצור את התשובה המבוססת. רשימה של מזהי המודלים הזמינים מופיעה במאמר מודלים של Gemini במסמכי המוצר בנושא AI גנרטיבי ב-Vertex AI.
‫TEMPERATURE: שדה אופציונלי להגדרת רמת האקראיות שמשמשת לדגימה. ‫Google ממליצה על רמת אקראיות של 0.0. מידע נוסף זמין במאמר פרמטרים של מודל Gemini.
‫TOP_P: שדה אופציונלי להגדרת ערך top-P של המודל. מידע נוסף זמין במאמר פרמטרים של מודל Gemini.
‫TOP_K: שדה אופציונלי להגדרת ערך ה-K העליון של המודל. מידע נוסף זמין במאמר פרמטרים של מודל Gemini.
‫LANGUAGE_CODE: שדה אופציונלי שאפשר להשתמש בו כדי להגדיר את השפה של התשובה שנוצרה ושל הטקסט של החלק שמוחזר. אם אי אפשר לקבוע את השפה מהשאילתה, נעשה שימוש בשדה הזה. ערך ברירת המחדל הוא en. רשימת קודי שפה זמינה במאמר שפות.
‫LATITUDE: שדה אופציונלי להגדרת קו הרוחב. מזינים את הערך במעלות עשרוניות – לדוגמה, -25.34.
‫LONGITUDE: שדה אופציונלי להגדרת קו האורך. מזינים את הערך במעלות עשרוניות – לדוגמה, 131.04.

תשובה

אתם אמורים לקבל תגובת JSON שדומה לתגובה הבאה (התגובה קוצרה). כדי להבין את התגובה, אפשר לעיין בנתוני הפלט.

{
 "candidates": [
   {
     "content": {
       "role": "model",
       "parts": [
         {
           "text": "ANSWER_TEXT"
         }
       ]
     },
     "groundingScore": GROUNDING_SCORE,
     "groundingMetadata": {
       "supportChunks": [
         {
           "chunkText": "CHUNK_TEXT_FROM_A_DOCUMENT_IN_A_DATA_STORE ",
           "source": "4",
           "sourceMetadata": {
             "title": "DOCUMENT_TITLE",
             "uri": "gs://PATH/TO/DOCUMENT.pdf",
             "document_id": "DOCUMENT_ID",
             "page_identifier": "PAGE_NUMBER"
           }
         },
         {
           "chunkText": "CHUNK_TEXT_FROM_FACT_TEXT_1",
           "source": "0",
           "sourceMetadata": {
             "title": "TITLE_1",
             "uri": "URI_1",
             "author": "AUTHOR_1"
           }
         }
       ],
       "groundingSupport": [
         {
           "claimText": "CLAIM_TEXT_1",
           "supportChunkIndices": [
             0,
             1
           ]
         }
       ]
     }
   }
 ]
}

דוגמה ליצירת תשובה בשיחה חד-תורית שמבוססת על טקסט מוטבע ועל חיפוש באמצעות סוכן

בדוגמה הבאה, הבקשה מציינת את מקורות העיגון הבאים: עובדה טקסטואלית מוטמעת אחת ומאגר נתונים אחד של חיפוש מבוסס סוכנים. בדוגמה הזו נעשה שימוש בשיטה generateGroundedContent. בדוגמה הזו נעשה שימוש גם בהוראה למערכת לסיים את התשובה עם אמוג'י של פרצוף מחייך.

REST

curl -X POST \
-H "Authorization: Bearer $(gcloud auth print-access-token)" \
-H "Content-Type: application/json" \
"https://discoveryengine.googleapis.com/v1/projects/123456/locations/global:generateGroundedContent" \
-d '
{
  "contents": [
    {
      "role": "user",
      "parts": [
        {
          "text": "How did Google do in 2020? Where can I find BigQuery docs?"
        }
      ]
    }
  ],
  "systemInstruction": {
      "parts": {
          "text": "Add a smiley emoji after the answer."
      }
  },
  "groundingSpec": {
    "groundingSources": [
      {
        "inline_source": {
          "grounding_facts": [
            {
              "fact_text": "The BigQuery documentation can be found at https://cloud.google.com/bigquery/docs/introduction",
              "attributes": {
                "title": "BigQuery Overview",
                "uri": "https://cloud.google.com/bigquery/docs/introduction"
              }
            }
          ]
        }
      },
      {
        "searchSource": {
          "servingConfig": "projects/123456/locations/global/collections/default_collection/engines/app_id_example/servingConfigs/default_search"
        }
      }
    ]
  },
  "generationSpec": {
    "modelId": "gemini-2.5-flash"
  },
  "user_context": {
    "languageCode: "en",
    "latLng": {
       "latitude": 37.422131,
       "longitude": -122.084801
    }
  }
}'

תשובה

אתם אמורים לקבל תגובת JSON שדומה לתגובה הבאה (התגובה קוצרה). כדי להבין את התגובה, אפשר לעיין בנתוני הפלט.

{
 "candidates": [
   {
     "content": {
       "role": "model",
       "parts": [
         {
           "text": "Google's revenue increased by 23% in 2020, reaching $182.5 billion. Google Cloud revenue was $13.1 billion for 2020. You can find BigQuery documentation at https://cloud.google.com/bigquery/docs/introduction. 😊 \n"
         }
       ]
     },
     "groundingScore": 0.86738646,
     "groundingMetadata": {
       "supportChunks": [
         {
           "chunkText": "Alphabet Announces Fourth Quarter and Fiscal Year 2020 Results\nMOUNTAIN VIEW, Calif. – February 2, 2021 – Alphabet Inc. (NASDAQ: GOOG, GOOGL) today announced\nfinancial results for the quarter and fiscal year ended December 31, 2020. Sundar Pichai, CEO of Google and Alphabet, said: "Our strong results this quarter reflect the helpfulness of our\nproducts and services to people and businesses, as well as the accelerating transition to online services and the\ncloud. Google succeeds when we help our customers and partners succeed, and we see significant opportunities to\nforge meaningful partnerships as businesses increasingly look to a digital future." Ruth Porat, CFO of Google and Alphabet, said: "Our strong fourth quarter performance, with revenues of $56.9\nbillion, was driven by Search and YouTube, as consumer and business activity recovered from earlier in the year. Google Cloud revenues were $13.1 billion for 2020, with significant ongoing momentum, and we remain focused on\ndelivering value across the growth opportunities we see." New reporting segment structure and operating results\nWe are now reporting results for three segments: Google Services, Google Cloud, and Other Bets. \n...\nIn 2020, we entered into derivatives that hedged the changes in fair value of certain marketable equity securities, which\nresulted in a $497 million net loss for the quarter ended December 31, 2020. The offsetting recognized gains on the\nmarketable equity securities are reflected in Gain (loss) on equity securities, net. Segment results\nThe following table presents our revenues and operating income (loss) (in millions; unaudited): Quarter Fiscal Year Q4 2019 Q1 2020 Q2 2020 Q3 2020 Q4 2020 2018 2019 2020 Revenues:\nGoogle Services $ 43,198 $ 38,198 $ 34,991 $ 42,573 $ 52,873 $ 130,524 $ 151,825 $ 168,635 Google Cloud 2,614 2,777 3,007 3,444 3,831 5,838 8,918 13,059 Other Bets 172 135 148 178 196 595 659 657 Hedging gains (losses) 91 49 151 (22) (2) (138) 455 176 Total revenues $ 46,075 $ 41,159 $ 38,297 $ 46,173 $ 56,898 $ 136,819 $ 161,857 $ 182,527 Quarter Fiscal Year Q4 2019 Q1 2020 Q2 2020 Q3 2020 Q4 2020 2018 2019 2020 Operating income (loss):\nGoogle Services $ 13,488 $ 11,548 $ 9,539 $ 14,453 $ 19,066 $ 43,137 $ 48,999 $ 54,606 Google Cloud (1,194) (1,730) (1,426) (1,208) (1,243) (4,348) (4,645) (5,607) Other Bets (2,026) (1,121) (1,116) (1,103) (1,136) \n...\nQ4 2020 financial highlights\nThe following table summarizes our consolidated financial results for the quarters ended December 31, 2019 and\n2020 (in millions, except for per share information and percentages; unaudited). Quarter Ended December 31,\n2019 2020 Revenues $ 46,075 $ 56,898 Increase in revenues year over year 17 % 23 % Increase in constant currency revenues year over year(1) 19 % 23 % Operating income $ 9,266 $ 15,651 Operating margin 20 % 28 % Other income (expense), net $ 1,438 $ 3,038 Net income $ 10,671 $ 15,227 Diluted EPS $ 15.35 $ 22.30 (1) Non-GAAP measure. See the table captioned "Reconciliation from GAAP revenues to non-GAAP constant currency\nrevenues" for more details. Q4 2020 supplemental information (in millions, except for number of employees; unaudited)\nRevenues, Traffic Acquisition Costs (TAC) and number of employees\nThe following table summarizes our revenues, total TAC and number of employees. Quarter Ended December 31,\n2019 2020 Google Search & other $ 27,185 $ 31,903 YouTube ads 4,717 6,885 Google Network Members' properties 6,032 7,411 Google advertising 37,934 46,199 Google other 5,264 6,674 Google Services total 43,198 52,873 Google Cloud 2,614 3,831 Other Bets 172 196 Hedging gains (losses) 91 (2) Total revenues $ 46,075 $ 56,898 Total TAC $ 8,501 $ 10,466 Number of employees 118,899 135,301 ",
           "source": "1",
           "sourceMetadata": {
             "title": "GOOG Exhibit 99.1 Q4'20",
             "page_identifier": "2",
             "uri": "gs://cloud-samples-data/gen-app-builder/search/alphabet-investor-pdfs/2020Q4_alphabet_earnings_release.pdf",
             "document_id": "projects/123456/locations/global/collections/default_collection/dataStores/data_store_id_example/branches/0/documents/217e8bedecfe08e3c43f5b289af15243"
           }
         },
         {
           "chunkText": "Alphabet Announces Fourth Quarter and Fiscal Year 2020 Results\nMOUNTAIN VIEW, Calif. – February 2, 2021 – Alphabet Inc. (NASDAQ: GOOG, GOOGL) today announced\nfinancial results for the quarter and fiscal year ended December 31, 2020. Sundar Pichai, CEO of Google and Alphabet, said: "Our strong results this quarter reflect the helpfulness of our\nproducts and services to people and businesses, as well as the accelerating transition to online services and the\ncloud. Google succeeds when we help our customers and partners succeed, and we see significant opportunities to\nforge meaningful partnerships as businesses increasingly look to a digital future." Ruth Porat, CFO of Google and Alphabet, said: "Our strong fourth quarter performance, with revenues of $56.9\nbillion, was driven by Search and YouTube, as consumer and business activity recovered from earlier in the year. Google Cloud revenues were $13.1 billion for 2020, with significant ongoing momentum, and we remain focused on\ndelivering value across the growth opportunities we see." New reporting segment structure and operating results\nWe are now reporting results for three segments: Google Services, Google Cloud, and Other Bets. \n...\nIn 2020, we entered into derivatives that hedged the changes in fair value of certain marketable equity securities, which\nresulted in a $497 million net loss for the quarter ended December 31, 2020. The offsetting recognized gains on the\nmarketable equity securities are reflected in Gain (loss) on equity securities, net. Segment results\nThe following table presents our revenues and operating income (loss) (in millions; unaudited): Quarter Fiscal Year Q4 2019 Q1 2020 Q2 2020 Q3 2020 Q4 2020 2018 2019 2020 Revenues:\nGoogle Services $ 43,198 $ 38,198 $ 34,991 $ 42,573 $ 52,873 $ 130,524 $ 151,825 $ 168,635 Google Cloud 2,614 2,777 3,007 3,444 3,831 5,838 8,918 13,059 Other Bets 172 135 148 178 196 595 659 657 Hedging gains (losses) 91 49 151 (22) (2) (138) 455 176 Total revenues $ 46,075 $ 41,159 $ 38,297 $ 46,173 $ 56,898 $ 136,819 $ 161,857 $ 182,527 Quarter Fiscal Year Q4 2019 Q1 2020 Q2 2020 Q3 2020 Q4 2020 2018 2019 2020 Operating income (loss):\nGoogle Services $ 13,488 $ 11,548 $ 9,539 $ 14,453 $ 19,066 $ 43,137 $ 48,999 $ 54,606 Google Cloud (1,194) (1,730) (1,426) (1,208) (1,243) (4,348) (4,645) (5,607) Other Bets (2,026) (1,121) (1,116) (1,103) (1,136) \n...\nQ4 2020 financial highlights\nThe following table summarizes our consolidated financial results for the quarters ended December 31, 2019 and\n2020 (in millions, except for per share information and percentages; unaudited). Quarter Ended December 31,\n2019 2020 Revenues $ 46,075 $ 56,898 Increase in revenues year over year 17 % 23 % Increase in constant currency revenues year over year(1) 19 % 23 % Operating income $ 9,266 $ 15,651 Operating margin 20 % 28 % Other income (expense), net $ 1,438 $ 3,038 Net income $ 10,671 $ 15,227 Diluted EPS $ 15.35 $ 22.30 (1) Non-GAAP measure. See the table captioned "Reconciliation from GAAP revenues to non-GAAP constant currency\nrevenues" for more details. Q4 2020 supplemental information (in millions, except for number of employees; unaudited)\nRevenues, Traffic Acquisition Costs (TAC) and number of employees\nThe following table summarizes our revenues, total TAC and number of employees. Quarter Ended December 31,\n2019 2020 Google Search & other $ 27,185 $ 31,903 YouTube ads 4,717 6,885 Google Network Members' properties 6,032 7,411 Google advertising 37,934 46,199 Google other 5,264 6,674 Google Services total 43,198 52,873 Google Cloud 2,614 3,831 Other Bets 172 196 Hedging gains (losses) 91 (2) Total revenues $ 46,075 $ 56,898 Total TAC $ 8,501 $ 10,466 Number of employees 118,899 135,301 ",
           "source": "1",
           "sourceMetadata": {
             "document_id": "projects/123456/locations/global/collections/default_collection/dataStores/data_store_id_example/branches/0/documents/217e8bedecfe08e3c43f5b289af15243",
             "page_identifier": "2",
             "title": "GOOG Exhibit 99.1 Q4'20",
             "uri": "gs://cloud-samples-data/gen-app-builder/search/alphabet-investor-pdfs/2020Q4_alphabet_earnings_release.pdf"
           }
         },
         {
           "chunkText": "The BigQuery documentation can be found at https://cloud.google.com/bigquery/docs/introduction ",
           "source": "0",
           "sourceMetadata": {
             "uri": "https://cloud.google.com/bigquery/docs/introduction",
             "title": "BigQuery Overview"
           }
         }
       ],
       "groundingSupport": [
         {
           "claimText": "Google's revenue increased by 23% in 2020, reaching $182.5 billion.",
           "supportChunkIndices": [
             0
           ]
         },
         {
           "claimText": "Google Cloud revenue was $13.1 billion for 2020.",
           "supportChunkIndices": [
             1
           ]
         },
         {
           "claimText": "You can find BigQuery documentation at https://cloud.google.com/bigquery/docs/introduction.😊 ",
           "supportChunkIndices": [
             2
           ]
         }
       ]
     }
   }
 ]
}

Python

from google.cloud import discoveryengine_v1 as discoveryengine

# TODO(developer): Uncomment these variables before running the sample.
# project_number = "YOUR_PROJECT_NUMBER"
# engine_id = "YOUR_ENGINE_ID"

client = discoveryengine.GroundedGenerationServiceClient()

request = discoveryengine.GenerateGroundedContentRequest(
    # The full resource name of the location.
    # Format: projects/{project_number}/locations/{location}
    location=client.common_location_path(project=project_number, location="global"),
    generation_spec=discoveryengine.GenerateGroundedContentRequest.GenerationSpec(
        model_id="gemini-2.5-flash",
    ),
    # Conversation between user and model
    contents=[
        discoveryengine.GroundedGenerationContent(
            role="user",
            parts=[
                discoveryengine.GroundedGenerationContent.Part(
                    text="How did Google do in 2020? Where can I find BigQuery docs?"
                )
            ],
        )
    ],
    system_instruction=discoveryengine.GroundedGenerationContent(
        parts=[
            discoveryengine.GroundedGenerationContent.Part(
                text="Add a smiley emoji after the answer."
            )
        ],
    ),
    # What to ground on.
    grounding_spec=discoveryengine.GenerateGroundedContentRequest.GroundingSpec(
        grounding_sources=[
            discoveryengine.GenerateGroundedContentRequest.GroundingSource(
                inline_source=discoveryengine.GenerateGroundedContentRequest.GroundingSource.InlineSource(
                    grounding_facts=[
                        discoveryengine.GroundingFact(
                            fact_text=(
                                "The BigQuery documentation can be found at https://cloud.google.com/bigquery/docs/introduction"
                            ),
                            attributes={
                                "title": "BigQuery Overview",
                                "uri": "https://cloud.google.com/bigquery/docs/introduction",
                            },
                        ),
                    ]
                ),
            ),
            discoveryengine.GenerateGroundedContentRequest.GroundingSource(
                search_source=discoveryengine.GenerateGroundedContentRequest.GroundingSource.SearchSource(
                    # The full resource name of the serving config for a Vertex AI Search App
                    serving_config=f"projects/{project_number}/locations/global/collections/default_collection/engines/{engine_id}/servingConfigs/default_search",
                ),
            ),
        ]
    ),
)
response = client.generate_grounded_content(request)

# Handle the response
print(response)

יצירת תשובה מעוגנת באמצעות חיפוש Google

אפשר להוסיף לתשובות שנוצרות נתונים שזמינים לכולם באינטרנט.

אחזור דינמי

אתם יכולים להשתמש באחזור דינמי בבקשה כדי לבחור מתי להשבית את עיגון באמצעות חיפוש Google. האפשרות הזו שימושית כשלא צריך תשובה שמבוססת על חיפוש Google, והמודל יכול לספק תשובה על סמך הידע שלו בלי להסתמך על חיפוש. כך תוכלו לנהל את זמן האחזור, האיכות והעלות בצורה יעילה יותר.

ציון וסף של חיזוי אחזור דינמי

כששולחים בקשה ליצירת תשובה מבוססת, התכונה 'חיפוש סוכן' מקצה לטקסט הבקשה ציון חיזוי. ציון החיזוי הוא ערך נקודה צפה (floating-point) בטווח [0,1]. הערך שלו תלוי בשאלה אם התשובה להנחיה יכולה להסתמך על המידע העדכני ביותר מחיפוש Google. לכן, הנחיה שנדרשת בה תשובה שמבוססת על העובדות העדכניות ביותר באינטרנט מקבלת ציון חיזוי גבוה יותר, והנחיה שבה תשובה שנוצרה על ידי מודל מספיקה מקבלת ציון חיזוי נמוך יותר.

הנה כמה דוגמאות להנחיות ולציוני החיזוי שלהן.

הנחיה	ציון החיזוי	תגובה
‫"Write a poem about peonies" ‏(תכתוב שיר על אדמוניות)	0.13	המודל יכול להסתמך על הידע שלו, ולא צריך להצמיד את התשובה למקור
‫"Suggest a toy for a 2yo child" ‏(הצעת צעצוע לילד בן שנתיים)	0.36	המודל יכול להסתמך על הידע שלו, ולא צריך להצמיד את התשובה למקור
"תנסח מתכון לגוואקמולי בסגנון אסייתי?"	0.55	חיפוש Google יכול לספק תשובה מבוססת, אבל לא בהכרח צריך לבסס אותה; יכול להיות שהידע של המודל יספיק
מה זה חיפוש מבוסס סוכנים? איך מתבצע חיוב על עיגון בחיפוש מבוסס סוכנים?	0.72	נדרש חיפוש ב-Google כדי ליצור תשובה שמבוססת על מקורות
"מי ניצח בגרנד פרי האחרון של פורמולה 1?"	0.97	נדרש חיפוש ב-Google כדי ליצור תשובה שמבוססת על מקורות

בבקשה ליצירת תשובה מבוססת-קרקע, אפשר לציין הגדרה של שליפה דינמית עם סף. הסף הוא ערך נקודה צפה בטווח [0,1], וערך ברירת המחדל הוא 0.7. אם ערך הסף הוא אפס, התשובה תמיד מבוססת על חיפוש Google. לכל שאר ערכי הסף, חלים התנאים הבאים:

אם ציון התחזית גדול מהסף או שווה לו, התשובה מבוססת על חיפוש Google. סף נמוך יותר מרמז שיש יותר הנחיות שהתשובות שלהן נוצרות באמצעות עיגון באמצעות חיפוש Google.
אם ציון החיזוי נמוך מהסף, יכול להיות שהמודל עדיין ייצור את התשובה, אבל היא לא תתבסס על חיפוש Google.

כדי למצוא סף טוב שמתאים לצרכים העסקיים שלכם, אתם יכולים ליצור קבוצה מייצגת של שאילתות שאתם מצפים להיתקל בהן. לאחר מכן, אפשר למיין את השאילתות לפי ציון החיזוי בתשובה ולבחור סף טוב לתרחיש השימוש שלכם.

עיגון התשובה באמצעות חיפוש Google

בדוגמה הבאה מוסבר איך ליצור תשובה מבוססת-קרקע מתוך הנחיה על ידי ציון חיפוש Google כמקור לביסוס. בדוגמה הזו נעשה שימוש בשיטה generateGroundedContent.

REST

שולחים את ההנחיה בבקשת curl הבאה.

curl -X POST \
-H "Authorization: Bearer $(gcloud auth print-access-token)" \
-H "Content-Type: application/json" \
"https://discoveryengine.googleapis.com/v1/projects/PROJECT_NUMBER/locations/global:generateGroundedContent" \
-d '
{
"contents": [
 {
   "role": "user",
   "parts": [
     {
       "text": "PROMPT_TEXT"
     }
   ]
 }
],
"systemInstruction": {
   "parts": {
       "text": "SYSTEM_INSTRUCTION"
   }
},
"groundingSpec": {
 "groundingSources": [
 {
     "googleSearchSource": {
          "dynamicRetrievalConfig": {
              "predictor":{
                  "threshold": DYNAMIC_RETRIEVAL_THRESHOLD
              }
          }
     }
 }
]
},
"generationSpec": {
 "modelId": "MODEL_ID",
 "temperature": TEMPERATURE,
 "topP": TOP_P,
 "topK": TOP_K
},
"user_context": {
 "languageCode: "LANGUAGE_CODE",
 "latLng": {
   "latitude": LATITUDE,
   "longitude": LONGITUDE
 },
}
}'

מחליפים את מה שכתוב בשדות הבאים:

‫PROJECT_NUMBER: מספר הפרויקט ב- Google Cloud .
‫PROMPT_TEXT: ההנחיה מהמשתמש.
‫SYSTEM_INSTRUCTION: שדה אופציונלי שבו אפשר לספק הקדמה או הקשר נוסף.
‫DYNAMIC_RETRIEVAL_THRESHOLD: שדה אופציונלי להגדרת הסף להפעלת ההגדרה של אחזור דינמי. זהו ערך נקודה צפה (floating-point) בטווח [0,1]. אם מוסיפים את השדה dynamicRetrievalConfig, אבל לא מגדירים את השדה predictor או threshold, ערך הסף מוגדר כברירת מחדל ל-0.7. אם לא מגדירים את השדה dynamicRetrievalConfig, התשובה תמיד תהיה מבוססת על מקורות.
‫MODEL_ID: שדה אופציונלי להגדרת מזהה המודל של מודל Gemini שבו רוצים להשתמש כדי ליצור את התשובה המבוססת. רשימה של מזהי המודלים הזמינים מופיעה במאמר מודלים של Gemini במסמכי המוצר בנושא AI גנרטיבי ב-Vertex AI.
‫TEMPERATURE: שדה אופציונלי להגדרת רמת האקראיות שמשמשת לדגימה. ‫Google ממליצה על רמת אקראיות של 0.0. מידע נוסף זמין במאמר פרמטרים של מודל Gemini.
‫TOP_P: שדה אופציונלי להגדרת ערך top-P של המודל. מידע נוסף זמין במאמר פרמטרים של מודל Gemini.
‫TOP_K: שדה אופציונלי להגדרת ערך ה-K העליון של המודל. מידע נוסף זמין במאמר פרמטרים של מודל Gemini.
‫LANGUAGE_CODE: שדה אופציונלי שאפשר להשתמש בו כדי להגדיר את השפה של התשובה שנוצרה ושל הטקסט של החלק שמוחזר. אם אי אפשר לקבוע את השפה מהשאילתה, נעשה שימוש בשדה הזה. ערך ברירת המחדל הוא en. רשימת קודי שפה זמינה במאמר שפות.
‫LATITUDE: שדה אופציונלי להגדרת קו הרוחב. מזינים את הערך במעלות עשרוניות – לדוגמה, -25.34.
‫LONGITUDE: שדה אופציונלי להגדרת קו האורך. מזינים את הערך במעלות עשרוניות – לדוגמה, 131.04.

תשובה

אתם אמורים לקבל תגובת JSON שדומה לתגובה הבאה (התגובה קוצרה). כדי להבין את התגובה, אפשר לעיין בנתוני הפלט.

{
 "candidates": [
   {
     "content": {
       "role": "model",
       "parts": [
         {
           "text": "ANSWER_TEXT"
         }
       ]
     },
     "groundingScore":GROUNDING_SCORE,
     "groundingMetadata": {
       "supportChunks": [
         {
          "source": "0",
          "sourceMetadata": {
            "uri": "REDIRECTION_URI",
            "domain": "PUBLISHER_DOMAIN"
          }
         }
       ],
       "groundingSupport": [
         {
           "claimText": "CLAIM_TEXT_1",
           "supportScore": SUPPORT_SCORE,
           "supportChunkIndices": [
             0
           ]
         },
         {
           "claimText": "CLAIM_TEXT_2",
           "supportScore": SUPPORT_SCORE,
           "supportChunkIndices": [
             0
           ]
         }
       ],
       "webSearchQueries": [
         "QUERY_BUILT_FROM_USER_PROMPT"
       ],
       "searchEntryPoint": {
         "renderedContent": "RENDERED_CONTENT"
       },
     }
   }
 ]
}
{
 "candidates": [
   {
     "content": {
       "role": "model",
       "parts": [
         {
           "text": "ANSWER_TEXT"
         }
       ]
     },
     "groundingScore":GROUNDING_SCORE,
     "groundingMetadata": {
       "supportChunks": [
         {}
       ],
       "groundingSupport": [
         {
           "claimText": "CLAIM_TEXT_1",
           "supportScore": SUPPORT_SCORE,
           "supportChunkIndices": [
             0
           ]
         },
         {
           "claimText": "CLAIM_TEXT_2",
           "supportScore": SUPPORT_SCORE,
           "supportChunkIndices": [
             0
           ]
         }
       ],
       "webSearchQueries": [
         "QUERY_BUILT_FROM_USER_PROMPT"
       ],
       "searchEntryPoint": {
         "renderedContent": "RENDERED_CONTENT"
       },
       "retrievalMetadata": [
         {
           "source": "GOOGLE_SEARCH",
           "dynamicRetrievalMetadata": {
             "predictorMetadata": {
               "version": "V1_INDEPENDENT",
               "prediction": PREDICTION_SCORE
             }
           }
         }
       ]
     }
   }
 ]
}

Python

from google.cloud import discoveryengine_v1 as discoveryengine

# TODO(developer): Uncomment these variables before running the sample.
# project_number = "YOUR_PROJECT_NUMBER"

client = discoveryengine.GroundedGenerationServiceClient()

request = discoveryengine.GenerateGroundedContentRequest(
    # The full resource name of the location.
    # Format: projects/{project_number}/locations/{location}
    location=client.common_location_path(project=project_number, location="global"),
    generation_spec=discoveryengine.GenerateGroundedContentRequest.GenerationSpec(
        model_id="gemini-2.5-flash",
    ),
    # Conversation between user and model
    contents=[
        discoveryengine.GroundedGenerationContent(
            role="user",
            parts=[
                discoveryengine.GroundedGenerationContent.Part(
                    text="How much is Google stock?"
                )
            ],
        )
    ],
    system_instruction=discoveryengine.GroundedGenerationContent(
        parts=[
            discoveryengine.GroundedGenerationContent.Part(text="Be comprehensive.")
        ],
    ),
    # What to ground on.
    grounding_spec=discoveryengine.GenerateGroundedContentRequest.GroundingSpec(
        grounding_sources=[
            discoveryengine.GenerateGroundedContentRequest.GroundingSource(
                google_search_source=discoveryengine.GenerateGroundedContentRequest.GroundingSource.GoogleSearchSource(
                    # Optional: For Dynamic Retrieval
                    dynamic_retrieval_config=discoveryengine.GenerateGroundedContentRequest.DynamicRetrievalConfiguration(
                        predictor=discoveryengine.GenerateGroundedContentRequest.DynamicRetrievalConfiguration.DynamicRetrievalPredictor(
                            threshold=0.7
                        )
                    )
                )
            ),
        ]
    ),
)
response = client.generate_grounded_content(request)

# Handle the response
print(response)

דוגמה ליצירת תשובה בשיחה חד-תורית שמבוססת על חיפוש Google

בדוגמה הבאה, הבקשה מציינת את חיפוש Google כמקור ההארקה. בדוגמה הזו נעשה שימוש בשיטה generateGroundedContent. בדוגמה הזו נעשה שימוש גם בהוראה למערכת לסיים את התשובה עם אמוג'י של פרצוף מחייך.

REST

curl -X POST \
-H "Authorization: Bearer $(gcloud auth print-access-token)" \
-H "Content-Type: application/json" \
"https://discoveryengine.googleapis.com/v1/projects/123456/locations/global:generateGroundedContent" \
-d '
{
"contents": [{
  "role": "user",
  "parts": [{
    "text": "What is Agent Search?"
}]
}],
"systemInstruction": {
   "parts": {
      "text": "Add a smiley emoji after the answer."
   }
},
"groundingSpec": {
  "groundingSources": [
  {
      "googleSearchSource": {
        "dynamicRetrievalConfig": {
               "predictor":{
                   "threshold": 0.6
               }
           }
      }
  }
 ]
},
"generationSpec": {
  "modelId": "gemini-2.5-flash"
}
}
'

תשובה

אתם אמורים לקבל תגובת JSON שדומה לתגובה הבאה (התגובה קוצרה). כדי להבין את התגובה, אפשר לעיין בנתוני הפלט.

{
 "candidates": [
   {
     "content": {
       "role": "model",
       "parts": [
         {
           "text": "Agent Search is a platform developed by Google Cloud that simplifies the creation and deployment of generative AI agents. It offers both no-code and code-first approaches, allowing developers of all skill levels to build AI-powered agents. \n\nHere are some key features of Agent Search:\n\n* **No-code interface:**  Use natural language to design and build agents without writing code.\n* **Code-first approach:**  Utilize powerful orchestration and customization capabilities.\n* **Enterprise-grade security and compliance:**  Built-in security, compliance, and governance features align with industry certifications like HIPAA, ISO 27000-series, SOC-1/2/3, VPC-SC, and CMEK.\n* **Integration with enterprise data:**  Easily ground your agents in enterprise data using Agent Search and Retrieval Augmented Generation (RAG) APIs.\n* **Pre-built templates:**  Rapidly prototype and experiment with pre-built templates for conversational AI and process automation agents.\n* **Advanced integrations:**  Supports integrations with frameworks like LlamaIndex and LangChain for enhanced AI capabilities.\n* **Natural language understanding (NLU):**  Accurate query responses and support for multiple languages.\n\nAgent Search is designed to help developers create AI agents that can:\n\n* Answer complex questions\n* Provide support and personalize user experiences\n* Automate tasks and processes\n* Interact with backend systems\n\nOverall, Agent Search is a powerful tool that makes it easier for developers to build and deploy generative AI agents, regardless of their experience level. 😊 \n"
         }
       ]
     },
     "groundingScore": 0.80400103,
     "groundingMetadata": {
       "supportChunks": [
         {
          "source": "0",
          "sourceMetadata": {
          "uri": "https://vertexaisearch.cloud.google.com/grounding-api-redirect/{unique_string}",
          "domain": "example.com"
         }
        }
       ],
       "groundingSupport": [
         {
           "claimText": "Agent Search is a platform developed by Google Cloud that simplifies the creation and deployment of generative AI agents.",
           "supportScore": 0.9541752,
           "supportChunkIndices": [
             0
           ]
         },
         {
           "claimText": "It offers both no-code and code-first approaches, allowing developers of all skill levels to build AI-powered agents.",
           "supportScore": 0.9648506,
           "supportChunkIndices": [
             0
           ]
         },
         {
           "claimText": "* **No-code interface:**  Use natural language to design and build agents without writing code.",
           "supportScore": 0.77115613,
           "supportChunkIndices": [
             0
           ]
         },
         {
           "claimText": "* **Code-first approach:**  Utilize powerful orchestration and customization capabilities.",
           "supportScore": 0.8540146,
           "supportChunkIndices": [
             0
           ]
         },
         {
           "claimText": "* **Enterprise-grade security and compliance:**  Built-in security, compliance, and governance features align with industry certifications like HIPAA, ISO 27000-series, SOC-1/2/3, VPC-SC, and CMEK.",
           "supportScore": 0.9574074,
           "supportChunkIndices": [
             0
           ]
         },
         {
           "claimText": "* **Integration with enterprise data:**  Easily ground your agents in enterprise data using Agent Search and Retrieval Augmented Generation (RAG) APIs.",
           "supportScore": 0.9533333,
           "supportChunkIndices": [
             0
           ]
         },
         {
           "claimText": "* **Pre-built templates:**  Rapidly prototype and experiment with pre-built templates for conversational AI and process automation agents.",
           "supportScore": 0.9457701,
           "supportChunkIndices": [
             0
           ]
         },
         {
           "claimText": "* **Advanced integrations:**  Supports integrations with frameworks like LlamaIndex and LangChain for enhanced AI capabilities.",
           "supportScore": 0.9541752,
           "supportChunkIndices": [
             0
           ]
         },
         {
           "claimText": "* **Natural language understanding (NLU):**  Accurate query responses and support for multiple languages.",
           "supportScore": 0.97726375,
           "supportChunkIndices": [
             0
           ]
         },
         {
           "claimText": "* Provide support and personalize user experiences",
           "supportScore": 0.8540146,
           "supportChunkIndices": [
             0
           ]
         },
         {
           "claimText": "* Automate tasks and processes",
           "supportScore": 0.82046676,
           "supportChunkIndices": [
             0
           ]
         }
       ],
       "webSearchQueries": [
         "what is Agent Search"
       ],
       "searchEntryPoint": {
         "renderedContent": "\u003cstyle\u003e\n.container {\n  align-items: center;\n  border-radius: 8px;\n  display: flex;\n  font-family: Google Sans, Roboto, sans-serif;\n  font-size: 14px;\n  line-height: 20px;\n  padding: 8px 12px;\n}\n.chip {\n  display: inline-block;\n  border: solid 1px;\n  border-radius: 16px;\n  min-width: 14px;\n  padding: 5px 16px;\n  text-align: center;\n  user-select: none;\n  margin: 0 8px;\n  -webkit-tap-highlight-color: transparent;\n}\n.carousel {\n  overflow: auto;\n  scrollbar-width: none;\n  white-space: nowrap;\n  margin-right: -12px;\n}\n.headline {\n  display: flex;\n  margin-right: 4px;\n}\n.gradient-container {\n  position: relative;\n}\n.gradient {\n  position: absolute;\n  transform: translate(3px, -9px);\n  height: 36px;\n  width: 9px;\n}\n@media (prefers-color-scheme: light) {\n  .container {\n    background-color: #fafafa;\n    box-shadow: 0 0 0 1px #0000000f;\n  }\n  .headline-label {\n    color: #1f1f1f;\n  }\n  .chip {\n    background-color: #ffffff;\n    border-color: #d2d2d2;\n    color: #5e5e5e;\n    text-decoration: none;\n  }\n  .chip:hover {\n    background-color: #f2f2f2;\n  }\n  .chip:focus {\n    background-color: #f2f2f2;\n  }\n  .chip:active {\n    background-color: #d8d8d8;\n    border-color: #b6b6b6;\n  }\n  .logo-dark {\n    display: none;\n  }\n  .gradient {\n    background: linear-gradient(90deg, #fafafa 15%, #fafafa00 100%);\n  }\n}\n@media (prefers-color-scheme: dark) {\n  .container {\n    background-color: #1f1f1f;\n    box-shadow: 0 0 0 1px #ffffff26;\n  }\n  .headline-label {\n    color: #fff;\n  }\n  .chip {\n    background-color: #2c2c2c;\n    border-color: #3c4043;\n    color: #fff;\n    text-decoration: none;\n  }\n  .chip:hover {\n    background-color: #353536;\n  }\n  .chip:focus {\n    background-color: #353536;\n  }\n  .chip:active {\n    background-color: #464849;\n    border-color: #53575b;\n  }\n  .logo-light {\n    display: none;\n  }\n  .gradient {\n    background: linear-gradient(90deg, #1f1f1f 15%, #1f1f1f00 100%);\n  }\n}\n\u003c/style\u003e\n\u003cdiv class=\"container\"\u003e\n  \u003cdiv class=\"headline\"\u003e\n    \u003csvg class=\"logo-light\" width=\"18\" height=\"18\" viewBox=\"9 9 35 35\" fill=\"none\" xmlns=\"http://www.w3.org/2000/svg\"\u003e\n      \u003cpath fill-rule=\"evenodd\" clip-rule=\"evenodd\" d=\"M42.8622 27.0064C42.8622 25.7839 42.7525 24.6084 42.5487 23.4799H26.3109V30.1568H35.5897C35.1821 32.3041 33.9596 34.1222 32.1258 35.3448V39.6864H37.7213C40.9814 36.677 42.8622 32.2571 42.8622 27.0064V27.0064Z\" fill=\"#4285F4\"/\u003e\n      \u003cpath fill-rule=\"evenodd\" clip-rule=\"evenodd\" d=\"M26.3109 43.8555C30.9659 43.8555 34.8687 42.3195 37.7213 39.6863L32.1258 35.3447C30.5898 36.3792 28.6306 37.0061 26.3109 37.0061C21.8282 37.0061 18.0195 33.9811 16.6559 29.906H10.9194V34.3573C13.7563 39.9841 19.5712 43.8555 26.3109 43.8555V43.8555Z\" fill=\"#34A853\"/\u003e\n      \u003cpath fill-rule=\"evenodd\" clip-rule=\"evenodd\" d=\"M16.6559 29.8904C16.3111 28.8559 16.1074 27.7588 16.1074 26.6146C16.1074 25.4704 16.3111 24.3733 16.6559 23.3388V18.8875H10.9194C9.74388 21.2072 9.06992 23.8247 9.06992 26.6146C9.06992 29.4045 9.74388 32.022 10.9194 34.3417L15.3864 30.8621L16.6559 29.8904V29.8904Z\" fill=\"#FBBC05\"/\u003e\n      \u003cpath fill-rule=\"evenodd\" clip-rule=\"evenodd\" d=\"M26.3109 16.2386C28.85 16.2386 31.107 17.1164 32.9095 18.8091L37.8466 13.8719C34.853 11.082 30.9659 9.3736 26.3109 9.3736C19.5712 9.3736 13.7563 13.245 10.9194 18.8875L16.6559 23.3388C18.0195 19.2636 21.8282 16.2386 26.3109 16.2386V16.2386Z\" fill=\"#EA4335\"/\u003e\n    \u003c/svg\u003e\n    \u003csvg class=\"logo-dark\" width=\"18\" height=\"18\" viewBox=\"0 0 48 48\" xmlns=\"http://www.w3.org/2000/svg\"\u003e\n      \u003ccircle cx=\"24\" cy=\"23\" fill=\"#FFF\" r=\"22\"/\u003e\n      \u003cpath d=\"M33.76 34.26c2.75-2.56 4.49-6.37 4.49-11.26 0-.89-.08-1.84-.29-3H24.01v5.99h8.03c-.4 2.02-1.5 3.56-3.07 4.56v.75l3.91 2.97h.88z\" fill=\"#4285F4\"/\u003e\n      \u003cpath d=\"M15.58 25.77A8.845 8.845 0 0 0 24 31.86c1.92 0 3.62-.46 4.97-1.31l4.79 3.71C31.14 36.7 27.65 38 24 38c-5.93 0-11.01-3.4-13.45-8.36l.17-1.01 4.06-2.85h.8z\" fill=\"#34A853\"/\u003e\n      \u003cpath d=\"M15.59 20.21a8.864 8.864 0 0 0 0 5.58l-5.03 3.86c-.98-2-1.53-4.25-1.53-6.64 0-2.39.55-4.64 1.53-6.64l1-.22 3.81 2.98.22 1.08z\" fill=\"#FBBC05\"/\u003e\n      \u003cpath d=\"M24 14.14c2.11 0 4.02.75 5.52 1.98l4.36-4.36C31.22 9.43 27.81 8 24 8c-5.93 0-11.01 3.4-13.45 8.36l5.03 3.85A8.86 8.86 0 0 1 24 14.14z\" fill=\"#EA4335\"/\u003e\n    \u003c/svg\u003e\n    \u003cdiv class=\"gradient-container\"\u003e\u003cdiv class=\"gradient\"\u003e\u003c/div\u003e\u003c/div\u003e\n  \u003c/div\u003e\n  \u003cdiv class=\"carousel\"\u003e\n    \u003ca class=\"chip\" href=\"https://www.google.com/search?q=what+is+ai-applications&client=app-vertex-grounding&safesearch=active\"\u003ewhat is Agent Search\u003c/a\u003e\n  \u003c/div\u003e\n\u003c/div\u003e\n"
       },
       "retrievalMetadata": [
         {
           "source": "GOOGLE_SEARCH",
           "dynamicRetrievalMetadata": {
             "predictorMetadata": {
               "version": "V1_INDEPENDENT",
               "prediction": 0.671875
             }
           }
         }
       ]
     }
   }
 ]
}

יצירת תשובה שמבוססת על מקורות בכמה תורות

בתהליך יצירת תשובות רב-שלבי, בכל בקשה צריך לשלוח את כל הטקסט שהועבר בין המשתמש לבין המודל בכל השלבים הקודמים. כך נשמרת הרציפות וההקשר כדי ליצור את התשובה להנחיה האחרונה.

כדי לקבל תשובה שמבוססת על מקורות באמצעות יצירת תשובה רב-שלבית, מבצעים את הפעולות הבאות:

REST

בדוגמאות הבאות אפשר לראות איך שולחים טקסט של הנחיה למעקב בכמה תורות. בדוגמאות האלה נעשה שימוש בשיטה generateGroundedContent והתשובות מבוססות על חיפוש Google. אתם יכולים להשתמש בשלבים דומים כדי ליצור תשובות מבוססות-קרקע באמצעות מקורות אחרים לביסוס.

שולחים את ההנחיה הראשונה בבקשת curl הבאה.
```
curl -X POST \
-H "Authorization: Bearer $(gcloud auth print-access-token)" \
-H "Content-Type: application/json" \
"https://discoveryengine.googleapis.com/v1/projects/PROJECT_NUMBER/locations/global:generateGroundedContent" \
-d '
{
"contents": [
 {
   "role": "user",
   "parts": [
     {
       "text": "PROMPT_TEXT_TURN_1"
     }
   ]
 }
],
"systemInstruction": {
   "parts": {
       "text": "SYSTEM_INSTRUCTION_TURN_1"
   }
},
"groundingSpec": {
 "groundingSources": [
   {
     "googleSearchSource": {}
   }
 ]
},
"generationSpec": {
 "modelId": "MODEL_ID",
 "temperature": TEMPERATURE,
 "topP": TOP_P,
 "topK": TOP_K
},
"user_context": {
 "languageCode: "LANGUAGE_CODE",
 "latLng": {
   "latitude": LATITUDE,
   "longitude": LONGITUDE
 },
}
}'
```
מחליפים את מה שכתוב בשדות הבאים:
- ‫PROJECT_NUMBER: מספר הפרויקט ב- Google Cloud .
- ‫PROMPT_TEXT_TURN_1: טקסט ההנחיה מהמשתמש בתור הראשון.
- ‫SYSTEM_INSTRUCTION_TURN_1: שדה אופציונלי שבו אפשר לספק הקדמה או הקשר נוסף. כדי ליצור תשובה רב-שלבית, צריך לספק את הוראות המערכת בכל שלב.
- ‫MODEL_ID: שדה אופציונלי להגדרת מזהה המודל של מודל Gemini שבו רוצים להשתמש כדי ליצור את התשובה המבוססת. רשימה של מזהי המודלים הזמינים מופיעה במאמר מודלים של Gemini במסמכי המוצר בנושא AI גנרטיבי ב-Vertex AI.
- ‫TEMPERATURE: שדה אופציונלי להגדרת רמת האקראיות שמשמשת לדגימה. ‫Google ממליצה על רמת אקראיות של 0.0. מידע נוסף זמין במאמר פרמטרים של מודל Gemini.
- ‫TOP_P: שדה אופציונלי להגדרת ערך top-P של המודל. מידע נוסף זמין במאמר פרמטרים של מודל Gemini.
- ‫TOP_K: שדה אופציונלי להגדרת ערך ה-K העליון של המודל. מידע נוסף זמין במאמר פרמטרים של מודל Gemini.
- ‫LANGUAGE_CODE: שדה אופציונלי שאפשר להשתמש בו כדי להגדיר את השפה של התשובה שנוצרה ושל הטקסט של החלק שמוחזר. אם אי אפשר לקבוע את השפה מהשאילתה, נעשה שימוש בשדה הזה. ערך ברירת המחדל הוא en. רשימת קודי שפה זמינה במאמר שפות.
- ‫LATITUDE: שדה אופציונלי להגדרת קו הרוחב. מזינים את הערך במעלות עשרוניות – לדוגמה, -25.34.
- ‫LONGITUDE: שדה אופציונלי להגדרת קו האורך. מזינים את הערך במעלות עשרוניות – לדוגמה, 131.04.
תשובה

אתם אמורים לקבל תגובת JSON שדומה לתגובה הבאה (התגובה קוצרה). כדי להבין את התגובה, אפשר לעיין בנתוני הפלט.
```
{
"candidates": [
 {
   "content": {
     "role": "model",
     "parts": [
       {
         "text": "ANSWER_TEXT_TURN_1"
       }
     ]
   },
   "groundingScore": GROUNDING_SCORE,
   "groundingMetadata": {
     "supportChunks": [],
     "groundingSupport": [
       {
         "claimText": "CLAIM_TEXT_1",
         "supportChunkIndices": [
           0,
           1
         ]
       },
       {
         "claimText": "CLAIM_TEXT_2",
         "supportChunkIndices": [
           1
         ]
       }
     ],
     "webSearchQueries": [
       "QUERY_BUILT_FROM_USER_PROMPT"
     ],
     "searchEntryPoint": {
       "renderedContent": "RENDERED_CONTENT"
     }
   }
 }
]
} 
```
שולחים את ההנחיה השנייה כהנחיית המשך. מוסיפים את ההנחיה הראשונה מהמשתמש ואת התשובה התואמת מהמודל כדי לספק הקשר.
```
curl -X POST \
-H "Authorization: Bearer $(gcloud auth print-access-token)" \
-H "Content-Type: application/json" \
"https://discoveryengine.googleapis.com/v1/projects/PROJECT_NUMBER/locations/global:generateGroundedContent" \
-d '
{
"contents": [
 {
   "role": "user",
   "parts": [
     {
       "text": "PROMPT_TEXT_TURN_1"
     }
   ]
 },
 {
   "role": "model",
   "parts": [
     {
       "text": "ANSWER_TEXT_TURN_1"
     }
   ]
 },
 {
   "role": "user",
   "parts": [
     {
       "text": "PROMPT_TEXT_TURN_2"
     }
   ]
 }
],
"systemInstruction": {
   "parts": {
       "text": "SYSTEM_INSTRUCTION_TURN_2"
   }
},
"groundingSpec": {
 "groundingSources": [
   {
     "googleSearchSource": {}
   }
 ]
},
"generationSpec": {
 "modelId": "MODEL_ID",
 "temperature": TEMPERATURE,
 "topP": TOP_P,
 "topK": TOP_K
},
"user_context": {
 "languageCode: "LANGUAGE_CODE",
 "latLng": {
   "latitude": LATITUDE,
   "longitude": LONGITUDE
 },
}
}'
```
מחליפים את מה שכתוב בשדות הבאים:
- ‫PROJECT_NUMBER: מספר הפרויקט ב- Google Cloud .
- ‫PROMPT_TEXT_TURN_1: טקסט ההנחיה מהמשתמש בתור הראשון.
- ‫ANSWER_TEXT_TURN_1: טקסט התשובה מהמודל בפנייה הראשונה.
- ‫PROMPT_TEXT_TURN_2: טקסט ההנחיה מהמשתמש בתור השני.
- ‫SYSTEM_INSTRUCTION_TURN_2: שדה אופציונלי שבו אפשר לספק הקדמה או הקשר נוסף. כדי ליצור תשובה רב-שלבית, צריך לספק את הוראות המערכת בכל שלב.
- ‫MODEL_ID: שדה אופציונלי להגדרת מזהה המודל של מודל Gemini שבו רוצים להשתמש כדי ליצור את התשובה המבוססת. רשימה של מזהי המודלים הזמינים מופיעה במאמר מודלים של Gemini במסמכי המוצר בנושא AI גנרטיבי ב-Vertex AI.
- ‫TEMPERATURE: שדה אופציונלי להגדרת רמת האקראיות שמשמשת לדגימה. ‫Google ממליצה על רמת אקראיות של 0.0. מידע נוסף זמין במאמר פרמטרים של מודל Gemini.
- ‫TOP_P: שדה אופציונלי להגדרת ערך top-P של המודל. מידע נוסף זמין במאמר פרמטרים של מודל Gemini.
- ‫TOP_K: שדה אופציונלי להגדרת ערך ה-K העליון של המודל. מידע נוסף זמין במאמר פרמטרים של מודל Gemini.
- ‫LANGUAGE_CODE: שדה אופציונלי שאפשר להשתמש בו כדי להגדיר את השפה של התשובה שנוצרה ושל הטקסט של החלק שמוחזר. אם אי אפשר לקבוע את השפה מהשאילתה, נעשה שימוש בשדה הזה. ערך ברירת המחדל הוא en. רשימת קודי שפה זמינה במאמר שפות.
- ‫LATITUDE: שדה אופציונלי להגדרת קו הרוחב. מזינים את הערך במעלות עשרוניות – לדוגמה, -25.34.
- ‫LONGITUDE: שדה אופציונלי להגדרת קו האורך. מזינים את הערך במעלות עשרוניות – לדוגמה, 131.04.
תשובה

אתם אמורים לקבל תגובת JSON שדומה לתגובה הבאה (התגובה קוצרה). כדי להבין את התגובה, אפשר לעיין בנתוני הפלט.
```
{
"candidates": [
 {
   "content": {
     "role": "model",
     "parts": [
       {
         "text": "ANSWER_TEXT_TURN_2"
       }
     ]
   },
   "groundingScore": GROUNDING_SCORE,
   "groundingMetadata": {
     "supportChunks": [],
     "groundingSupport": [
       {
         "claimText": "CLAIM_TEXT_1",
         "supportChunkIndices": [
           0
         ]
       },
       {
         "claimText": "CLAIM_TEXT_2",
         "supportChunkIndices": [
           1,
           2
         ]
       }
     ],
     "webSearchQueries": [
       "QUERY_BUILT_FROM_USER_PROMPT"
     ],
     "searchEntryPoint": {
       "renderedContent": "RENDERED_CONTENT"
     }
   }
 }
]
}   
```
כדי לקבל תשובות נוספות, חוזרים על התהליך הזה. בכל תור, מוסיפים את כל ההנחיות הקודמות מהמשתמש ואת התשובות התואמות מהמודל.

דוגמה ליצירת תשובה רב-שלבית

בדוגמה הבאה, הבקשה מציינת שלושה טקסטים של עובדות מוטמעות כמקור לביסוס התשובות בשני תורות. בדוגמה הזו נעשה שימוש בשיטה generateGroundedContent. בדוגמה הזו נעשה שימוש גם בהוראת מערכת כדי לסיים את התשובה בתור הראשון עם אמוג'י של סמיילי.

REST

שולחים את ההנחיה הראשונה בבקשת curl הבאה.

curl -X POST \
-H "Authorization: Bearer $(gcloud auth print-access-token)" \
-H "Content-Type: application/json" \
"https://discoveryengine.googleapis.com/v1/projects/123456/locations/global:generateGroundedContent" \
-d '
{
"contents": [
 {
   "role": "user",
   "parts": [
     {
       "text": "Summarize what happened in 2023 in one paragraph."
     }
   ]
 }
],
"systemInstruction": {
  "parts": {
      "text": "Add a smiley emoji after the answer."
  }
},
"grounding_spec": {
 "grounding_sources": [
   {
     "inline_source": {
       "grounding_facts": [
         {
           "fact_text": "In 2023, the world population surpassed 8 billion. This milestone marked a significant moment in human history, highlighting both the rapid growth of our species and the challenges of resource management and sustainability in the years to come.",
           "attributes": {
             "title": "title_1",
             "uri": "some-uri-1"
           }
         }
       ]
     }
   },
   {
     "inline_source": {
       "grounding_facts": [
         {
           "fact_text": "In 2023, global e-commerce sales reached an estimated $5.7 trillion. The continued rise of online shopping solidified its position as a dominant force in retail, with major implications for traditional brick-and-mortar stores and the logistics networks supporting worldwide deliveries.",
           "attributes": {
             "title": "title_2",
             "uri": "some-uri-2"
           }
         }
       ]
     }
   },
   {
     "inline_source": {
       "grounding_facts": [
         {
           "fact_text": "In 2023, the global average surface temperature was approximately 0.2 degrees Celsius higher than the 20th-century average. This continued the worrying trend of global warming, underscoring the urgency of worldwide climate initiatives, carbon reduction efforts, and investment in renewable energy sources.",
           "attributes": {
             "title": "title_3",
             "uri": "some-uri-3"
           }
         }
       ]
     }
   }
 ]
},
"generationSpec": {
 "modelId": "gemini-2.5-flash"
}
}'

תשובה

אתם אמורים לקבל תגובת JSON שדומה לתגובה הבאה (התגובה קוצרה). כדי להבין את התגובה, אפשר לעיין בנתוני הפלט.

{
"candidates": [
 {
   "content": {
     "role": "model",
     "parts": [
       {
         "text": "In 2023, the global average surface temperature increased, the world population surpassed 8 billion, and global e-commerce sales reached an estimated $5.7 trillion.  😊 \n"
       }
     ]
   },
   "groundingScore": 1,
   "groundingMetadata": {
     "supportChunks": [
       {
         "chunkText": "In 2023, global e-commerce sales reached an estimated $5.7 trillion. The continued rise of online shopping solidified its position as a dominant force in retail, with major implications for traditional brick-and-mortar stores and the logistics networks supporting worldwide deliveries. ",
         "source": "1",
         "sourceMetadata": {
           "uri": "some-uri-2",
           "title": "title_2"
         }
       },
       {
         "chunkText": "In 2023, the world population surpassed 8 billion. This milestone marked a significant moment in human history, highlighting both the rapid growth of our species and the challenges of resource management and sustainability in the years to come. ",
         "source": "0",
         "sourceMetadata": {
           "uri": "some-uri-1",
           "title": "title_1"
         }
       },
       {
         "chunkText": "In 2023, the global average surface temperature was approximately 0.2 degrees Celsius higher than the 20th-century average. This continued the worrying trend of global warming, underscoring the urgency of worldwide climate initiatives, carbon reduction efforts, and investment in renewable energy sources. ",
         "source": "2",
         "sourceMetadata": {
           "title": "title_3",
           "uri": "some-uri-3"
         }
       }
     ],
     "groundingSupport": [
       {
         "claimText": "In 2023, the global average surface temperature increased, the world population surpassed 8 billion, and global e-commerce sales reached an estimated $5.7 trillion.",
         "supportScore": 1,
         "supportChunkIndices": [
           0,
           1,
           2
         ]
       }
     ]
   }
 }
]
}

שולחים את ההנחיה השנייה כהנחיית המשך. מוסיפים את ההנחיה הראשונה מהמשתמש ואת התשובה התואמת מהמודל כדי לספק הקשר.

curl -X POST \
-H "Authorization: Bearer $(gcloud auth print-access-token)" \
-H "Content-Type: application/json" \
"https://discoveryengine.googleapis.com/v1/projects/123456/locations/global:generateGroundedContent" \
-d '
{
"contents": [
 {
   "role": "user",
   "parts": [
     {
       "text": "Summarize what happened in 2023 in one paragraph."
     }
   ]
 },
 {
   "role": "model",
   "parts": [
     {
       "text": "In 2023, the global average surface temperature increased, the world population surpassed 8 billion, and global e-commerce sales reached an estimated $5.7 trillion.  😊 \n"
     }
   ]
 },
 {
   "role": "user",
   "parts": [
     {
       "text": "Rephrase the answer in an abstracted list."
     }
   ]
 }
],
"grounding_spec": {
 "grounding_sources": [
   {
     "inline_source": {
       "grounding_facts": [
         {
           "fact_text": "In 2023, the world population surpassed 8 billion. This milestone marked a significant moment in human history, highlighting both the rapid growth of our species and the challenges of resource management and sustainability in the years to come.",
           "attributes": {
             "title": "title_1",
             "uri": "some-uri-1"
           }
         }
       ]
     }
   },
   {
     "inline_source": {
       "grounding_facts": [
         {
           "fact_text": "In 2023, global e-commerce sales reached an estimated $5.7 trillion. The continued rise of online shopping solidified its position as a dominant force in retail, with major implications for traditional brick-and-mortar stores and the logistics networks supporting worldwide deliveries.",
           "attributes": {
             "title": "title_2",
             "uri": "some-uri-2"
           }
         }
       ]
     }
   },
   {
     "inline_source": {
       "grounding_facts": [
         {
           "fact_text": "In 2023, the global average surface temperature was approximately 0.2 degrees Celsius higher than the 20th-century average. This continued the worrying trend of global warming, underscoring the urgency of worldwide climate initiatives, carbon reduction efforts, and investment in renewable energy sources.",
           "attributes": {
             "title": "title_3",
             "uri": "some-uri-3"
           }
         }
       ]
     }
   }
 ]
},
"generationSpec": {
 "modelId": "gemini-2.5-flash"
}
}'

תשובה

אתם אמורים לקבל תגובת JSON שדומה לתגובה הבאה (התגובה קוצרה). כדי להבין את התגובה, אפשר לעיין בנתוני הפלט.

{
"candidates": [
 {
   "content": {
     "role": "model",
     "parts": [
       {
         "text": "- The global average surface temperature increased in 2023.\n- The world population surpassed 8 billion in 2023.\n- Global e-commerce sales reached an estimated $5.7 trillion in 2023. \n"
       }
     ]
   },
   "groundingScore": 0.99073017,
   "groundingMetadata": {
     "supportChunks": [
       {
         "chunkText": "In 2023, the global average surface temperature was approximately 0.2 degrees Celsius higher than the 20th-century average. This continued the worrying trend of global warming, underscoring the urgency of worldwide climate initiatives, carbon reduction efforts, and investment in renewable energy sources. ",
         "source": "2",
         "sourceMetadata": {
           "uri": "some-uri-3",
           "title": "title_3"
         }
       },
       {
         "chunkText": "In 2023, the world population surpassed 8 billion. This milestone marked a significant moment in human history, highlighting both the rapid growth of our species and the challenges of resource management and sustainability in the years to come. ",
         "source": "0",
         "sourceMetadata": {
           "uri": "some-uri-1",
           "title": "title_1"
         }
       },
       {
         "chunkText": "In 2023, global e-commerce sales reached an estimated $5.7 trillion. The continued rise of online shopping solidified its position as a dominant force in retail, with major implications for traditional brick-and-mortar stores and the logistics networks supporting worldwide deliveries. ",
         "source": "1",
         "sourceMetadata": {
           "title": "title_2",
           "uri": "some-uri-2"
         }
       }
     ],
     "groundingSupport": [
       {
         "claimText": "- The global average surface temperature increased in 2023.",
         "supportScore": 0.9883382,
         "supportChunkIndices": [
           0
         ]
       },
       {
         "claimText": "- The world population surpassed 8 billion in 2023.",
         "supportScore": 0.9919262,
         "supportChunkIndices": [
           1
         ]
       },
       {
         "claimText": "- Global e-commerce sales reached an estimated $5.7 trillion in 2023.",
         "supportScore": 0.9919262,
         "supportChunkIndices": [
           2
         ]
       }
     ]
   }
 }
]
}

הצגת תשובות מבוססות באופן שוטף

אתם יכולים לבחור להזרים את התשובות מהמודל. האפשרות הזו שימושית במקרים שבהם התשובה ארוכה במיוחד ושליחת התשובה כולה בבת אחת גורמת לעיכוב משמעותי. הזרמת התשובה מפרקת את התשובה למערך של כמה מועמדים שמכילים חלקים עוקבים של טקסט התשובה.

כדי לקבל תשובה מבוססת קרקע שמוזרמת, פועלים לפי השלבים הבאים:

REST

בדוגמה הבאה אפשר לראות איך להזרים תשובה מבוססת. בדוגמה הזו נעשה שימוש בשיטה streamGenerateGroundedContent והתשובה מבוססת על חיפוש Google בלי הגדרת אחזור דינמי. אפשר להשתמש בשלבים דומים כדי ליצור תשובות שמבוססות על מקורות באמצעות מקורות אחרים.

שולחים את ההנחיה בבקשת curl הבאה.
```
curl -X POST \
-H "Authorization: Bearer $(gcloud auth print-access-token)" \
-H "Content-Type: application/json" \
"https://discoveryengine.googleapis.com/v1alpha/projects/PROJECT_NUMBER/locations/global:streamGenerateGroundedContent" \
-d '
[
{
 "contents": [
   {
     "role": "user",
     "parts": [
       {
         "text": "PROMPT_TEXT"
       }
     ]
   }
 ],
 "systemInstruction": {
     "parts": {
         "text": "SYSTEM_INSTRUCTION"
     }
 },
 "groundingSpec": {
   "groundingSources": [
     {
       "googleSearchSource": {}
     }
   ]
 },
"generationSpec": {
 "modelId": "MODEL_ID",
 "temperature": TEMPERATURE,
 "topP": TOP_P,
 "topK": TOP_K
},
"user_context": {
 "languageCode: "LANGUAGE_CODE",
 "latLng": {
   "latitude": LATITUDE,
   "longitude": LONGITUDE
 },
}
}
]'
```
מחליפים את מה שכתוב בשדות הבאים:
- ‫PROJECT_NUMBER: מספר הפרויקט ב- Google Cloud .
- ‫PROMPT_TEXT: ההנחיה מהמשתמש.
- ‫SYSTEM_INSTRUCTION: שדה אופציונלי שבו אפשר לספק הקדמה או הקשר נוסף.
- ‫MODEL_ID: שדה אופציונלי להגדרת מזהה המודל של מודל Gemini שבו רוצים להשתמש כדי ליצור את התשובה המבוססת. רשימה של מזהי המודלים הזמינים מופיעה במאמר מודלים של Gemini במסמכי המוצר בנושא AI גנרטיבי ב-Vertex AI.
- ‫TEMPERATURE: שדה אופציונלי להגדרת רמת האקראיות שמשמשת לדגימה. ‫Google ממליצה על רמת אקראיות של 0.0. מידע נוסף זמין במאמר פרמטרים של מודל Gemini.
- ‫TOP_P: שדה אופציונלי להגדרת ערך top-P של המודל. מידע נוסף זמין במאמר פרמטרים של מודל Gemini.
- ‫TOP_K: שדה אופציונלי להגדרת ערך ה-K העליון של המודל. מידע נוסף זמין במאמר פרמטרים של מודל Gemini.
- ‫LANGUAGE_CODE: שדה אופציונלי שאפשר להשתמש בו כדי להגדיר את השפה של התשובה שנוצרה ושל הטקסט של החלק שמוחזר. אם אי אפשר לקבוע את השפה מהשאילתה, נעשה שימוש בשדה הזה. ערך ברירת המחדל הוא en. רשימת קודי שפה זמינה במאמר שפות.
- ‫LATITUDE: שדה אופציונלי להגדרת קו הרוחב. מזינים את הערך במעלות עשרוניות – לדוגמה, -25.34.
- ‫LONGITUDE: שדה אופציונלי להגדרת קו האורך. מזינים את הערך במעלות עשרוניות – לדוגמה, 131.04.
תשובה

אתם אמורים לקבל תגובת JSON שדומה לתגובה הבאה (התגובה קוצרה). כדי להבין את התגובה, אפשר לעיין בנתוני הפלט.
```
[{
 "candidates": [
   {
     "content": {
       "role": "model",
       "parts": [
         {
           "text": "ANSWER_TEXT_PART_1"
         }
       ]
     }
   }
 ]
},
{
 "candidates": [
   {
     "content": {
       "role": "model",
       "parts": [
         {
           "text": "ANSWER_TEXT_PART_2"
         }
       ]
     }
   }
 ]
},
{
 "candidates": [
   {
     "content": {
       "role": "model",
       "parts": [
         {
           "text": "ANSWER_TEXT_PART_3"
         }
       ]
     }
   }
 ]
},
{
 "candidates": [
   {
     "groundingMetadata": {
       "supportChunks": [
         {
          "source": "0",
          "sourceMetadata": {
            "uri": "REDIRECTION_URI",
            "domain": "PUBLISHER_DOMAIN"
          }
         }
       ],
       "groundingSupport": [
         {
           "claimText": "CLAIM_TEXT_1",
           "supportChunkIndices": [
             0
           ]
         }
       ],
       "webSearchQueries": [
         "QUERY_BUILT_FROM_USER_PROMPT"
       ],
       "searchEntryPoint": {
         "renderedContent": "RENDERED_CONTENT"
       }
     }
   }
 ]
}]
```

Python

from google.cloud import discoveryengine_v1 as discoveryengine

# TODO(developer): Uncomment these variables before running the sample.
# project_id = "YOUR_PROJECT_ID"

client = discoveryengine.GroundedGenerationServiceClient()

request = discoveryengine.GenerateGroundedContentRequest(
    # The full resource name of the location.
    # Format: projects/{project_number}/locations/{location}
    location=client.common_location_path(project=project_number, location="global"),
    generation_spec=discoveryengine.GenerateGroundedContentRequest.GenerationSpec(
        model_id="gemini-2.5-flash",
    ),
    # Conversation between user and model
    contents=[
        discoveryengine.GroundedGenerationContent(
            role="user",
            parts=[
                discoveryengine.GroundedGenerationContent.Part(
                    text="Summarize how to delete a data store in Vertex AI Agent Builder?"
                )
            ],
        )
    ],
    grounding_spec=discoveryengine.GenerateGroundedContentRequest.GroundingSpec(
        grounding_sources=[
            discoveryengine.GenerateGroundedContentRequest.GroundingSource(
                google_search_source=discoveryengine.GenerateGroundedContentRequest.GroundingSource.GoogleSearchSource()
            ),
        ]
    ),
)
responses = client.stream_generate_grounded_content(iter([request]))

for response in responses:
    # Handle the response
    print(response)

דוגמה לשידור תשובות מבוססות

בדוגמה הבאה, הבקשה מציינת את חיפוש Google כמקור המידע להזרמת תשובה בלי הגדרת אחזור דינמי. התשובה שמוזרמת מחולקת לכמה תשובות אפשריות. בדוגמה הזו נעשה שימוש בשיטה streamGenerateGroundedContent.

REST

curl -X POST \
-H "Authorization: Bearer $(gcloud auth print-access-token)" \
-H "Content-Type: application/json" \
"https://discoveryengine.googleapis.com/v1alpha/projects/123456/locations/global:streamGenerateGroundedContent" \
-d '
[
{
  "contents": [
    {
      "role": "user",
      "parts": [
        {
          "text": "Summarize How to delete a data store in Agent Search?"
        }
      ]
    }
  ],
  "groundingSpec": {
    "groundingSources": [
      {
        "googleSearchSource": {}
      }
    ]
  },
  "generationSpec": {
    "modelId": "gemini-2.5-flash"
  }
}
]'

תשובה

אתם אמורים לקבל תגובת JSON שדומה לתגובה הבאה (התגובה קוצרה). כדי להבין את התגובה, אפשר לעיין בנתוני הפלט.

[{
"candidates": [
  {
    "content": {
      "role": "model",
      "parts": [
        {
          "text": "To"
        }
      ]
    }
  }
]
}
,
{
"candidates": [
  {
    "content": {
      "role": "model",
      "parts": [
        {
          "text": " delete a data store in Agent Search, you must first purge all data"
        }
      ]
    }
  }
]
}
,
{
"candidates": [
  {
    "content": {
      "role": "model",
      "parts": [
        {
          "text": " from the data store. "
        }
      ]
    }
  }
]
}
,
{
"candidates": [
  {
    "groundingMetadata": {
      "supportChunks": [
        {
          "source": "0",
          "sourceMetadata": {
            "uri": "https://vertexaisearch.cloud.google.com/grounding-api-redirect/{unique_string}",
            "domain": "cloud.google.com"
          }
        }
      ],
      "groundingSupport": [
        {
          "claimText": "To delete a data store in Agent Search, you must first purge all data from the data store. ",
          "supportChunkIndices": [
            0
          ]
        }
      ],
      "webSearchQueries": [
        "how to delete a data store in Agent Search"
      ],
      "searchEntryPoint": {
      }
    }
  }
]
}
,
{
"candidates": [
  {
    "content": {
      "role": "model",
      "parts": [
        {
          "text": "You can purge data from a data store"
        }
      ]
    }
  }
]
}
,
{
"candidates": [
  {
    "content": {
      "role": "model",
      "parts": [
        {
          "text": " using the Google Cloud console or the command line. "
        }
      ]
    }
  }
]
}
,
{
"candidates": [
  {
    "groundingMetadata": {
      "groundingSupport": [
        {
          "claimText": "You can purge data from a data store using the Google Cloud console or the command line. ",
          "supportChunkIndices": [
            0
          ]
        }
      ]
    }
  }
]
}
,
{
"candidates": [
  {
    "content": {
      "role": "model",
      "parts": [
        {
          "text": "Once the data is purged, you can delete the data store. "
        }
      ]
    }
  }
]
}
,
{
"candidates": [
  {
    "groundingMetadata": {
      "groundingSupport": [
        {
          "claimText": "Once the data is purged, you can delete the data store. ",
          "supportChunkIndices": [
            0
          ]
        }
      ]
    }
  }
]
}
,
{
"candidates": [
  {
    "content": {
      "role": "model",
      "parts": [
        {
          "text": "You cannot delete"
        }
      ]
    }
  }
]
}
,
{
"candidates": [
  {
    "content": {
      "role": "model",
      "parts": [
        {
          "text": " a data store that is connected to an app. "
        }
      ]
    }
  }
]
}
,
{
"candidates": [
  {
    "groundingMetadata": {
      "groundingSupport": [
        {
          "claimText": "You cannot delete a data store that is connected to an app. ",
          "supportChunkIndices": [
            0
          ]
        }
      ]
    }
  }
]
}
,
{
"candidates": [
  {
    "content": {
      "role": "model",
      "parts": [
        {
          "text": "You must first delete the app that the data store is connected to. "
        }
      ]
    }
  }
]
}
,
{
"candidates": [
  {
    "groundingMetadata": {
      "groundingSupport": [
        {
          "claimText": "You must first delete the app that the data store is connected to. ",
          "supportChunkIndices": [
            0
          ]
        }
      ]
    }
  }
]
}
,
{
"candidates": [
  {
    "content": {
      "role": "model",
      "parts": [
        {
          "text": "You also"
        }
      ]
    }
  }
]
}
,
{
"candidates": [
  {
    "content": {
      "role": "model",
      "parts": [
        {
          "text": " cannot delete a data store that is in the process of upgrading or downgrading. "
        }
      ]
    }
  }
]
}
,
{
"candidates": [
  {
    "groundingMetadata": {
      "groundingSupport": [
        {
          "claimText": "You also cannot delete a data store that is in the process of upgrading or downgrading. ",
          "supportChunkIndices": [
            0
          ]
        }
      ]
    }
  }
]
}
,
{
"candidates": [
  {
    "content": {
      "role": "model",
      "parts": [
        {
          "text": "You must wait for the upgrade or downgrade to complete before deleting the data store."
        }
      ]
    }
  }
]
}
,
{
"candidates": [
  {
    "content": {
      "role": "model",
      "parts": [
        {
          "text": " \n"
        }
      ]
    }
  }
]
}
,
{
"candidates": [
  {
    "groundingMetadata": {
      "groundingSupport": [
        {
          "claimText": "You must wait for the upgrade or downgrade to complete before deleting the data store. \n",
          "supportChunkIndices": [
            0
          ]
        }
      ]
    }
  }
]
}
]

המאמרים הבאים

כאן מוסבר איך להשתמש בשיטת היצירה המבוססת על נתונים עם ממשקי API אחרים של RAG כדי ליצור תשובות מבוססות-נתונים מנתונים לא מובנים.

יצירת תשובות שמבוססות על מקורות באמצעות RAG קל לארגן דפים בעזרת אוספים אפשר לשמור ולסווג תוכן על סמך ההעדפות שלך.

הסברים על המונחים

מונחים שקשורים ל-RAG

נתוני קלט

נתוני פלט

יצירת תשובה שמבוססת על מקורות בשיחה אחת

התשובה תהיה מבוססת על טקסט מוטבע ומאגר נתונים של חיפוש מבוסס סוכנים

REST

תשובה

דוגמה ליצירת תשובה בשיחה חד-תורית שמבוססת על טקסט מוטבע ועל חיפוש באמצעות סוכן

REST

תשובה

Python

יצירת תשובה מעוגנת באמצעות חיפוש Google

אחזור דינמי

ציון וסף של חיזוי אחזור דינמי

עיגון התשובה באמצעות חיפוש Google

REST

תשובה

Python

דוגמה ליצירת תשובה בשיחה חד-תורית שמבוססת על חיפוש Google

REST

תשובה

יצירת תשובה שמבוססת על מקורות בכמה תורות

REST

תשובה

תשובה

דוגמה ליצירת תשובה רב-שלבית

REST

תשובה

תשובה

הצגת תשובות מבוססות באופן שוטף

REST

תשובה

Python

דוגמה לשידור תשובות מבוססות

REST

תשובה

המאמרים הבאים

יצירת תשובות שמבוססות על מקורות באמצעות RAG