You can make API calls directly to Sessions and Memory Bank using the Agent Platform SDK. Use the SDK directly if you don't want an agent framework to orchestrate calls for you, or if you want to integrate Sessions and Memory Bank with agent frameworks other than Agent Development Kit (ADK).
This document shows how to create, upload, retrieve, and remove memories using API calls.
For the quickstart using ADK, see Memory Bank quickstart with ADK.
Before you begin
To complete the steps demonstrated in this tutorial, you must first follow the steps in Set up for Memory Bank.
Generate memories with Agent Platform Sessions
After setting up Agent Platform Sessions and Memory Bank, you can create sessions and append events to them. Memories are generated as facts from the user's conversation with the agent so that they're available for future user interactions. For more information, see Generate memories and Fetch memories.
Create a session with an opaque user ID. Any memories generated from this session are automatically keyed by the scope {"user_id": "USER_ID"} unless you explicitly provide a scope when generating memories.

import vertexai

client = vertexai.Client(
    project="PROJECT_ID",
    location="LOCATION"
)

# This assumes that you already have an Agent Platform instance. If you don't,
# you can create one using `agent_engine = client.agent_engines.create()`.
session = client.agent_engines.sessions.create(
    # The name can be fetched using `agent_engine.api_resource.name`.
    name="AGENT_ENGINE_NAME",
    user_id="USER_ID"
)

Replace the following:
PROJECT_ID: Your project ID.
LOCATION: Your region. See the supported regions for Memory Bank.
AGENT_ENGINE_NAME: The name of the Agent Platform instance that you created or an existing Agent Platform instance. The name should be in the following format: projects/{your project}/locations/{your location}/reasoningEngines/{your reasoning engine}.
USER_ID: An identifier for your user. Any memories generated from this session are automatically keyed by the scope {"user_id": "USER_ID"} unless you explicitly provide a scope when generating memories.
Iteratively upload events to your session. Events can include any interactions between your user, agent, and tools. The ordered list of events represents your session's conversation history. This conversation history is used as the source material for generating memories for that particular user.
import datetime

client.agent_engines.sessions.events.append(
    name=session.response.name,
    author="user",  # Required by Sessions.
    invocation_id="1",  # Required by Sessions.
    timestamp=datetime.datetime.now(tz=datetime.timezone.utc),  # Required by Sessions.
    config={
        "content": {
            "role": "user",
            "parts": [{"text": "hello"}]
        }
    }
)

To generate memories from your conversation history, trigger a memory generation request for the session:

client.agent_engines.memories.generate(
    name=agent_engine.api_resource.name,
    vertex_session_source={
        # `session` should have the format
        # "projects/.../locations/.../reasoningEngines/.../sessions/...".
        "session": session.response.name
    },
    # Optional when using Sessions. Defaults to {"user_id": session.user_id}.
    scope=SCOPE
)
Replace the following:
- (Optional) SCOPE: A dictionary representing the scope of the generated memories, with a maximum of 5 key-value pairs and no * characters. For example, {"session_id": "MY_SESSION"}. Only memories with the same scope are considered for consolidation. If not provided, {"user_id": session.user_id} is used.
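The scope constraints described above (at most 5 key-value pairs, no * characters) can be checked client-side before a request is sent. The following helper is an illustrative sketch, not part of the SDK:

```python
def validate_scope(scope: dict) -> dict:
    """Check a Memory Bank scope dict against the documented constraints.

    Illustrative helper (not an SDK function): a scope may have at most
    5 key-value pairs and must not contain the `*` character.
    """
    if len(scope) > 5:
        raise ValueError(f"Scope has {len(scope)} entries; maximum is 5.")
    for key, value in scope.items():
        if "*" in key or "*" in str(value):
            raise ValueError(f"Scope entry {key!r}: {value!r} contains '*'.")
    return scope

# A valid scope passes through unchanged.
validate_scope({"session_id": "MY_SESSION"})
```

Calling this before `memories.generate` surfaces an invalid scope locally instead of as an API error.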
Upload memories
As an alternative to generating memories from raw dialogue, you can upload memories or have your agents add them directly using GenerateMemories with pre-extracted facts. Rather than Memory Bank extracting information from your content, you provide the facts that should be stored about your user directly.

To ensure consistency with generated memories, try to write pre-extracted facts in the same perspective that you've configured for the given scope. By default, memories are generated in the first-person perspective (for example, "I am a software engineer").
client.agent_engines.memories.generate(
name=agent_engine.api_resource.name,
direct_memories_source={"direct_memories": [{"fact": "FACT"}]},
scope=SCOPE
)
Replace the following:
FACT: The pre-extracted fact that should be consolidated with existing memories. You can provide up to 5 pre-extracted facts in a list like the following: {"direct_memories": [{"fact": "fact 1"}, {"fact": "fact 2"}]}
SCOPE: A dictionary representing the scope of the generated memories. For example, {"session_id": "MY_SESSION"}. Only memories with the same scope are considered for consolidation.
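As a sketch, the `direct_memories_source` payload can be assembled from a plain list of fact strings while enforcing the five-fact limit described above. This helper is illustrative, not an SDK function:

```python
def build_direct_memories_source(facts: list[str]) -> dict:
    """Build a `direct_memories_source` payload from pre-extracted facts.

    Illustrative helper (not part of the SDK): at most 5 facts may be
    provided per GenerateMemories request.
    """
    if not facts:
        raise ValueError("Provide at least one fact.")
    if len(facts) > 5:
        raise ValueError(f"Got {len(facts)} facts; maximum is 5.")
    return {"direct_memories": [{"fact": fact} for fact in facts]}

payload = build_direct_memories_source(["fact 1", "fact 2"])
# payload == {"direct_memories": [{"fact": "fact 1"}, {"fact": "fact 2"}]}
```

The resulting dictionary can be passed as the `direct_memories_source` argument to `memories.generate`.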
Alternatively, you can use CreateMemory to upload memories without using
Memory Bank for either memory extraction or consolidation.
memory = client.agent_engines.memories.create(
name=agent_engine.api_resource.name,
fact="This is a fact.",
scope={"user_id": "123"}
)
"""
Returns an AgentEngineMemoryOperation containing the created Memory like:
AgentEngineMemoryOperation(
done=True,
metadata={
"@type": "type.googleapis.com/google.cloud.aiplatform.v1beta1.CreateMemoryOperationMetadata",
"genericMetadata": {
"createTime": "2025-06-26T01:15:29.027360Z",
"updateTime": "2025-06-26T01:15:29.027360Z"
}
},
name="projects/.../locations/us-central1/reasoningEngines/.../memories/.../operations/...",
response=Memory(
create_time=datetime.datetime(2025, 6, 26, 1, 15, 29, 27360, tzinfo=TzInfo(UTC)),
fact="This is a fact.",
name="projects/.../locations/us-central1/reasoningEngines/.../memories/...",
scope={
"user_id": "123"
},
update_time=datetime.datetime(2025, 6, 26, 1, 15, 29, 27360, tzinfo=TzInfo(UTC))
)
)
"""
Retrieve and use memories
You can retrieve memories for your user and include them in your system instructions to give the LLM access to your personalized context.
For more information about retrieving memories using a scope-based method, see Fetch memories.
# Retrieve all memories for User ID 123.
retrieved_memories = list(
client.agent_engines.memories.retrieve(
name=agent_engine.api_resource.name,
scope={"user_id": "123"}
)
)
You can use Jinja to convert your structured memories into a prompt:
from jinja2 import Template
template = Template("""
<MEMORIES>
Here is some information about the user:
{% for retrieved_memory in data %}* {{ retrieved_memory.memory.fact }}
{% endfor %}</MEMORIES>
""")
prompt = template.render(data=retrieved_memories)
"""
Output:
<MEMORIES>
Here is some information about the user:
* This is a fact.
</MEMORIES>
"""
Remove memories
There are multiple ways to delete memories from your Memory Bank instance depending on how you want to select which memories should be removed.
Remove by resource name
If you know exactly which memory resource you want to remove, you can delete a specific memory using its resource name:
client.agent_engines.memories.delete(
name=MEMORY_NAME,
config={
# Set to false (default) if you want to delete the memory asynchronously.
"wait_for_completion": True
}
)
Replace the following:
- MEMORY_NAME: The name of the Memory to delete. The name should be in the following format: projects/{your project}/locations/{your location}/reasoningEngines/{your reasoning engine}/memories/{your memory}. You can find the Memory name by fetching memories.
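If memory names come from configuration or user input, you can sanity-check the resource-name format before calling delete. A sketch using a regular expression (illustrative, not an SDK utility):

```python
import re

# Documented Memory resource-name format:
# projects/{project}/locations/{location}/reasoningEngines/{engine}/memories/{memory}
_MEMORY_NAME_PATTERN = re.compile(
    r"^projects/[^/]+/locations/[^/]+/reasoningEngines/[^/]+/memories/[^/]+$"
)

def is_valid_memory_name(name: str) -> bool:
    """Return True if `name` matches the documented Memory resource format."""
    return bool(_MEMORY_NAME_PATTERN.match(name))

is_valid_memory_name(
    "projects/my-project/locations/us-central1/reasoningEngines/123/memories/456"
)  # True
```

Rejecting malformed names locally avoids a round trip that would fail with a NOT_FOUND or INVALID_ARGUMENT error.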
Remove by criteria
You can use criteria-based deletion to remove one or more memories. Only
memories that match the provided filters will be deleted. You must specify at
least one of filter (applied to system fields) or filter_groups (applied to
metadata fields).
operation = client.agent_engines.memories.purge(
name=agent_engine.api_resource.name,
# Specify at least one of `filter` or `filter_groups`.
filter="FILTER_STRING",
filter_groups=FILTER_GROUPS,
# Set to false (default) if you want to stage but not execute the purge operation.
force=True,
config={
# Set to false (default) if you want to purge memories asynchronously.
"wait_for_completion": True
}
)
Replace the following:
- FILTER_STRING: A string using EBNF syntax for filtering against system fields. System fields include create_time, update_time, fact, and topics. For more information about filtering against system fields, see the Filter by system fields section on the Fetch memories page.
- FILTER_GROUPS: A list of dictionaries or objects for filtering against memory metadata. For more information about filtering against metadata fields, see the Filter by metadata fields section on the Fetch memories page.
The operation will return a count of how many memories were purged (if
force=True) or would be purged if the operation was executed (if
force=False).
print(operation.response.purge_count)
For example, you can purge all memories that belong to a scope for user_id "123":
operation = client.agent_engines.memories.purge(
    name=agent_engine.api_resource.name,
    filter="scope.user_id=\"123\"",
    force=True
)
Remove by semantic meaning
During memory generation,
Memory Bank will decide whether to create, update, or delete
memories based on the content of the newly extracted information and the
existing memories. A memory may be deleted if the new information contradicts it
or if the extracted content instructs Memory Bank to forget a
subject (in the case of the EXPLICIT_INSTRUCTIONS memory
topic).
For example, the following request would delete existing memories that contain
information about dietary preferences, if they exist for the given scope:
from google import genai
client.agent_engines.memories.generate(
name=agent_engine.api_resource.name,
direct_contents_source={
"events": [{
"content": genai.types.Content(
role="user",
parts=[
genai.types.Part.from_text(text="Forget my dietary preferences.")
]
)
}]
},
scope={...}
)
Clean up
To clean up all resources used in this project, you can delete the Google Cloud project you used for the quickstart.
Otherwise, you can delete the individual resources you created in this tutorial, as follows:
Use the following code sample to delete the Agent Platform instance, which also deletes any sessions or memories associated with the Agent Platform instance.
agent_engine.delete(force=True)

Delete any locally created files.