"Managed Service for Apache Spark" is the new name for the product formerly known as "Dataproc on Compute Engine" (cluster deployment) and "Google Cloud Serverless for Apache Spark" (serverless deployment).

Managed Service for Apache Spark documentation

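Managed Service for Apache Spark on clusters lets you take advantage of open source data tools for batch processing, querying, streaming, and machine learning. Its automation helps you create clusters quickly, manage them easily, and save money by turning clusters off when you don't need them. With less time and money spent on administration, you can focus on your jobs and your data.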

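Use serverless Managed Service for Apache Spark to run Spark batch workloads without provisioning and managing your own cluster. Specify the workload parameters, then submit the workload to the Managed Service for Apache Spark service. The service runs the workload on managed compute infrastructure and autoscales resources as needed. Charges apply only while the workload is executing.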
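The "specify parameters, then submit" flow can be sketched as building a request for a batch workload. This is a minimal illustration, not the service's documented procedure: it assumes the renamed product still exposes the Dataproc v1 `batches.create` REST method and its `pysparkBatch` fields, and the project ID, region, batch ID, and Cloud Storage path are hypothetical placeholders. The code only builds and prints the request; it does not send it or handle authentication.

```python
import json


def build_batch_request(project_id, region, batch_id, main_python_file_uri, args=None):
    """Build the URL and JSON body for a batch-creation request.

    Assumption: the endpoint and field names follow the Dataproc v1
    batches.create REST method; verify them against the current reference.
    """
    url = (
        "https://dataproc.googleapis.com/v1/"
        f"projects/{project_id}/locations/{region}/batches?batchId={batch_id}"
    )
    # Workload parameters: here, a PySpark batch with an optional argument list.
    body = {"pysparkBatch": {"mainPythonFileUri": main_python_file_uri}}
    if args:
        body["pysparkBatch"]["args"] = list(args)
    return url, body


# Hypothetical values for illustration only.
url, body = build_batch_request(
    project_id="example-project",
    region="us-central1",
    batch_id="example-batch",
    main_python_file_uri="gs://example-bucket/jobs/wordcount.py",
    args=["gs://example-bucket/input.txt"],
)
print(url)
print(json.dumps(body, indent=2))
```

Sending the request would require an authenticated POST with an OAuth 2.0 access token; in practice a client library or the gcloud CLI usually handles that step.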

Go to the Managed Service for Apache Spark product page to learn more.

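Get started for free

Start your proof of concept with $300 in free credit

Develop with our latest generative AI models and tools.
Get free usage of more than 20 popular products, including Compute Engine and AI APIs.
No automatic charges, no commitment.

View free product offers

Keep exploring with more than 20 products that offer "Always Free" usage.

Access more than 20 free products for common use cases, including AI APIs, virtual machines, data warehouses, and more.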

Documentation resources

Find quickstarts and guides, review key references, and get help with common issues.

Guides

Reference

Resources

Run Spark jobs on Google Kubernetes Engine

Introduction to Cloud Dataproc: Hadoop and Spark on Google Cloud

Machine learning with Spark on Dataproc

Workflow scheduling solutions

Migrate HDFS data from on-premises to Google Cloud

Manage Java and Scala dependencies for Apache Spark

Python API samples

Java API samples

Node.js API samples

Go API samples

Related videos