Use GKE and Cloud Run together

Autopilot Standard

This guide is designed for infrastructure administrators and application operators who run a diverse set of containerized workloads, and want to leverage the strengths of both Google Kubernetes Engine (GKE) and Cloud Run to deploy applications to the Google Cloud Platform. A hybrid strategy empowers you to optimize costs, performance, and management overhead.

You should have familiarity with:

Kubernetes concepts
Stateless and stateful workloads

Why use GKE and Cloud Run together?

GKE and Cloud Run offer different advantages for running containerized applications, and cater to different levels of workload complexity. However, you don't need to choose between the two platforms. You can simultaneously leverage the strengths of both GKE and Cloud Run by migrating your workloads between the two platforms as the need arises.

The following are some benefits of using both runtimes to deploy your workloads:

GKE and Cloud Run offer a relatively high level of portability:
- Both platforms use standard container images as deployment artifacts. You can use the same image for your application in either platform without any modifications, thus enabling seamless migration of workloads between GKE and Cloud Run. You don't need to update your continuous integration setup to migrate between GKE and Cloud Run as long as container images are stored in Artifact Registry.
- GKE and Cloud Run both use a declarative API model. The Cloud Run Admin API v1 is designed to be compatible with the Kubernetes API. This means you can use familiar Kubernetes concepts like Deployments, Services, and horizontal Pod autoscalers to manage your Cloud Run service. This similarity makes it easier to translate configurations between the two platforms.
- Resources are represented in YAML files with the same declarative and standard structure, and can therefore easily be migrated between runtimes. Here's an example comparing the YAML files of a Kubernetes deployment and a Cloud Run service.
Both GKE and Cloud Run integrate seamlessly with Cloud Logging and Cloud Monitoring, providing you with a centralized view on the Google Cloud console to observe application metrics regardless of their platform. You can also use service-level objectives (SLO) monitoring on both platforms, and view a unified display of the SLOs on the Cloud Monitoring dashboard.
You can implement continuous delivery to either GKE resources or Cloud Run services by using Cloud Deploy. Or, if you prefer, simultaneously deploy your application to both GKE and Cloud Run using parallel deployment.
You can facilitate advanced traffic management by using external and internal load balancers for services on GKE and Cloud Run. This includes the ability to expose external endpoints so that you can deploy and run different URLs for the same application across both platforms. You can also split traffic to the same service across GKE and Cloud Run, enabling a seamless migration from one platform to another.
Google Cloud provides security tools to improve your security posture when using both runtimes. OS scanning lets you scan containers for vulnerabilities before deploying to either platform. A central Binary Authorization policy can enforce integration with the GKE and Cloud Run control plane to allow or block image deployment based on the policies that you define. With VPC Service Controls, security teams can define fine-grained perimeter controls across your GKE and Cloud Run resources.

Compare GKE and Cloud Run

To take advantage of the best features of GKE and Cloud Run, and know when to move workloads between them, it's important to understand how the two services differ from one another.

Feature	GKE	Cloud Run
Deployment and management	Manage the Kubernetes clusters, including node configuration, networking, scaling, and upgrades. Google Cloud manages the underlying infrastructure and provides tools for simplifying cluster operations, but you're still responsible for the core Kubernetes aspects.	Run containers directly on top of Google Cloud's scalable infrastructure. All you have to do is provide source code or a container image, and Cloud Run can build the container for you. You don't have to create a cluster, or provision and manage infrastructure.
Control and flexibility	Full control over the Kubernetes cluster. You can create advanced customizations of node configurations, network policies, security settings, and add-ons.	Limited control over the underlying infrastructure. You can configure things like environment variables, concurrency and network connections, but you can't customize the underlying infrastructure or environment. Ideal for simplicity and speed.
Application types	Supports both stateless and stateful applications, and is ideal for complex applications with specific resource needs.	Best suited for stateless, request or event-driven services, web services, and functions.
Pricing model	Pay-per-cluster per hour, irrespective of the mode of operation (Standard or Autopilot), cluster size or topology.	Pay-per-use, rounded up to the nearest 100 millisecond.

Use case

Consider that you are a platform administrator of a retail company building a large ecommerce platform. You have the following containerized workloads to deploy:

Frontend website and mobile app: A stateless web application handling product browsing, search, and checkout.
Product inventory management: A stateful service managing product availability and updates.
Recommendation engine: A complex microservice generating personalized product recommendations for each user.
Batch processing jobs: Includes periodic tasks like updating product catalogs and analyzing user behavior.

These workloads represent a mix of stateless and stateful services, so you decide to take advantage of both GKE and Cloud Run for optimal performance. Here's one way you can implement a hybrid approach for your workloads.

After reading the criteria for suitability of Cloud Run workloads, you decide to use Cloud Run for the website and mobile app, and the batch processing jobs. Deploying these services on Cloud Run has the following benefits:
- Automatic scaling as traffic spikes and large batch jobs are handled without manual intervention.
- Cost efficiency with a pay-per-use model. You only pay when users are browsing or checking out, and when resources are used during batch job execution.
- Faster deployments as updates are available instantly, improving user experience.
- Easy integration with other Google Cloud services. For example, for event-driven processing, you can use Cloud Functions to initiate batch processing jobs on Cloud Run, and enable seamless workflows.
Product inventory management is a stateful service that requires fine-grained control and potentially custom storage solutions. You decide to use GKE to deploy this service as it offers persistent storage and lets you attach volumes for product data persistence and reliability.
The recommendation engine is a complex microservice that benefits from GKE. With GKE, you can manage complex dependencies, and exercise fine-grained control over resource allocation and scaling.

GKE is best suited for complex microservices architectures, stateful applications, workloads requiring custom infrastructure or network configurations, and scenarios where deep control over Kubernetes is essential. Cloud Run is best suited for event-driven apps. It's ideal for stateless web services, APIs, batch jobs, and other workloads that benefit from pay-per-use pricing.

The preceding example demonstrates how combining GKE and Cloud Run can provide a powerful and flexible solution for your ecommerce platform. You gain the benefits of both platforms; serverless efficiency for stateless workloads, and Kubernetes control for complex microservices and stateful components.

Considerations

GKE and Cloud Run complement each other, addressing different needs within a complex application landscape.

The following are some considerations when adopting a hybrid approach to deploying workloads:

Run stateless microservices on Cloud Run for cost efficiency and scalability.
Deploy complex stateful applications requiring deep customization on GKE.
If you use a private network on Google Cloud, to access resources in your GKE cluster from your Cloud Run service, you can send a request to a Virtual Private Cloud (VPC) network by using Direct VPC egress. To access the GKE cluster's Services, the Cloud Run service must be connected to the cluster's VPC network, and the GKE Service must use an internal passthrough Network Load Balancer.
To migrate traffic between Cloud Run and GKE, you can expose external endpoints behind a Global external Application Load Balancer. When you implement this load balancer in front of services in both runtimes, you're able to deploy the same application across both Cloud Run and GKE, enabling you to gradually shift traffic from one platform to the other.
To expose Cloud Run services in Virtual Private Cloud behind private IPs, use an internal load balancer.

Remember, if your workloads are already on Cloud Run, you can always migrate to GKE as needed.

When not to use GKE and Cloud Run together

While GKE and Cloud Run offer a compelling approach for many containerized workloads, there are situations where using them together might not be the best fit. Here are some examples when you might decide not to adopt a hybrid approach:

Tightly-coupled microservices: If your microservices are highly dependent on each other and require frequent, low-latency communication, managing them across separate platforms can introduce complexities. Frequent network calls between platforms could add overhead and potential bottlenecks, impacting performance.
Legacy applications with custom dependencies: If your application relies on specific libraries, frameworks, or configurations not supported by Cloud Run, using it for parts of the application might require significant refactoring or workarounds. This can negate the serverless benefits and introduce platform-specific maintenance overhead.
Budget constraints with predictable workloads: If your workload has consistent resource requirements and you're on a tight budget, GKE's pay-per-node model might be more cost-effective than Cloud Run's pay-per-use billing. If you have predictable workloads, you might not fully use the automatic scaling benefits of Cloud Run, making GKE's fixed cost more attractive.

The best approach ultimately depends on your specific needs and priorities. Carefully evaluate your application's requirements, resource constraints, and team expertise before deciding on a hybrid GKE and Cloud Run architecture.

What's next

Learn how to convert your Cloud Run service into a Kubernetes deployment in Migrate from Cloud Run to GKE.
Package your Kubernetes deployment into a Cloud Run-compatible container following Migrate from Kubernetes to Cloud Run.