Platform Engineering: Building Golden Paths and Developer Self-Service Templates

Home › Blog › Platform Engineering: Building Golden Paths and Developer Self-Service Templates

Platform Engineering with Golden Paths

Platform Engineering golden paths represent the evolution of DevOps into a product-oriented discipline focused on developer self-service. Instead of requiring every team to configure their own CI/CD pipelines, infrastructure, monitoring, and security controls, platform teams build standardized “golden paths” — opinionated, pre-configured templates that embody organizational best practices and enable developers to ship features faster. In practice, the goal is to shrink the gap between “I have an idea” and “it is running in production” from weeks to hours.

This guide covers building an Internal Developer Platform (IDP) with golden path templates using Backstage, Crossplane, and ArgoCD. Moreover, you will learn how to balance standardization with flexibility, measure platform adoption, and evolve your golden paths based on developer feedback. Crucially, a golden path is a product, not a project — it never reaches a “done” state, because the underlying tools, compliance rules, and team needs keep shifting.

What Makes a Golden Path

A golden path is not a mandate — it is the path of least resistance that also happens to be the path of best practice. When creating a new microservice is as simple as filling out a form and clicking “Create,” developers naturally choose the platform’s golden path over hand-rolling their own infrastructure. Furthermore, golden paths encode security controls, observability standards, and compliance requirements so developers get them automatically. The distinction matters: mandates create resentment and shadow IT, whereas a genuinely easier path earns adoption voluntarily.

The key components of a golden path include service templates (scaffolding), infrastructure templates (cloud resources), CI/CD pipeline templates, observability templates (dashboards, alerts), and documentation templates. Each template should be opinionated but customizable — covering 80% of use cases out of the box while allowing escape hatches for the remaining 20%. Importantly, the escape hatch is not a failure of the golden path; rather, it is a signal that tells you which 20% might become the next golden path once enough teams need it.

Platform engineering and developer tools — Golden paths guide developers toward best practices by default

Platform Engineering: Backstage Service Catalog

Spotify’s Backstage serves as the frontend for most IDPs, providing a service catalog, software templates, and a plugin ecosystem. Therefore, it becomes the single pane of glass where developers discover services, create new ones, and access documentation. Because Backstage centralizes ownership metadata in catalog-info.yaml files, it also doubles as a living inventory — when an incident hits, on-call engineers can instantly find who owns a service, where its runbook lives, and which dashboards to open.

# backstage-template.yaml — Golden path service template
apiVersion: scaffolder.backstage.io/v1beta3
kind: Template
metadata:
  name: spring-boot-service
  title: Spring Boot Microservice
  description: |
    Creates a production-ready Spring Boot service with CI/CD,
    monitoring, and cloud infrastructure.
  tags:
    - java
    - spring-boot
    - recommended
spec:
  owner: platform-team
  type: service
  parameters:
    - title: Service Details
      required:
        - name
        - owner
        - description
      properties:
        name:
          title: Service Name
          type: string
          pattern: '^[a-z][a-z0-9-]*
		
		
	


          description: Lowercase with hyphens (e.g., order-service)
        owner:
          title: Team Owner
          type: string
          ui:field: OwnerPicker
          ui:options:
            catalogFilter:
              kind: Group
        description:
          title: Description
          type: string
          maxLength: 200
        tier:
          title: Service Tier
          type: string
          enum: [tier-1-critical, tier-2-important, tier-3-standard]
          default: tier-3-standard

    - title: Technical Configuration
      properties:
        javaVersion:
          title: Java Version
          type: string
          enum: ["21", "17"]
          default: "21"
        database:
          title: Database
          type: string
          enum: [postgresql, mysql, none]
          default: postgresql
        messaging:
          title: Message Broker
          type: string
          enum: [kafka, rabbitmq, none]
          default: none
        caching:
          title: Cache
          type: string
          enum: [redis, none]
          default: none

    - title: Infrastructure
      properties:
        environment:
          title: Environments
          type: array
          items:
            type: string
            enum: [dev, staging, production]
          default: [dev, staging, production]
        replicas:
          title: Production Replicas
          type: integer
          default: 3
          minimum: 2
          maximum: 10

  steps:
    # Step 1: Generate service code from template
    - id: fetch-skeleton
      name: Fetch Service Skeleton
      action: fetch:template
      input:
        url: ./skeleton
        values:
          name: ${{ parameters.name }}
          owner: ${{ parameters.owner }}
          javaVersion: ${{ parameters.javaVersion }}
          database: ${{ parameters.database }}

    # Step 2: Create GitHub repository
    - id: create-repo
      name: Create Repository
      action: publish:github
      input:
        repoUrl: github.com?owner=my-org&repo=${{ parameters.name }}
        description: ${{ parameters.description }}
        defaultBranch: main
        protectDefaultBranch: true
        requireCodeOwnerReviews: true

    # Step 3: Provision infrastructure via Crossplane
    - id: create-infrastructure
      name: Provision Infrastructure
      action: kubernetes:apply
      input:
        manifest:
          apiVersion: platform.myorg.io/v1alpha1
          kind: ServiceInfrastructure
          metadata:
            name: ${{ parameters.name }}
          spec:
            database: ${{ parameters.database }}
            messaging: ${{ parameters.messaging }}
            caching: ${{ parameters.caching }}
            environments: ${{ parameters.environment }}

    # Step 4: Register in Backstage catalog
    - id: register
      name: Register in Catalog
      action: catalog:register
      input:
        repoContentsUrl: ${{ steps.create-repo.output.repoContentsUrl }}
        catalogInfoPath: /catalog-info.yaml

  output:
    links:
      - title: Repository
        url: ${{ steps.create-repo.output.remoteUrl }}
      - title: Service in Catalog
        icon: catalog
        entityRef: ${{ steps.register.output.entityRef }}

Notice how the template encodes governance directly into the steps. For example, protectDefaultBranch and requireCodeOwnerReviews mean every service created via the golden path starts with branch protection enabled — there is no “we forgot to turn that on” failure mode. Similarly, the tier field is not cosmetic; downstream automation can read it to decide alerting thresholds, on-call escalation, and backup frequency. Consequently, a single dropdown choice cascades into dozens of correct defaults.

Designing Skeletons That Stay DRY

The hardest part of a service template is not the scaffolder YAML — it is the skeleton it generates. A naive skeleton copies a full Dockerfile, CI config, Helm chart, and linting rules into every repository, which means a single best-practice change requires a pull request against hundreds of repos. A better pattern is to keep the skeleton thin and delegate the heavy lifting to shared, versioned artifacts: a reusable GitHub Actions workflow referenced by tag, a base Docker image maintained by the platform team, and a Helm library chart. As a result, upgrading the base image or tightening a CI gate becomes a version bump rather than a fleet-wide migration.

That said, thin skeletons trade some transparency for maintainability. Developers can no longer read every line that runs their pipeline because it lives in a shared workflow. To compensate, document the contract clearly and pin versions so an upstream change never silently alters a team’s build. A common pattern in production teams is to expose a single platform-version input that teams bump on their own schedule, giving them control over when they absorb changes.

Infrastructure Templates with Crossplane

Additionally, Crossplane enables platform teams to define infrastructure abstractions that developers consume without needing cloud provider expertise. Composite Resource Definitions (XRDs) create simplified APIs over complex cloud resources. The mental shift here is significant: instead of handing developers raw Terraform modules and trusting them to wire up encryption, backups, and tagging correctly, you expose a tiny API — “I want a small PostgreSQL database” — and the Composition fills in the dozens of secure defaults behind the scenes.

# composition.yaml — Database infrastructure abstraction
apiVersion: apiextensions.crossplane.io/v1
kind: Composition
metadata:
  name: postgresql-standard
  labels:
    crossplane.io/xrd: xpostgresqlinstances.platform.myorg.io
spec:
  compositeTypeRef:
    apiVersion: platform.myorg.io/v1alpha1
    kind: XPostgreSQLInstance
  resources:
    - name: rds-instance
      base:
        apiVersion: rds.aws.upbound.io/v1beta1
        kind: Instance
        spec:
          forProvider:
            engine: postgres
            engineVersion: "16"
            instanceClass: db.t3.medium
            allocatedStorage: 50
            storageEncrypted: true
            deletionProtection: true
            autoMinorVersionUpgrade: true
            backupRetentionPeriod: 7
            monitoringInterval: 60
            performanceInsightsEnabled: true
            tags:
              managed-by: crossplane
              team: platform
      patches:
        - fromFieldPath: spec.parameters.size
          toFieldPath: spec.forProvider.instanceClass
          transforms:
            - type: map
              map:
                small: db.t3.medium
                medium: db.r6g.large
                large: db.r6g.2xlarge

    - name: parameter-group
      base:
        apiVersion: rds.aws.upbound.io/v1beta1
        kind: ParameterGroup
        spec:
          forProvider:
            family: postgres16
            parameter:
              - name: log_statement
                value: "mod"
              - name: log_min_duration_statement
                value: "1000"
              - name: shared_preload_libraries
                value: "pg_stat_statements"

This Composition is where compliance becomes invisible and automatic. Because storageEncrypted, deletionProtection, and a seven-day backupRetentionPeriod are baked in, a developer literally cannot provision an unencrypted, unbacked-up production database through the golden path. Meanwhile the size patch maps a friendly t-shirt size onto a specific instance class, so teams reason about “small/medium/large” rather than memorizing AWS instance taxonomy. When your cost or capacity assumptions change, you edit one map in one Composition and every future database inherits the new sizing.

Developer platform and self-service tools — Self-service infrastructure provisioning through platform abstractions

Closing the Loop with ArgoCD GitOps

Scaffolding a repo and provisioning a database still leaves a gap: how does the code actually reach the cluster? This is where ArgoCD completes the golden path. Rather than giving developers kubectl access to production, the template registers an ArgoCD Application that continuously reconciles the cluster against the Git repository. Therefore, the desired state lives in Git, deployments happen by merge, and drift is corrected automatically.

# argocd-application.yaml — Generated per service by the golden path
apiVersion: argoproj.io/v1alpha1
kind: Application
metadata:
  name: order-service
  namespace: argocd
spec:
  project: default
  source:
    repoURL: https://github.com/my-org/order-service.git
    targetRevision: main
    path: deploy/overlays/production
  destination:
    server: https://kubernetes.default.svc
    namespace: order-service
  syncPolicy:
    automated:
      prune: true       # delete resources removed from Git
      selfHeal: true    # revert manual cluster edits
    syncOptions:
      - CreateNamespace=true

With selfHeal enabled, a panicked manual kubectl edit during an incident is reverted within seconds, which forces every change through review and keeps the cluster honest. This GitOps wiring is the part developers most appreciate getting for free, because configuring ArgoCD projects, RBAC, and sync policies by hand is tedious and easy to get subtly wrong. For more on how these layers interact at runtime, our post on API gateway vs service mesh covers the traffic-routing side of the platform.

Measuring Platform Success

Consequently, platform engineering is a product discipline — you need metrics to understand adoption, satisfaction, and impact. The DORA metrics (deployment frequency, lead time, MTTR, change failure rate) improve when golden paths work well. Equally important is the leading indicator of “time to first deploy” for a brand-new service: if a developer can go from clicking “Create” to a running staging deployment in under an hour, your golden path is healthy; if it takes days, something in the path is broken and teams will route around it.

# Platform metrics to track
metrics:
  adoption:
    - golden_path_usage_percentage    # Target: > 80%
    - template_creation_count         # New services created via templates
    - escape_hatch_usage              # How often teams deviate from golden path
    - time_to_first_deploy            # How long from service creation to first deploy

  satisfaction:
    - developer_nps_score             # Net Promoter Score for the platform
    - support_ticket_volume           # Should decrease over time
    - self_service_success_rate       # % of requests completed without help

  impact:
    - deployment_frequency            # DORA metric
    - lead_time_for_changes           # DORA metric
    - mean_time_to_recovery           # DORA metric
    - change_failure_rate             # DORA metric
    - security_vulnerability_count    # Should decrease with golden paths
    - compliance_audit_findings       # Should decrease

Treat these numbers as a conversation starter rather than a scoreboard. A spike in escape_hatch_usage for one capability, for instance, is not a failure to punish — it is a backlog item telling you exactly what to template next. Likewise, rising support_ticket_volume around a particular step usually points to confusing documentation or a missing default rather than incompetent users.

When NOT to Use Platform Engineering

Small organizations (under 20 engineers) rarely benefit from building an IDP — the investment in platform tooling exceeds the productivity gains. A few well-documented runbooks and shared CI/CD templates achieve most of the benefit at a fraction of the cost. As a result, platform engineering makes economic sense when you have 5+ development teams creating similar services repeatedly. Below that threshold, the platform team itself becomes a bottleneck: every new feature funnels through a handful of people who could otherwise be shipping product.

Avoid building platforms in isolation from the developers who will use them. The most common failure mode is platform teams building what they think developers need rather than what developers actually need. Treat your platform as a product with user research, feedback loops, and iterative improvement. There are other anti-patterns worth naming explicitly. First, beware the “second-system” trap of trying to abstract every cloud service on day one — start with the single most repeated workflow and earn trust before expanding. Second, an overly rigid golden path with no escape hatch will simply be ignored, because developers always find a way around tools that block them. Third, a platform with no dedicated owners rots quickly; treat staffing the platform team as seriously as staffing any other product team. For a deeper look at safe rollout mechanics, see our guide on feature flags and trunk-based development.

Team collaboration and platform design — Platform engineering success requires close collaboration with developers

Key Takeaways

Platform Engineering golden paths transform how organizations deliver software by making best practices the default choice. Backstage provides the developer portal, Crossplane abstracts infrastructure complexity, and ArgoCD automates deployments — together they create a self-service platform that scales with your organization. Furthermore, measuring adoption and developer satisfaction ensures your platform evolves based on real needs.

Key Takeaways

Start with a solid foundation and build incrementally based on your requirements
Test thoroughly in staging before deploying to production environments
Monitor performance metrics and iterate based on real-world data
Follow security best practices and keep dependencies up to date
Document architectural decisions for future team members

Start by identifying the most common developer workflow in your organization and build a golden path for it. For deeper exploration, see the Backstage documentation and the Platform Engineering community. Our posts on API gateway vs service mesh and feature flags and trunk-based development complement your platform strategy.

In conclusion, Platform Engineering Golden Paths is an essential topic for modern software development. By applying the patterns and practices covered in this guide, you can build more robust, scalable, and maintainable systems. Start with the fundamentals, iterate on your implementation, and continuously measure results to ensure you are getting the most value from these approaches.