Senior Cloud & Container Infrastructure Engineer
- Embrace Excellence: We strive for best-in-class delivery of innovation and service.
- Be Accountable: Integrity, ownership and accountability are non-negotiable.
- Adventure Together: We are committed to fostering a culture that embraces continuous improvement.
- Succeed as a Team: We believe harnessing the power of a team drives outcomes not achievable by individuals.
- Boundaries and Balance: Work-life balance is a core facet of our culture.
- Design, deploy, and operate containerized workloads on GKE across enterprise-scale environments.
- Manage GCP compute resources (Compute Engine, Cloud Run, GKE Autopilot) for high availability and cost efficiency.
- Operate and scale Weaviate vector database clusters to support production AI and semantic search workloads
- Optimize indexing, query performance, and storage configurations as data volumes grow
- Collaborate with AI/ML teams to define schema strategies and ingestion pipelines
- Build and maintain monitoring dashboards and alerting pipelines using Grafana
- Integrate LLM observability tooling (LangFuse / LangSmith) to track model performance, latency, and usage across AI services
- Drive incident response, root cause analysis, and continuous reliability improvements
- Implement infrastructure-as-code (Terraform / Deployment Manager) for reproducible, auditable deployments and CI/CD integration.
- Define and enforce multitenant GKE architecture: cluster security, namespace/tenant isolation, RBAC, network policies, maintenance, and scaling.
- Mentor engineers and drive platform adoption and best practices.
- Automate end-to-end provisioning, deployment pipelines, and day-2 operations using CI/CD tools (Cloud Build, GitHub Actions, ArgoCD, etc.)
- Design and implement observability stacks using Google Cloud Operations Suite (formerly Stackdriver), Prometheus/Grafana, Cloud Logging, Cloud Monitoring, and distributed tracing (Cloud Trace)
- Troubleshoot complex production issues spanning compute, networking, storage, and Kubernetes layers
- 6+ years of hands-on experience building and operating production cloud infrastructure
- 4+ years of deep, production experience with GCP, particularly in a senior or lead capacity
- 3+ years of strong expertise with Kubernetes in production (preferably GKE), including cluster design, upgrades, troubleshooting, and scaling
- Expert-level proficiency with Terraform for GCP infrastructure provisioning
- Strong experience with container technologies: Docker, container registries (Artifact Registry), container security scanning
- Solid understanding of GCP core services: Compute Engine, Cloud Run, Cloud SQL / AlloyDB, Cloud Storage, BigQuery, Pub/Sub, Cloud Functions, VPC, Cloud Load Balancing, Cloud Interconnect
- Experience implementing secure IAM strategies, organization policies, and security controls in GCP
- Proficiency in Linux systems administration, networking fundamentals, and scripting (Bash, Python, Go preferred)
- Experience with modern CI/CD and GitOps practices in cloud environment
- Experience supporting or using HPC environments leveraging SLUR
- Containerization/orchestration (Docker, Kubernetes/GKE)
- Strong understanding of data governance, cataloging, and lineage tools; basic familiarity with regulated environments (GxP, HIPAA).
- Experience assessing existing code and workflows and identifying bottlenecks and optimization opportunities
- Experience in software requirements gathering, documentation, design, and development
- Google Cloud Professional certifications (e.g., Professional Cloud Architect, Professional Cloud DevOps Engineer, Professional Kubernetes Engineer)
- Experience with Anthos, Config Management, Policy Controller, or multi-cluster management
- Familiarity with service mesh (Istio/Envoy), ingress controllers (GKE Gateway API / Ingress), and microservices observability
- A competitive salary and bonus package based on experience
- Comprehensive health and wellness benefits, including Medical, Dental, and Vision Insurance
- Company-provided Life and Long-Term Disability Insurance
- Company-sponsored 401(k) Plan
- Company-provided continuing education benefit
- Team-focused culture and unlimited opportunity for advancement
Recommended Jobs
Director of Visual and Performing Arts
Job Description Job Description Description: ABOUT SEWICKLEY ACADEMY Distinguished by its rigorous academics, outstanding faculty, and highly motivated student body, Sewickley Academy is Pi…
Sales Representative - SIP Trunking (VoIP / FoIP)
Job Description Job Description Salary: T38Fax is seeking a tech-savvy sales professional with experience in the telecom industry to help us grow our popular T.38 Fax Over IP (FoIP) service. O…
Office Assistant
Busy Cranberry Township Chiropractic office seeks people loving, health minded person for Office Assistant position. Pay starts at $15 to $22 per hour, plus bonus based on qualifications. Must be …
Dog Kennel Attendant
Dog Kennel Attendant (Part-Time) $14.00-16.00 per hour. Overview: Camp Bow Wow Highland Park is looking for a few dog lovers to join our growing pack! Full and Part Time available. Camp Bow …
Experienced HVAC Install Technician - Up to $25 base
Job Description Job Description Are you an HVAC Install Technician who is passionate about problem solving and customer service? Looking to accelerate your career (and income!) with an organizati…
IT Field Services Technician
Come join a growing organization as we pursue towards our growth plans. This opportunity will give the right individual the customer exposure and experience desired to accelerate their career. Apply …
Regional Director, Northeast
Summary The National Park Service (NPS) Northeast Region encompasses 86 parks spanning from Maine to Virginia. The region contains one-third of all NPS museum collections, one-fourth of all his…