Job Description
About Us – Magnetic
Magnetic is building next-generation agentic AI systems designed to manage complex workflows across supply chains and procurement—a $3 trillion global industry. Our mission is to strengthen the resilience of global manufacturing supply chains through safe, ethical, and effective deployment of generative AI.
Backed by leading venture capital and led by experts from Open AI, NASA, and McKinsey, we are pioneering the future of intelligent systems with a strong commitment to human-centric, responsible AI development.
The Role
We’re looking for experienced cloud infrastructure engineers to design, build, and operate the core platform that powers our AI-driven products. You’ll be instrumental in delivering scalable, secure, and cost-effective infrastructure, helping deploy our agentic systems to some of the world’s largest supply chain organizations.
Key Responsibilities
-
Cloud Platform Design: Build and evolve a multi-cloud infrastructure (AWS & Azure) using Infrastructure-as-Code and Getup’s best practices.
-
Kubernetes Management: Maintain scalable Kubernetes clusters (EKS), ensuring secure networking, service mesh integration, policy enforcement, and developer-friendly tooling.
-
CI/CD Pipelines: Develop automated pipelines to deploy containerized services with testing, security scanning, and progressive rollout strategies.
-
Observability: Implement robust monitoring, tracing, logging, and alerting systems using tools like Prometheus, Open Telemetry, and Graafian.
-
Platform Security: Manage secrets, enforce least-privilege IAM, implement zero-trust networking, and ensure compliance with audit-ready logging.
-
Collaboration: Work closely with AI and product teams to support large-scale model deployment and data workflows.
-
Reliability & Cost Efficiency: Drive continuous improvement through performance tuning, chaos testing, capacity planning, and incident response.
What We’re Looking For
You might be a great fit if you have deep technical experience in most of the following:
Cloud & Infrastructure
-
Proficiency with AWS and/or Azure (IAM, networking, compute, storage).
-
Strong experience with Kubernetes (EKS/GKE/AKS), Docker, Helm.
-
Skilled with Infrastructure as Code (Terraform), Git Ops tools (Argo  CD, Flux), and policy-as-code (OPA, Contest).
DevOps & Security
-
Expertise in CI/CD systems (GitHub Actions, GitLab CI, Circle CI, etc.).
-
Knowledge of container security, image scanning, secrets management (Vault, AWS Secrets Manager).
-
Experience implementing access controls, audit logging, and compliance practices.
Observability
-
Familiar with Prometheus, Graafian, Loki, ELK stack, and Open Telemetry for system visibility and performance monitoring.
Software Engineering
-
Proficient in Python, Go, or Java with a focus on clean, testable, high-performance code.
-
Experienced in distributed system design, REST/graph APIs, and asynchronous processing (Celery, Kafka, SQS).
-
Knowledgeable about RDBMS (PostgreSQL, MySQL) and NoSQL (DynamoDB, Radis, Cassandra).
Developer Experience
-
Passion for building internal tools and developer platforms that improve team velocity and system reliability.
Benefits
-
Equity: Meaningful ownership in Magnetics’ success.
-
Flexible Work: Hybrid setup with remote flexibility.
-
Annual Retreat: Fully-funded offsite to connect, recharge, and align.
-
Growth: Work on ambitious challenges with a world-class team and cutting-edge technologies.
Diversity & Inclusion
Magnetic is an equal opportunity employer. We are committed to creating a diverse, inclusive workplace and strongly encourage applicants from all backgrounds and experiences to apply. We make all employment decisions based on qualifications, merit, and business needs.
If you require any accommodations during the recruitment process, please let us know. We’re happy to help.
Join Us
If you’re excited about building robust infrastructure to power next-gen AI in global supply chains—and want to be part of a team that values innovation, impact, and responsibility—then we’d love to hear from you.