Arjun Naik
Staff software engineer with 10+ years of experience building and operating platform infrastructure at scale, specializing in Kubernetes, Go, and cloud-native systems. Track record of leading cross-team technical initiatives, contributing to open source, and mentoring engineers.
Experience
Staff Software Engineer, Delivery Hero Aug 2022 – Present
- Led development of self-service deployment tooling used across the platform organization
- Designed and implemented an application configuration management platform
- Contributed upstream to AWS Controllers for Kubernetes for managing cloud resources via CRDs
- Integrated the in-house authentication and authorization solution into the service mesh
- Mentored engineers and drove system design across multiple domains
Senior Software Engineer (OpenShift), Red Hat Jun 2021 – Jul 2022
- Built Kubernetes operators extending the OpenShift Container Platform via CRDs and controllers
- Delivered features for networking, authentication, and cluster upgrade workflows
Senior Software Engineer (SRE), Red Hat Jul 2020 – May 2021
- Developed and standardized operators for managing OpenShift Dedicated clusters
- Improved Prometheus-based monitoring and alerting for OpenShift Dedicated cluster features
Senior Software Engineer (AWS and Kubernetes), Zalando SE Aug 2018 – Jun 2020
- Built fleet management tooling to operate and upgrade Kubernetes clusters serving hundreds of feature teams
- Built open source tooling for blue-green deployments, pre-scaling, and rollbacks
- Contributed autoscaling improvements to upstream Kubernetes (VPA and HPA)
- Designed CI/CD integration enabling teams to deploy with minimal configuration
Site Reliability Engineer, Cubits Jan 2018 – May 2018
- Migrated production stack from bare metal to GKE, containerizing existing services
Software Engineer (SRE), Zalando SE Aug 2015 – Dec 2017
- Supported feature teams migrating from a monolith to cloud-native microservices
- Performed load testing and capacity planning to validate scaling requirements
- Built tooling to manage Elasticsearch and Cassandra clusters for the Recommendations team
- Defined and maintained SLOs for personalized recommendation services
Research Assistant, TU Dresden Mar 2014 – Dec 2014
- Developed a C-to-x86 translator adding instruction-level redundancy for fault detection (SREX project)
- Benchmarked consensus systems (ZooKeeper, Consul) and contributed features to ZooKeeper client libraries
Earlier Experience
- Application Developer, DMFS (2013–2014)
- Web and Mobile Developer, Locus Labs (2011–2012)
- Web Developer, Netbramha Studios (2010–2011)
Education
MSc Distributed Systems Engineering
Technische Universität Dresden Oct 2012 – Mar 2015
Thesis on scaling ZooKeeper through namespace partitioning to increase throughput. Coursework in security, cryptography, and software fault tolerance.
BEng Computer Science and Engineering
BMS College of Engineering Jul 2006 – Apr 2010
Skills
- Languages: Go, Python, Java, C/C++
- Infrastructure: Kubernetes, AWS, GCP, Terraform, Helm
- Observability: Prometheus, monitoring and alerting design, SLO definition
- Practices: System design, cross-team technical leadership, mentoring, open source contribution
- Other: Linux internals (containers, networking, cgroups), CI/CD pipeline design, REST APIs