

What You’ll Do:
Run and optimize production Kubernetes clusters (deployments, networking, autoscaling, upgrades).
Own observability: metrics, logs, traces, alerting, dashboards, SLOs.
Operate S3-compatible storage (e.g. S3/MinIO): security, lifecycle, performance.
Manage caching (Redis/Memcached) and message queues (Kafka/RabbitMQ/NATS or similar) with a focus on reliability and throughput.
Implement security and hardening across Kubernetes, storage, caching and MQ (RBAC, network policies, secrets, TLS).
Troubleshoot and design networking (DNS, load-balancing, ingress, L3–L7).
Collaborate with dev teams in an agile environment and help define platform best practices.
What We’re Looking For
5+ years in DevOps / SRE / Platform Engineering with real production ownership.
Strong hands-on experience with Kubernetes in production.
Solid background in observability tooling (Prometheus/Grafana/ELK/Otel or similar).
Practical experience with S3 storage, caching systems, and message queues in production.
Good understanding of networking (TCP/IP, DNS, HTTP, TLS) and security (RBAC, encryption, hardening).
Comfortable with Linux, scripting, IaC (Terraform/Helm/Kustomize) and CI/CD.
Why Join Us
Enterprise-scale problems with agile decision-making and short feedback loops.
Strong engineering culture: automation first, blameless postmortems, knowledge sharing.
Real ownership over the platform and room to influence architecture and tooling
ثبت مشکل و تخلف آگهی
ارسال رزومه برای آریا همراه سامانه