Site Reliability Engineer
21 000 - 24 000 PLN/ mies.
MidFull-time
#326800·Dodano 19 dni temu·25
Źródło: LinkGroupTech Stack / Keywords
AIAzure DevOpsKubernetesDatadogAzureLLMCI/CD
Firma i stanowisko
We are looking for a Senior Site Reliability Engineer who will take end-to-end ownership of reliability for AI-driven applications and pipelines. This is a hands-on engineering role, not a coordination or ticket-driven position. The ideal candidate actively diagnoses, resolves, and automates production issues rather than only designing solutions.
Wymagania
- 5+ years as SRE / Production / Platform Engineer
- Strong incident management & RCA experience
- Hands-on with: Azure DevOps, Kubernetes, Datadog, Azure, CI/CD
- Proactive, ownership mindset, self-driven
- Experience in production environments
Nice to have:
- AI/LLM pipelines, Grafana
Obowiązki
- Build and maintain monitoring, alerting, dashboards
- Lead incident response & root cause analysis
- Ensure reliability and performance of AI pipelines
- Standardize telemetry (latency, failures, throughput)
- Optimize CI/CD and release quality
- Reduce recurring incidents with engineering teams
linkgroup
286 aktywnych ofert