#299683Dodano Invalid Date17źródło: Infogain
Infogain
Infogain

Network Operations Center Engineer

23 520 - 26 880 PLN(znormalizowane)
Doświadczenie

Mid

Lokalizacja

Tryb pracy

Zdalnie

Wymiar

Full-time

AWSAzureBashCloudDynatraceKubernetesPowershellPythonPrometheusTerraformDataDogGCPGKEPrometheus/Grafana

O ofercie

You will join an SRE-aligned operations team responsible for keeping a mission-critical, global cloud platform reliable, performant, and secure. The project focuses on 24/7 cloud operations, proactive monitoring, incident response, and continuous improvement of observability coverage across multi-region GCP environments. You will work closely with SRE, Cloud Engineering, and development teams to maintain high availability, support business continuity, and drive operational excellence.

Wymagania

  • Bachelor’s degree in Computer Science, Information Technology, or a related field (or equivalent professional experience).
  • 1–2 years of experience in a NOC, operations, or cloud infrastructure support role.
  • Strong understanding of cloud platforms and services (AWS, Azure, GCP).
  • Familiarity with container orchestration technologies (Kubernetes, GKE) and CI/CD pipelines.
  • Experience with monitoring and logging tools such as Datadog, Dynatrace, Prometheus, Grafana, ELK, CloudWatch, Splunk, Sumo Logic, New Relic, or similar solutions.
  • Proficiency in Linux/Unix environments.
  • Basic scripting or automation skills (Python, Bash, PowerShell) and/or Infrastructure as Code exposure (Terraform).
  • Strong communication skills, with the ability to document incidents and collaborate effectively.

Obowiązki

  • Monitor cloud infrastructure across multiple regions using advanced observability and monitoring tools.
  • Respond to alerts and incidents in real time; provide supporting data for root cause analysis and escalate issues when required.
  • Troubleshoot issues related to cloud networking, containers, storage, and APIs.
  • Maintain and continuously improve troubleshooting guides (TSGs), incident response procedures, and operational documentation.
  • Collaborate with SRE, Cloud Engineering, and development teams to resolve infrastructure and reliability issues.
  • Perform routine health checks across the cloud environment.
  • Perform routine patching and upgrades of observability and monitoring agents across the platform.
  • Ensure compliance with SLAs, security policies, and operational standards.
  • Participate in a 24/7 on-call rotation and support disaster recovery and business continuity activities.
  • Analyze performance metrics and provide recommendations for optimization and automation.

Benefity

  • Hybrid work model combining office & remote work
  • Attractively located office with collaboration spaces
  • Onsite parking space for employees
  • Referral program with financial bonus
  • Life Insurance
  • Budget for development (including language courses and others), clear career path with the possibility to gain experience in international environment
  • Access to internal Learning Platform with multiple trainings oriented for professional growth
  • Access to MyBenefit platform (Multisport included)
  • Team Building activities
  • Charity initiatives
  • Working environment promoting diversity and inclusion
  • Private medical care - Platinum Package

Inne informacje

Must possess a legal work permit in Poland