#300136 · Source: nofluffjobs.com
Graphcore

Senior Staff Engineer - Platform Telemetry

350 700 - 474 400 PLN (normalized)
Experience

Senior

Location

Gdańsk

Work mode

Hybrid

Employment type

Full-time

Redfish API, Python, Redshift, Open Compute, DMTF Standards, Datadog, Dynatrace, Splunk

About the offer

Graphcore is a company building the future of AI compute, with expertise in semiconductor, software, and AI. It is part of the SoftBank Group and delivers technology into the SoftBank AI ecosystem. The company is expanding globally to address AI infrastructure challenges.

Requirements

  • BSc or MSc degree in Computer Engineering, Computer Science, or related field, or equivalent experience.
  • Proven success in architecting and implementing scalable, performant, reliable cluster management systems including telemetry collection and analysis engines.
  • Good understanding of computer systems architecture (CPU, GPU, DPU, server platforms).
  • Experience with programming and debugging server platforms.
  • Expertise in in-band and out-of-band management architectures and associated tools.
  • Detailed knowledge and experience with Redfish APIs.
  • Experience with large-scale telemetry datasets, time series databases, down-sampling techniques, and creating actionable dashboards.
  • Strong skills in at least one of C, C++, Go, or Python.
  • Excellent written and verbal communication skills.

Nice to have:

  • 10+ years of relevant post-degree experience.
  • Familiarity with Open Compute (OCP).
  • Familiarity with DMTF standards and working groups.
  • Knowledge of data center networking and monitoring best practices.
  • Knowledge of commercial observability solutions like Datadog, Dynatrace, or Splunk.
  • Knowledge of monitoring, observability, and management solutions used by hyperscalers.
  • Knowledge of declarative management systems.

Responsibilities

  • Contribute to all phases of product development including definition, architecture, design, implementation, debugging, testing, and early customer support.
  • Design and implement fault-remediation solutions at scale.
  • Implement multi-component integrations based on Graphcore and third-party technology stacks, covering data ingestion to decision making.
  • Create reference designs including documentation, configuration files, scripts, and source code.
  • Deploy solutions internally to support engineering teams in debugging, performance analysis, benchmarking, and QA.
  • Ensure solutions are properly tested by collaborating with development and QA teams to enhance unit testing and test plans.
  • Mentor and guide junior engineers to foster continuous learning and improvement.

Benefits

  • Competitive salary.
  • Annual leave policy.
  • Medical and dental health plans.
  • Gym card.
  • Employee pension matched up to 4%.
  • Private healthcare.
  • International projects.
  • Team events.
  • Training budget.
  • Modern office with a stunning view.
  • Free coffee, snacks, beverages, and breakfast.
  • Gym and bike parking with shower.
  • Free parking.
  • Playroom.
  • Startup atmosphere.
  • No dress code.