Senior Distributed Systems Engineer (HPC Platform)

Brak informacji o wynagrodzeniu
SeniorFull-time
#346813·Dodano dziś·0
Źródło: nofluffjobs.com
Aplikuj teraz

Tech Stack / Keywords

Distributed computingRustRabbitMQAWSApache PulsarCUDARDMAGPURuntime APIsThrust

Firma i stanowisko

We are looking for a Senior Distributed Systems Engineer to design and build core backend services for a high-performance distributed computing platform. This role focuses on developing resilient, high-throughput infrastructure that orchestrates workloads across CPU and GPU nodes, working at the intersection of distributed systems, high-performance computing, and modern backend engineering.


Wymagania

  • Strong experience in backend development with Rust
  • Solid understanding of distributed systems architecture
  • Hands-on experience with message queues (e.g., Apache Pulsar, RabbitMQ)
  • Experience designing and building gRPC-based APIs / service-oriented architectures
  • Experience with AWS or similar cloud platforms
  • Strong problem-solving skills and ability to work with complex systems

Nice to have:

  • Experience with high-performance networking (e.g., RDMA, libfabric)
  • Familiarity with high-performance storage systems (e.g., Lustre)
  • Understanding of GPU architecture and memory management
  • Experience with CUDA ecosystem (Runtime APIs, Thrust, CUB, PTX)
  • Knowledge of LLVM / compiler toolchains

Obowiązki

  • Design and build core backend services for a high-performance distributed computing platform
  • Develop resilient, high-throughput infrastructure to orchestrate workloads across CPU and GPU nodes
  • Build scalable systems from the ground up using cutting-edge technologies
Itransition

Itransition

6 aktywnych ofert

Zobacz wszystkie oferty
Aplikuj teraz