Postúlate en Kit Empleo: kitempleo.com.ar/empleo/q12oq
The Role
Kraken is building a dedicated AI Compute and Infrastructure team to power the next generation of model training, inference, evaluation, and experimentation across the exchange. This team sits within engineering leadership and owns the infrastructure layer that lets Kraken run AI workloads with control, speed, reliability, and cost discipline.
The team is responsible for GPU and accelerator infrastructure, cluster operations, scheduling, model serving, observability, capacity planning, and cost‑efficient compute at scale. This is the backbone that allows Kraken to train, serve, evaluate, and iterate on AI systems in‑house where it matters for privacy, latency, reliability, cost, or product differentiation.
You will join a small, senior, high‑impact team working directly with AI/ML researchers, platform engineers, security teams, and product teams. The mandate is simple: make Kraken's AI ambitions real by building compute infrastructure that is fast, dependable, efficient, and production‑grade.
Responsibilities
- Own and operate GPU and accelerator clusters used for training, inference, evaluation, and experimentation, including drivers, runtimes, kernels, device plugins, node configuration, scheduling primitives, and workload isolation.
- Design infrastructure that enables Kraken teams to run models locally on GPUs where it is strategically and economically preferable, reducing unnecessary dependency on external providers and containing compute costs.
- Build and improve scheduling, orchestration, placement, quota management, and utilization systems across heterogeneous accelerator environments.
- Optimize inference pipelines for latency, throughput, reliability, memory efficiency, and cost using frameworks such as vLLM, Triton Inference Server, TensorRT, or equivalent serving stacks.
- Partner with ML engineers and researchers to remove bottlenecks in training, evaluation, batch inference, online inference, deployment, and production debugging workflows
Postúlate en Kit Empleo: kitempleo.com.ar/empleo/q12oq
📌 Senior AI Compute Infrastructure Engineer (Ezeiza)
🏢 kraken
📍 Ezeiza