To join its international team, Yoursafe is seeking to hire a
AI Engineer (LLM Infrastructure)
Full-Time Position
About Yoursafe
At Yoursafe, we believe that everyone deserves access to safe and easy financial services - wherever they are. Our mission is to build financial tools that empower people, especially those who are new to a country or outside the traditional banking system. We combine solid financial expertise with smart technology to make everyday money management simple, secure, and fair.
The Role
Behind our scalable Open Issuing platform and our push into 100 target countries, sits a highly advanced, bare-metal AI infrastructure. We are building a high-density, on-premise AI and automation environment consisting of cutting-edge heavy compute (Nvidia H200), high-performance enterprise storage, and a fleet of automated agents.
We are seeking a senior AI Engineer to take ownership of our LLM inference infrastructure. You will own the hardware and software running our AI models, ensuring that our internal automation agents and our growth teams have zero-latency, highly optimized access to state-of-the-art open-source LLMs. You will collaborate closely with our CEO, COO, Product Manager and Head of Growth to engineer the AI-driven pipelines required for large-scale content creation, localization and personalised agentic support.
This is a senior, hands-on role. You are expected to manage hardware, deploy models, optimize inference speeds, and scale what works.
Your responsibilities:
LLM Deployment & Optimization
· Own and manage our heavy-compute AI hardware, specifically optimizing workloads for our Nvidia HGX H200 infrastructure.
· Deploy, fine-tune, and maintain open-source LLMs, ensuring maximum throughput and minimal latency.
· Manage inference engines (e.g., vLLM, TensorRT-LLM) and handle dynamic GPU memory allocation for hundreds of concurrent agent requests.
AI Automation & Agent Infrastructure
· Provide a flawless, millisecond-response API layer for our "OpenClaw" agent farm (a fleet of 200+ bare-metal Apple Silicon nodes).
· Monitor model performance, detect hallucinations, and build synthetic training data pipelines to continuously improve agent accuracy.
· Design, scale, and maintain a high-performance, on-premise RAG service and vector database (e.g., Qdrant, Milvus, Milvus/Chroma) to seamlessly serve internal documentation to our LLMs.
· Build robust data ingestion and embedding pipelines to ensure internal knowledge bases and documents are updated in real-time for the RAG service.
Cross-Functional AI Tooling
· Work tightly with Product, Operations, and the Growth team to ensure alignment.
· Provide the technical foundation for the Growth team's AI-assisted content creation, verification, and contextual validation pipelines.
· Build repeatable AI engines that can guarantee linguistic, cultural, and regulatory correctness across dozens of markets simultaneously.
You are:
· A senior AI/MLOps engineer with proven experience scaling self-hosted LLM infrastructure in a production environment.
· Experienced with semantic search, vector databases, and information retrieval techniques (RAG) at scale.
· Deeply experienced with Python, PyTorch, CUDA, and modern inference serving frameworks.
· Experienced in using AI as a production and verification tool, not a gimmick.
· Comfortable working closely with networking and storage architects to eliminate I/O bottlenecks in a ZFS/Linux ecosystem.
· Highly structured, data-driven, and execution-focused.
· Motivated by building systems that scale, not campaigns that win awards.
What we offer:
· A senior tech role with direct ownership over a state-of-the-art AI hardware stack.
· A fast-scaling international fintech with real infrastructure, licenses, and products.
· Competitive salary and employment conditions.
· A modern office in Amsterdam Houthavens overlooking ‘Het IJ’.
· Daily healthy lunches prepared by our in-house chef.
· A culture that values execution, ownership, and long-term thinking.
Job Types: Full-time, Permanent
Work Location: In person