Descripción de la oferta
You will join a team building next-generation AI-powered products based on Large Language Models. The project focuses on transforming LLM prototypes into scalable, secure, and production-grade systems. The role covers the full AI lifecycle – from architecture and backend development to infrastructure, monitoring, and cost optimisation. ✅ Your responsibilities: Build and maintain production-grade LLM systems (including RAG, semantic search, embeddings, vector databases). Design and develop APIs and microservices in Python (e.g. FastAPI). Develop and maintain CI/CD pipelines for models (LLMOps/MLOps). Deploy and manage infrastructure in AWS (Lambda, ECS/EKS, S3, API Gateway, CloudWatch). Implement containerisation (Docker) and orchestration (Kubernetes). Monitor model quality (latency, drift, hallucinations, cost efficiency). Collaborate closely with Data Science, ML, Data Engineering and Product teams.