An exceptional opportunity to join an innovative, high-growth organisation shaping the future of AI-powered automation and digital interaction.
We’re seeking a Machine Learning Engineer with full-stack development experience to work on cutting-edge projects involving Generative AI, Retrieval-Augmented Generation (RAG), and multi-agent reasoning frameworks.
This is a hands-on, end-to-end engineering role with impact across the full ML lifecycle – from experimentation to deployment.
- Conversational AI & Reasoning:Design, fine-tune, and deploy advanced LLMs with agentic capabilities
- RAG Pipelines:Build and optimise scalable pipelines for structured and unstructured data retrieval
- LLM Training & Fine-Tuning:Use methods like LoRA, QLoRA, SFT, PEFT, and RLHF
- Inference & Acceleration:Serve models using vLLM, DeepSpeed, Triton, TensorRT
- Multi-Agent Orchestration:Work with LangChain, AutoGen, CrewAI, DSPy and similar tools
- Cloud & MLOps (AWS):Deploy with SageMaker, Bedrock, Lambda, S3, ECS, EKS
- Full-Stack Integration:Build APIs (FastAPI, Flask) and integrate with React, TypeScript, Node.js
- Vector Search:Use FAISS, Weaviate, Pinecone, ChromaDB, OpenSearch
Required skills & experience:
- 3-5+ years of experience in ML engineering and software development
- Deep Python proficiency, with PyTorch, TensorFlow or Hugging Face
- Proven experience with LLMs, RAG, and deploying cloud-native AI on AWS
- Strong full-stack skills (React, TypeScript, Node.js) and API development
- Familiarity with vector databases and multi-agent frameworks
Apply now to join this high growth and award-winning organisation with the opportunity to be part of building the future of AI driven projects and solutions. The role offers a highly competitive salary and benefits package and will be office based in Manchester.
INDAMS
