Insignia's
looking for a Senior Machine Learning Engineer who has gone beyond prompt-tuning and fine-tuning, someone who's architected, deployed, and optimized RAG-based systems and LLM pipelines in real-world applications.
Our Office at Meruya, West Jakarta
You'll lead the design and implementation of AI solutions that power search, automation, and intelligent agents. If you've debugged retrieval drift, optimized chunking strategies, or scaled LLM serving with low latency, we want to see how you build.
This is a hands-on leadership role based in West Jakarta, where your code sets the standard and your decisions shape our AI future.
What You'll Do:
Lead the design and deployment of production-grade RAG and LLM systems
Optimize retrieval accuracy, context quality, and model performance at scale
Build robust data pipelines for ingestion, chunking, embedding, and indexing
Collaborate with data engineers, software teams, and product leads to integrate AI into core features
Set best practices for prompt engineering, evaluation, and monitoring
Mentor junior engineers and drive technical direction for GenAI projects
Who You Are:
4+ years in ML engineering, with 1.5+ years focused on LLMs and RAG systems
Deep hands-on experience with LangChain, LlamaIndex, vector databases (Pinecone, Weaviate, FAISS)
Strong in Python, PyTorch/TensorFlow, and MLOps tools (MLflow, Kubernetes, Docker)
Experience deploying models on AWS, GCP, or Azure or hybrid environments
Familiar with evaluation frameworks, A/B testing, and latency optimization
Bonus: Experience with fine-tuning, LoRA, or distillation for domain-specific performance
Practical mindset you care about reliability, cost, and impact, not just novelty
Perks & Benefits:
- Hybrid work mode (2 days WFH, 3 days WFO)
- Daily meals provided
- Health insurance coverage
- Clear career development path
- A collaborative and growth-driven team culture,
- etc