Insignia's
looking for a Senior Machine Learning Engineer who has gone beyond prompt-tuning and fine-tuning, someone who's architected, deployed, and optimized RAG-based systems and LLM pipelines in real-world applications.

Our Office at Meruya, West Jakarta

You'll lead the design and implementation of AI solutions that power search, automation, and intelligent agents. If you've debugged retrieval drift, optimized chunking strategies, or scaled LLM serving with low latency, we want to see how you build.

This is a hands-on leadership role based in West Jakarta, where your code sets the standard and your decisions shape our AI future.

What You'll Do:

Lead the design and deployment of production-grade RAG and LLM systems

Optimize retrieval accuracy, context quality, and model performance at scale

Build robust data pipelines for ingestion, chunking, embedding, and indexing

Collaborate with data engineers, software teams, and product leads to integrate AI into core features

Set best practices for prompt engineering, evaluation, and monitoring

Mentor junior engineers and drive technical direction for GenAI projects

Who You Are:

4+ years in ML engineering, with 1.5+ years focused on LLMs and RAG systems

Deep hands-on experience with LangChain, LlamaIndex, vector databases (Pinecone, Weaviate, FAISS)

Strong in Python, PyTorch/TensorFlow, and MLOps tools (MLflow, Kubernetes, Docker)

Experience deploying models on AWS, GCP, or Azure or hybrid environments

Familiar with evaluation frameworks, A/B testing, and latency optimization

Bonus: Experience with fine-tuning, LoRA, or distillation for domain-specific performance

Practical mindset you care about reliability, cost, and impact, not just novelty

Perks & Benefits:

Hybrid work mode (2 days WFH, 3 days WFO)
Daily meals provided
Health insurance coverage
Clear career development path
A collaborative and growth-driven team culture,
- etc

Apply on the website