Associate Data / Tangerang
Responsibilities

We are looking for a Junior Data Engineer to join our team in building a data lake using Microsoft Data Fabric. This role involves ingesting and processing data from multiple sources and optimizing the resulting pipelines while ensuring data integrity and performance. The ideal candidate has experience with big data solutions, cloud environments, and data pipeline automation.

  • Design, develop, and maintain a scalable Data Lake architecture using Microsoft Data Fabric.
  • Build and optimize ETL workflows using Apache Spark for structured and unstructured data.
  • Integrate and manage SQL Server data sources for efficient data ingestion.
  • Ensure data integrity, performance, and security across the data pipeline.
  • Collaborate with data scientists, analysts, and business teams to provide clean and structured data.
  • Implement best practices for data modeling, processing, and performance tuning.
  • Automate data pipeline monitoring and troubleshooting to improve system reliability.
Requirements
  • Strong problem‑solving and analytical skills.
  • Ability to work in a collaborative team environment.
  • 0–2 years of experience in data engineering or a similar role.
  • Strong understanding of SQL Server or other RDBMS.
  • Proficiency in Python or Scala for data processing and automation.
  • Experience working with Microsoft Data Fabric or similar data lake solutions.
  • Hands‑on experience with Apache Spark for ETL processing.
  • Experience with CI/CD pipelines and workflow orchestration tools.
  • Solid understanding of cloud‑based data architectures (Azure, AWS, GCP).
Seniority level

Entry level

Employment type

Full‑time

Job function

Information Technology
