Data Engineer II
MoMo is the market leader in mobile payments in Vietnam, driven by a commitment to enhancing the lives of Vietnamese citizens through technological innovation.
Within the MoMo BigData AI department, we prioritize Smart, Efficient, and Excellent execution. We are currently undergoing a major transformation to build a new hybrid data platform spanning multiple cloud vendors (GCP & AWS).
We are seeking an experienced Data Engineer to help us architect this platform to optimize for both budget control and technological flexibility. You will play a pivotal role in shifting our mindset from "managing data" to creating valuable Data Products that empower our internal consumers.
Job Description
With MoMo's AI-first mission, we are designing and building a self-serve data platform to empower both internal teams and external partners. This platform allocates resources based on users’ needs to support:
- Ingesting data from diverse sources — either in batch or streaming, using both pull and push mechanisms
- Developing and deploying resilient data pipelines across the data lake, data warehouse, and streaming systems
- Delivering high-quality, derived datasets to downstream tools such as BI solutions (e.g., Apache Superset, Looker Data Studio), via multiple delivery methods including APIs, datasets, and streaming data
- Monitoring data quality across all pipelines in the platform to support better decision-making, accurate reporting, and reliable machine learning outputs
- Tracking and optimizing resource usage for efficiency
Additionally, we are building Data Management Systems that enable the Data Governance team and data consumers to:
- Manage the full data lifecycle within the big data platform
- Explore the MoMo data ecosystem independently
- Provide a single source of truth with high data quality to downstream consumers
- Track and manage infrastructure costs across major projects, teams, and departments
Job Requirements
The Mindset
Passion for Data: You dream in SQL ("SELECT COUNT(SHEEP)...") and care deeply about data accuracy.
Product Thinking: You view data as a product, focusing on the usability and reliability of what you deliver to stakeholders.
The Tech Stack
Strong Coding Skills: Proficiency in Java/Kotlin (for robust backend services) and Python (for data processing/scripting).
Hybrid Cloud Infrastructure: Hands-on experience with GCP. Proficiency in Kubernetes, Docker, and IaC tools like Pulumi or Terraform.
Big Data Engines: Deep understanding of computing engines like Spark, Trino, BigQuery, and ClickHouse.
Orchestration: Experience building DAGs and workflows in Airflow or Temporal.
Data Sources: Familiarity with diverse sources including App Events, CDC from transactional DBs (Oracle, MySQL, MSSQL), and streaming systems (Kafka, Pub/Sub).
Soft Skills
Strong problem-solving abilities with a focus on root-cause analysis.
Collaborative spirit: You can explain complex infrastructure decisions to non-technical stakeholders.
