Senior Data Engineer, AI HN
The Data Science Hanoi Team manages the Recommendation Platform, a real-time, large-scale platform. We are looking for a candidate to join the team and develop large-scale data pipelines as well as the Recommendation Platform.
Job Description
In line with the company's AI-first mission, design and build a self-serve data platform that serves the needs of people at MoMo and its partners. They will be allocated resources, based on their needs, for:
Ingesting multiple data sources, from batch to streaming and from pull to push mechanisms;
Developing and deploying resilient data pipelines across the data lake, data warehouse, and streaming data;
Delivering high-quality derived data to downstream systems such as BI solutions (PowerBI, Google Data Studio,…), Marketing Platform, Promotion Platform, Experiment Platform,… in multiple ways (API, Dataset, Streaming Data,…);
Building Machine Learning models and Business Intelligence dashboards;
Monitoring the data quality of data pipelines in the data platform;
Monitoring and optimizing resource usage;
Design and build Data Management Systems that support the Data Governance team and data consumers in:
Managing the data life cycle in the big data platform;
Exploring the MoMo data ecosystem on a self-serve basis;
Providing a high-quality single source of truth to multiple downstream consumers;
Managing the attribution of infrastructure costs across large projects, teams, and departments;
Design and build a Data Loss Prevention (DLP) solution to protect our data and achieve data visibility across a large organization;
Collaborate with Machine Learning Engineers, Data Scientists, Business Analysts, Data Analysts, Product Owners, and Product Operators to strive for greater functionality in our data systems.
Job Requirements
Love of data. You run “SELECT COUNT(SHEEP) FROM BACKYARD” in your sleep!
Strong programming skills; 5 years of experience in Data Engineering
Languages: Java/Kotlin, Python, Scala, Vue.js, Bash scripts;
Infrastructure: Google Cloud Platform, Kubernetes;
Data Warehouse: BigQuery, Oracle;
Data Source: App Events, Oracle, MySQL, MSSQL, Kafka, Pubsub, REST API;
Data Pipeline Orchestration: Airflow;
BI: Data Studio;
- Problem-solving skills and team spirit;
- Experience with cloud platforms such as Google Cloud Platform or Amazon Web Services is a plus;
- A strong background in distributed systems is a big plus.