Fresher Software Engineer (3-month contract)
- Ho Chi Minh
- Internship/Collaborator
- 24-ITC-0601
We are seeking a skilled Software Engineer with expertise in web crawling and data scraping, focused on Vietnamese-related image content. The ideal candidate will develop and manage an automated data collection system to gather and organize large volumes of image data from various websites and social media. Your work will involve implementing a scalable pipeline for data extraction, processing, and storage in our database.
What you will do
- Design, develop, and maintain an automated web scraping solution targeting Vietnamese-related image content.
- Collect and organize image data across multiple categories, including:
- Official documents, contracts, newspapers, magazines, textbooks
- Product images from e-commerce sites.
- Cultural and lifestyle images of Vietnam, including historical landmarks, food, events, and daily life images from platforms such as Facebook, Instagram, and TikTok.
- Ensure compliance with web scraping ethics, site policies, and security best practices.
What you will need
- Programming Languages: Proficient in Python
- Big Data & Image Data Processing: Experience handling large datasets, especially image data
- Web Technologies: Understanding of HTML, JavaScript, and website security practices to bypass scraping challenges.
- Database Management: Proficient in NoSQL (e.g., MongoDB) and SQL (e.g., PostgreSQL) for efficient data storage and retrieval
- Pipeline Development: Experienced in designing and implementing data scraping pipelines.
- Knowledge of social media APIs (Facebook, Instagram, TikTok) for targeted data collection is a plus.
Related Jobs
Senior Front-End Developer
- Ho Chi Minh
- Fulltime
Software Engineer
- Ho Chi Minh
- Fulltime
Manager – Credit Risk Management
- Ho Chi Minh
- Fulltime