Data Engineer (Databricks)
Founded in 1994 and headquartered in Switzerland, ERNI is a leading software development company with over 800 employees worldwide. Specializing in IT and software engineering, we drive innovation in process and technology. Our first service center in Asia Pacific, located in Metro Manila (Mandaluyong), supports clients across Europe, APAC, the Philippines, and the USA. As we continue to grow, we're looking for passionate and motivated individuals to join our team.
Why ERNI is the Perfect Place for You: 🏡
• International Exposure: Work with global clients on cutting-edge projects.
• Inclusive Culture: Thrive in a collaborative and diverse work environment.
• Career Development: Enjoy continuous learning and professional growth opportunities.
🤩Perks and Benefits:
• Career Stability: Enjoy a stable career path with ample project opportunities.
• Immediate Coverage: Private HMO and insurance benefits from day one.
• Jubilee Celebration: A 5-year milestone includes a complimentary trip to any European ERNI site.
• Comprehensive Benefits: Government-mandated benefits including 13th-month pay.
• Skill Enhancement: Access free training and certifications.
• Wedding Gift: To celebrate your special day.
• Baby Basket: To welcome your newborn to the ERNI family.
• Fruit Basket: A boost of vitamins during hospitalization.
• Office Perks: Enjoy free snacks and coffee.
🔐Growth and Opportunities:
• Free Training: Advance your skills through technical and non-technical training.
• Challenging Projects: Engage in complex software projects across MedTech, Industry, Finance, and Transportation.
• Supportive Environment: Benefit from a team dedicated to guiding and supporting your success.
• Recognition and Advancement: Receive acknowledgment for your efforts and opportunities for promotion.
• Open Communication: Experience a transparent culture that values your input.
⏱Flexibility:
• Hybrid Work Setup: Balance remote and in-person work for better work-life integration.
🎉Events:
• Connect and Celebrate: Participate in a variety of events including leisure, summer, family, social, and year-end gatherings.
👋What we are looking for:
Experience:
- 7+ years of experience in data engineering roles, with at least 2 years in a leadership role and projects involving Databricks and AWS/Azure.
- Proven expertise in data pipelines, feature engineering, and dataset preparation for machine learning, specifically LLMs.
- Experience building enterprise-grade applications with GenAI or AI/ML integrations.
Technical Skills:
- Expertise in Databricks, Apache Spark, and Delta Lake.
- Strong programming skills in Python and SQL; knowledge of libraries like pandas, NumPy, or PyTorch is a plus.
- Understanding of state management libraries like Redux, Recoil, or Zustand.
- Familiarity with Cypress and version control (Git).
- Understanding of web security principles and compliance requirements for enterprise applications.
Soft Skills:
- Exceptional problem-solving and decision-making abilities.
- Excellent communication and leadership skills, with the ability to guide technical discussions and mentor team members.
- Strong focus on delivering quality.
💼How can you contribute to the team?
The Senior Data Engineer will specialize in building and optimizing data pipelines with Databricks and preparing datasets for Large Language Models (LLMs). This role will focus on designing scalable, efficient data architectures to support cutting-edge machine learning initiatives, particularly in generative AI applications.
1. Data Pipeline Development:
- Design, implement, and optimize end-to-end data pipelines using Databricks, AWS, Azure, and related technologies.
- Build workflows to handle large-scale data ingestion, transformation, and storage.
2. Data Preparation for LLMs:
- Preprocess, clean, and structure diverse datasets (text, structured, and unstructured) for LLM training and fine-tuning.
- Implement feature engineering, tokenization, and vectorization techniques to support NLP models.
3. Performance Optimization:
- Use Databricks features, including Delta Lake and MLflow, to streamline data workflows.
- Optimize data infrastructure for high availability, scalability, and cost-efficiency.
4. Collaboration with Teams:
- Work closely with data scientists, ML engineers, and other stakeholders to understand the data requirements of LLM initiatives.
- Ensure alignment between engineering pipelines and machine learning goals.
5. Data Quality & Governance:
- Implement processes to ensure data quality, consistency, and compliance with governance policies.
- Monitor and maintain data integrity throughout the pipeline lifecycle.
6. Emerging Technology Adoption:
- Stay updated on advancements in Databricks, generative AI, and LLM technologies.
- Contribute to the adoption of innovative tools and practices to improve workflows.
Switzerland · Germany · Spain · Slovakia · Romania · Philippines · Singapore · USA
ERNI Development Center Philippines Inc., 9th Floor, Lica Malls Shaw, 500 Shaw Boulevard, 1555, Mandaluyong City, Philippines
+63 5310 1707 | www.betterask.erni | info@erni.ph
- Department: Data & AI
- Role: Data Engineer
- Locations: Metro Manila
- Remote status: Hybrid

About ERNI
We deliberately focus on what we know best.
- 18 Locations in 8 Countries
- 800+ Employees across the Globe
- ISO Certified