ML Data Engineer
About this job
Job description
Artificial Intelligence holds immense potential for improving our lives and our work. One key challenge is its effective application in manufacturing. At EthonAI, you will have the unique opportunity to shape the manufacturing landscape across entire industries. As a member of our team, you will have the chance to tackle exciting engineering challenges from distributed computing to computer vision, using state-of-the-art technologies.
Who is EthonAI?
EthonAI is a technology leader in Industrial AI trusted by global manufacturers to optimize cost, quality, and speed. Our Platform contains purpose-built applications with proprietary AI, integrated into a scalable infrastructure. Our modules automate insight generation, ensure stable processes through closed-loop control, and build resilience against skill gaps and labor shortages.
We are multinational, multifaceted, and action-oriented. We are intentional about the principles that guide us every day: create immediate impact and value, play in the top league, and create the best place to work and grow.
What is this job about?
We are looking for an ML Data Engineer to work on our Industrial AI analytics products. You will have the opportunity to transform cutting-edge research into a product that is deployed in factories around the globe. You will be responsible for developing scalable data pipelines, curation workflows, and labeling infrastructure to fuel our ML models with high-quality industrial datasets. We expect you to be self-driven, eager to learn, and ready to grow with our company in a dynamic and fast-paced environment.
Job requirements
First and foremost, we believe in curious people who are eager to learn and grow. Please reach out even if you don’t check every point listed here.
Your Roles & Responsibilities
Design and operate scalable data pipelines for ingestion, preprocessing, augmentation, and validation.
Source and curate diverse computer vision datasets, including open-source and proprietary industrial data.
Build integrations with annotation tools (e.g., Label Studio, CVAT) and external labeling providers.
Ensure reproducibility and scalability of data workflows.
Support ML research by ensuring high-quality, well-organized datasets are always available for training and evaluation.
Your Experience & Skills
Solid programming skills in Python.
Familiarity with cloud infrastructure (AWS/GCP/Azure) and object stores (e.g., S3, MinIO).
Familiarity with data versioning tools (e.g., DVC, MLflow, LakeFS).
Practical ML skills: able to apply existing models (e.g., object detectors, SAM, OCR) to support data curation, pre-annotation, or quality checks.
Experience with annotation tools and labeling services is a strong plus.
Background with multimodal datasets (images, text, video) is a plus.
Familiarity with ML libraries such as PyTorch, Hugging Face, and OpenCV is a plus.
Strong motivation to work in a dynamic and fast-paced environment.
Why is EthonAI right for you?
We foster a culture where team members can grow and learn from each other. We are dedicated to being the workplace where each of us thrives.
We also offer a good handful of specific benefits:
Comprehensive compensation package with stock options to become a co-owner of EthonAI.
On-site work is important, with significant scope for flexible schedules and partially remote work.
A CHF 2’000 yearly bonus when you take two consecutive weeks of vacation.
Flexible parental leave.
Support for continued learning, such as online courses and conferences.
See your work being deployed in cutting-edge factories around the globe.
We just moved to a brand-new office, and we are ready to grow.
At EthonAI, there are always plenty of opportunities to step up and take charge. Other keys to our job satisfaction are the palpable impact we have on the real world and being part of a top-notch team that is focused on a successful outcome for all.
EthonAI is an Equal Opportunity Employer. We are committed to creating a work environment that is fair and inclusive, where all decisions related to recruitment, advancement, and retention are free of discrimination. We particularly encourage applications from individuals traditionally underrepresented in tech.