Location: Zurich | Employment Type: Fixed-term | Workload: 100%
We are seeking a skilled Machine Learning Engineer to join our dynamic team. The ideal candidate will play a pivotal role in developing, optimizing, and maintaining the codebase we use to train large generative neural network models. This position requires a strong foundation in machine learning and software development, along with the ability to collaborate effectively in a research-focused environment.
The Swiss AI Initiative is a collaborative research project led by ETH Zurich and EPFL, dedicated to the development of responsible and transparent generative AI. A flagship component of this initiative is the Large Language Model (LLM) effort, aiming to create state-of-the-art language models across various scales, including an ambitious 70 billion parameter model.
This pioneering work leverages the Alps supercomputer at the Swiss National Supercomputing Centre (CSCS), which boasts more than 10,000 NVIDIA Grace Hopper GPUs, making it one of Europe’s most powerful AI-focused computing resources. The Swiss AI Initiative plans to distribute 15-20 million GPU hours annually to support diverse research and development projects in AI.
As a machine learning engineer on this project, you will contribute to the development and optimization of training pipelines for these large-scale models, working at the intersection of cutting-edge research and high-performance computing to enhance Switzerland's position in AI innovation.
In this role, you will be responsible for developing and maintaining software that trains large-scale neural networks, particularly large language models. You will collaborate closely with researchers and other engineers to design and implement scalable solutions for model training, evaluation, and deployment. A crucial part of your work will be optimizing existing machine learning frameworks to improve performance and efficiency.
To excel in this position, you will need to stay informed about the latest advances in AI and machine learning technologies. You will actively participate in code reviews and maintain comprehensive documentation to ensure code quality and reproducibility. There may also be opportunities to contribute to research papers and technical reports that share our team's technical achievements and research findings with the broader scientific community.
The successful candidate will enjoy a stimulating academic environment at one of the world's leading technical universities, with access to state-of-the-art supercomputing infrastructure and cutting-edge AI research.
In alignment with our values, ETH Zurich fosters an inclusive culture. We promote equality of opportunity, embrace diversity, and nurture an environment in which the rights and dignity of all staff and students are respected. Visit our Equal Opportunities and Diversity website to understand how we ensure a fair and open environment that allows everyone to thrive.
Apply online using the form below. Only applications matching the job profile will be considered.
For further information about the ETH AI Center and the Swiss AI Initiative, please visit our website. Questions regarding the position can be directed to Dr. Imanol Schlag at ischlag@ethz.ch (no applications).
ETH Zurich is among the world’s leading universities specializing in science and technology. Renowned for our excellent education, cutting-edge research, and direct transfer of knowledge into society, we attract over 30,000 people from more than 120 countries. Our university fosters independent thinking and inspires excellence. Situated in the heart of Europe yet connected globally, we collaborate to develop solutions for today’s and tomorrow’s global challenges.
Location: Zürich
Country: Switzerland