Ähnliche Jobs

AI Engineer

AI Engineer

AI Engineer

AI Engineer

microTECH Global

Halbleiter, elektronische Bauteile

München

  • Art der Beschäftigung: Vollzeit
  • 58.500 € – 87.500 € (von XING geschätzt)
  • Vor Ort

AI Engineer

Über diesen Job

Brief:

We are looking for an AI Engineer passionate about Generative AI and Agentic AI systems, someone who thrives on optimizing models for efficient on-device deployment. You will work on large language models (LLMs), large multimodal models (LMMs), and Vision-Language-Action (VLA) models, ensuring they run reliably and efficiently on our NPU-based platforms.

Responsibilities:

Optimize LLMs and multimodal models for on-device deployment

Investigate, develop and apply advanced quantization (8-bit, 4-bit, mixed precision), pruning, and distillation techniques for deriving optimized models for NXP NPU targets.

Accelerate inference performance

Investigate, develop and implement system optimizations such as speculative decoding and other efficient decoding algorithms tailored for edge environments.

Engineer agentic AI capabilities towards tiny agents

Investigate methodologies for enhancing the performance of small language models towards enabling tiny agents at the edge, while ensuring these follow safety principles.

Work with inference engines and deployment frameworks

Deploy optimized models using Ollama, llama.cpp, ONNX Runtime, and TFLite for efficient NPU inference.

Benchmark LLMs and agentic systems

Design benchmarking pipelines for assessing the performance of Generative and Agentic AI systems on-device

Requirements:

MSc, PhD or EngD in a technical specialism, like Computer Science or equally relevant.

5+ years of experience in software/AI engineering with deep exposure to LLMs, VLMs, and systems performance.

Experience with LLM quantization techniques (e.g., SmoothQuant, SpinQuant, QuaRoT), pruning (Wanda, SparseGPT, etc.) and other system optimizations like speculative decoding.

Track-record experience in working with AI frameworks (PyTorch, TensorFlow, etc.), required.

Experience with Agentic AI technologies and familiarity with existing frameworks (e.g., LangChain, Google ADK, SmolAgents, etc.)

Understanding of AI toolchains, deployment, portability and inference engines (CUDA, TensorRT, TFLite, ONNX, Ollama, etc.) preferred.

Affinity and experience with embedded systems, and NPU accelerators required.

Broad experience with Operating systems GNU/Linux, embedded systems, development boards, and processors, and SW competencies required.

Familiarity with setting up and maintaining related ML-Ops development environments (MLFlow, ClearML, etc.) required.

Knowledge of build systems (YOCTO, OpenEmbedded, etc.) beneficial, working with cross-compilation toolchains for ARM preferred.

Solid programming experience of C, C++, Python and Bash programming languages on Linux systems required.

Gehalts-Prognose

Unternehmens-Details

company logo

microTECH Global

Halbleiter, elektronische Bauteile

Vereinigtes Königreich

Ähnliche Jobs

Externes Job-Angebot. Von einem Partner.

Application Engineer - AI Enablement (m/f/d)

Advantest Europe GmbH

München + 0 weitere

74.000 €105.000 €

Externes Job-Angebot. Von einem Partner.

Application Engineer - AI Enablement (m/f/d)

München + 0 weitere

Advantest Europe GmbH

74.000 €105.000 €

Externes Job-Angebot. Von einem Partner.

Tenure Track Assistant Professor in "Mathematics of Machine Learning"

Technische Universität München (TUM)

München + 0 weitere

47.500 €71.500 €

Externes Job-Angebot. Von einem Partner.

Tenure Track Assistant Professor in "Mathematics of Machine Learning"

München + 0 weitere

Technische Universität München (TUM)

47.500 €71.500 €

Senior AI/ML Engineer (AI Lead)

Custom Surgical GmbH

München + 0 weitere

90.000 €105.000 €

Senior AI/ML Engineer (AI Lead)

München + 0 weitere

Custom Surgical GmbH

90.000 €105.000 €

Senior AI / ML Engineer (w/m/d) für Technologieberatung im Bankingumfeld

Passion for People GmbH

München + 0 weitere

75.000 €95.000 €

Neu · 

Senior AI / ML Engineer (w/m/d) für Technologieberatung im Bankingumfeld

München + 0 weitere

Passion for People GmbH

75.000 €95.000 €

Neu · 

ML Engineer

Proclinical Staffing

München + 0 weitere

Neu · 

ML Engineer

München + 0 weitere

Proclinical Staffing

Neu · 

Senior Machine Learning Engineer - (m/f/d)

Apple Inc

München + 0 weitere

71.500 €89.500 €

Neu · 

Senior Machine Learning Engineer - (m/f/d)

München + 0 weitere

Apple Inc

71.500 €89.500 €

Neu · 

Spezialist Machine Learning im Einkauf & Lieferantennetzwerk (m/w/d)

Guldberg GmbH

München + 0 weitere

59.500 €67.000 €

Spezialist Machine Learning im Einkauf & Lieferantennetzwerk (m/w/d)

München + 0 weitere

Guldberg GmbH

59.500 €67.000 €

AI Developer Technology Engineer

NVIDIA

München + 0 weitere

57.000 €83.000 €

AI Developer Technology Engineer

München + 0 weitere

NVIDIA

57.000 €83.000 €

Senior Deep Learning Engineer

NVIDIA

München + 0 weitere

61.000 €84.000 €

Senior Deep Learning Engineer

München + 0 weitere

NVIDIA

61.000 €84.000 €