ML Software Engineer, Data Plane

מלאה

הגשת מועמדות יצירת בקשה לפגישה עם המעסיק

הנדסה|מהנדס תוכנה|תוכנה

מלאה

רמת שכר

25,000

פורסם לפני 2 ימים

פורסמה ברשת

The MLIL DataPlane team is looking for a Software Development Engineer to own the design and implementation of our inference data plane. We build the software that makes large models run efficiently on custom hardware – spanning model execution, memory management, data movement, and serving integration.
Our work covers the full inference path: integrating serving engines with custom hardware, developing high-performance compute kernels, enabling efficient data movement, and driving models from early validation through production. We operate at frontier scale with large distributed models.
This is a ground-up effort with rapidly evolving hardware and software. We are looking for an IC who can write and optimize low-level code for custom hardware, validate model architectures end-to-end, build test and profiling infrastructure, and drive performance across the stack.

Key job responsibilities
– Develop and optimize compute kernels for a custom ML accelerator architecture, targeting production-level performance for large language model inference.
– Implement and validate LLM architectures (decoder-only, mixture-of-experts) end-to-end – from PyTorch model definition through distributed execution on custom hardware.
– Integrate custom accelerator backends into open-source ML serving frameworks (vLLM, PyTorch), including scheduler extensions, memory management, and model parallelism.
– Build and maintain test infrastructure for model correctness validation across CPU, GPU, simulator, and hardware targets.
– Profile and optimize inference workloads – identify bottlenecks, instrument critical paths, and drive latency and throughput improvements from simulation through hardware bringup.
– Own features end-to-end: from design through implementation, testing, and integration into the broader software stack.
– Contribute to CI/CD pipelines that gate model and kernel changes on correctness and performance regressions.

Requirements:
Basic Qualifications
– Bachelor's degree or equivalent.
– 3+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience.
– Knowledge of computer architecture, operating systems, and parallel computing.

Preferred Qualifications
– Knowledge of Machine Learning and LLM fundamentals, including transformer architecture, training/inference lifecycles, and optimization techniques.
– Knowledge of ML frameworks including JAX, PyTorch, vLLM, SGLang, Dynamo, TorchXLA, and TensorRT.
– Experience in developing and deploying LLMs in production on GPUs, Neuron, TPU or other AI acceleration hardware.

This position is open to all candidates.

מידת ההתאמה שלי

התאמה למשרה

כישורים0%

יש לך 0 מתוך 6 הכישורים הנדרשים

כישורים חסרים:

Ci Cd Pipeline DevelopmentCompute Kernel DevelopmentCustom Accelerator IntegrationInference Workload OptimizationLlm Architecture ImplementationTest Infrastructure Development

התאמתך למשרה מחושבת על פי כישוריך וניסיונך (כפי שסיפרת לנו עליהם) מול דרישות המעסיק - אין בכך כדי להעיד על קבלתך לעבודה (זה יחליט המעסיק)

מידע על תפקיד

מהנדס תוכנה

מהנדס תוכנה מתכנן, מפתח ומתחזק יישומי תוכנה ומערכות. תחומי האחריות כוללים כתיבת קוד נקי ויעיל, ניפוי שגיאות ובדיקת תוכנה, ושיתוף פעולה עם צוותים חוצי-פונקציות כדי לספק מוצרים באיכות גבוהה. מיומנויות מפתח כוללות שליטה בשפות תכנות (למשל, Java, Python, C++), הבנה של מתודולוגיות פיתוח תוכנה ויכולות פתרון בעיות. היכרות עם מערכות בקרת גרסאות וכלי פיתוח היא חיונית.

קורסים והכשרות להגיע לתפקיד

C Essentials 2

C Essentials 2 בוחן מושגי תכנות מתקדמים יותר כמו פונקציות ומבנים. תלמד גם איך לעבוד עם קבצים וזרמים ולהשתמש בהנחיות מעבד קדם ובהצהרות מורכבות כדי לשפר את כישורי התכנות שלך.

C++ Essentials 2

C++ Essentials 2 מלמד את הניגוד בין מתודולוגיות פרוצדורות לתכנות מונחה עצמים (OOP), מבני מחלקות, תורשה ופולימורפיזם. תלמדו גם מושגים מתקדמים כמו חריגים, עומס יתר של מפעילים וסוגים מסופרים.

C Essentials 1

שפת התכנות C מהווה את הליבה של מערכות הפעלה, מסדי נתונים ומנועי משחק חדישים. מוערך בזכות המהירות, היעילות והרבגוניות שלה, C היא אחת משפות התכנות המבוססות ביותר.

משרות חדשות במערכת שיכולות לעניין אותך

לחברת הי טק העוסקת בסחר רכיבי אלקטרוניקה דרוש מהנדס /ת תמיכה טכנית Edge AI

מלאה

פתח תקווה

פורסם לפני 10 שעות

הצטרפות לצוות התמיכה של החברה, האחראי לליווי טכני של לקוחות בתהליך בחירה, אינטגרציה והטמעה של רכיבי וכרטיסי Edge AI במוצרי ...

כאן תמצאו את ג׳וב החלומות שלכם

ML Software Engineer, Data Plane

רוצה להתייעץ? אנחנו כאן לעזור