About Celestial AI
With the growth in Generative AI, data center infrastructure it is not just about the System on Chip but about the System of Chips. In the era of Accelerated Computing, data center bottlenecks are no longer limited to compute performance, but rather the system’s interconnect bandwidth, memory bandwidth, and memory capacity. Celestial AI's Photonic Fabric is the next-generation interconnect technology offering a10X increase in performance and energy efficiency over competitive technologies.
The Photonic Fabric™ is available to our customers in multiple technology offerings, including optical chiplets, optical interposers, and Optical Multi-chip Interconnect Bridges (OMIB). This enables our customers to seamlessly integrate high bandwidth, low power, low latency optical interfaces into their AI accelerators and GPUs. The technology is fully compatible with both protocol and physical layers, including standard 2.5D packaging flows. This ease of integration enables XPUs to have optical interconnects for compute-to-compute and compute-to-memory fabrics that deliver tens of Tbps bandwidth with nano-second latencies.
This innovation empowers hyperscalers to improve the efficiency and economics of AI processing by optimizing the XPUs needed for training and inference and significantly lowering the TCO2 impact. To support customer engagements, Celestial AI is cultivating a Photonic Fabric ecosystem. These tier-1 partnerships consist of custom silicon/ASIC design, system integrators, HBM memory, assembly, and packaging suppliers.
Job Description:
We are seeking a highly skilled Senior ASIC/VLSI Synthesis and Design Engineer to join our team. The successful candidate will be responsible for developing and implementing complex digital designs for high-performance ASICs and SoCs. The role will involve working closely with design teams to optimize the designs for power, performance, and area while meeting stringent time-to-market constraints. The ideal candidate should have a strong background in ASIC/VLSI design flow, digital design, and synthesis methodologies, as well as experience in clock level and top-level synthesis, timing closure, gate level simulation, DFT, power analysis, memory BIST and repair. Experience with deep technology nodes and very large designs is also essential.
ESSENTIAL DUTIES AND RESPONSIBILITIES:
- Develop and implement high-performance, low-power, and area-efficient digital designs for ASICs and SoCs using industry-standard EDA tools.
- Work closely with design teams to understand the requirements and constraints of the design, and provide feedback on design feasibility, timing, and power.
- Write and implement block level and top level constraints for synthesis, perform timing closure, and gate level simulation.
- Develop and implement synthesis flows and methodologies, and drive improvements in the design process.
- Debug and resolve design issues related to synthesis, timing, power, and area.
- Design and support DFT flows, including scan insertion and ATPG.
- Optimize designs for power, performance, and area, and meet design goals within the given schedule. Implement memory BIST and repair, and support post-silicon validation.
- Implement pipelining at different levels for performance optimization and timing closure.
- Perform power analysis and optimize designs for low power.
QUALIFICATIONS:
- Bachelor's/Master's degree in Electrical/Electronic Engineering or Computer Science. At least 10 years of experience in ASIC/VLSI design, with a focus on synthesis, design, and DFT.
- Strong understanding of digital design principles, and experience with RTL coding in Verilog/SystemVerilog.
- In-depth knowledge of synthesis methodologies and tools from leading EDA vendors ( Genus, Tempus, DC, PrimeTime, etc).
- Experience with writing design constraints for synthesis, timing closure, gate level simulation, and pipelining at different levels for performance optimization and timing closure.
- Experience with DFT flows, including scan insertion and ATPG.
- Familiarity with physical design and backend flow.
- Experience with power analysis and optimization flows such as power gating, clock gating, voltage scaling, and dynamic voltage frequency scaling.
- Experience with memory BIST and repair, and post-silicon validation.
- Experience with deep technology nodes and very large designs.
- Experience with scripting languages such as Perl, Python, or Tcl.
- Excellent problem-solving skills and ability to work independently and in a team environment.
- Strong communication and interpersonal skills, with the ability to interact effectively with cross-functional teams.
- Proven track record of delivering successful designs on time and meeting performance, power, and area goals.
For California location:
As an early startup experiencing explosive growth, we offer an extremely attractive total compensation package, inclusive of competitive base salary and a generous grant of our valuable early-stage equity. The target base salary for this role is approximately $175,000.00 - $195,000.00. The base salary offered may be slightly higher or lower than the target base salary, based on the final scope as determined by the depth of the experience and skills demonstrated by candidate in the interviews.
We offer great benefits (health, vision, dental and life insurance), collaborative and continuous learning work environment, where you will get a chance to work with smart and dedicated people engaged in developing the next generation architecture for high performance computing.
Celestial AI Inc. is proud to be an equal opportunity workplace and is an affirmative action employer.
#LI-Onsite
Top Skills
What We Do
Celestial AI is a Machine Learning (ML) accelerator company that has developed a proprietary technology platform which enables the next generation of high-performance computing solutions. Celestial AI’s mission is to transform data parallel computing with a proprietary Photonic Fabric™ technology platform which uses light for data movement both within chip and between chips.
Advancements in data communications have driven robust silicon photonics technology and volume manufacturing ecosystems that are ripe for commercial implementation of ML and high-performance computing (HPC) solutions which leverage integrated silicon photonics for data movement.
Celestial AI’s system delivers differentiated single node performance that scales efficiently, providing significant performance gains for multi-node and multi model applications. The scalability of Celestial AI’s accelerator architecture enables an efficient and performant mapping of data and compute over a broad range of ML model types without the need for complex software optimizations. Celestial AI’s competitive advantage will further grow over time as ML models continue to increase in complexity and size.
Celestial AI has assembled a highly experienced team of industry leaders who have a track record of building multiple successful technology businesses. The company’s Orion AI accelerator products serve an addressable market that is projected by Omida to exceed $70 billion in 2025