About the role
Lightmatter is leading the revolution in AI data center infrastructure, enabling the next giant leaps in human progress. The company invented the world’s first 3D-stacked photonics engine, Passage™, capable of connecting thousands to millions of processors at the speed of light in extreme-scale data centers for the most advanced AI and HPC workloads.
Lightmatter raised $400 million in its Series D round, reaching a valuation of $4.4 billion. We will continue to accelerate the development of data center photonics and grow every department at Lightmatter!
If you're passionate about tackling complex challenges, making an impact, and being an expert in your craft, join our team of brilliant scientists, engineers, and accomplished industry leaders.
Lightmatter is (re)inventing the future of computing with light!
In this role, you will lead the development of a comprehensive High Temperature Operating Life (HTOL) Test Software system. Your work will involve designing, implementing, and maintaining a scalable multi-chassis testing platform that performs automated stress and performance testing with real-time monitoring and comprehensive data collection capabilities.
Responsibilities
- System Design & Development: Architect, build, and maintain scalable architecture for a multi-chassis HTOL testing system.
- Orchestration: Develop containerized applications for deployment at scale using Python-based services for chassis coordination and management.
- Hardware Monitoring & Management: Create hardware abstraction layers and develop APIs that represent hardware systems, providing essential capabilities for monitoring and management of those systems.
- Manage Data: Develop data collection pipelines handling sensor data and performance metrics.
- Deploy and Update Software: Create automated deployment and testing pipelines using CI/CD best practices.
- Collaboration with Front-End Teams: Work closely with the frontend team to ensure seamless integration of backend APIs with applications.
- Testing & Documentation: Write automated tests to monitor the reliability and performance of the system; maintain clear and concise documentation for troubleshooting.
- Performance and Reliability: Continuously monitor and optimize performance to reduce response times and improve system scalability; ensure uptime in production environments; establish capacity planning procedures.
Required Skills
- BS and 12+ years of experience or MS and 8+ years of experience; degree in Computer Science, Electrical Engineering, or related field.
- Expert level Python, knowledge of web frameworks such as FastAPI, Flask, Django; strong understanding of API design principles and best practices.
- Experience with containerization and orchestration technologies such as Docker and Docker Compose.
- Experience with one or more databases such as MongoDB, PostgreSQL, Redis, time-series databases.
- Familiarity with testing frameworks such as pytest and integration testing, performance testing tools.
- Experience with CI/CD tools such as GitHub Actions/Runners and Infrastructure as Code tools such as Ansible.
- Experience with hardware integration or embedded systems; interfacing with BMCs, FPGAs, temperature sensors, thermal management, power management systems.
Nice-to-have skills
- Familiarity with real-time data handling and communication protocols, such as gRPC, TCP/IP, WebSockets, message brokers or similar technologies.
- Experience with high-availability, mission-critical systems.
- Experience in the Semiconductor Industry: HTOL, wafer-level testing, burn-in systems, reliability testing.
- Professional Certifications: Agile/Scrum certifications.
- Experience building backend services for web applications like Next.js, proficiency in JavaScript/TypeScript.
We offer competitive compensation. The base salary range for this role determined based on location, experience, educational background, and market data.
Benefits
- Comprehensive Health Care Plan (Medical, Dental & Vision)
- Retirement Savings Matching Program
- Life Insurance (Basic, Voluntary & AD&D)
- Generous Time Off (Vacation, Sick & Public Holidays)
- Paid Family Leave
- Short Term & Long Term Disability
- Training & Development
- Commuter Benefits
- Flexible, hybrid workplace model
- Equity grants (applicable to full-time employees)
Benefits eligibility may vary depending on your employment status and location. Lightmatter recruits, employs, trains, compensates, and promotes regardless of race, religion, color, national origin, sex, disability, age, veteran status, and other protected status as required by applicable law.
Export Control
Candidates should have capacity to comply with the federally mandated requirements of U.S. export control laws.
Skills & Tags
Aplyr's read
Lightmatter pioneers photonic computing to revolutionize AI, attracting engineers and scientists passionate about cutting-edge technology and innovation.
What's promising
- •Lightmatter is at the forefront of photonic computing, a promising field for AI acceleration.
- •The company attracts top talent with its focus on innovative hardware and software solutions.
- •Recent hires indicate robust growth and investment in diverse engineering roles.
What to watch
- •The niche focus on photonic computing may limit broader industry applicability.
- •Rapid technological changes in AI could pose adaptation challenges.
- •High specialization demands may create a steep learning curve for new employees.
Why Lightmatter
- •Lightmatter's emphasis on photonic computing uniquely positions it in the AI hardware landscape.
- •The company integrates cutting-edge photonics with AI, unlike traditional electronic approaches.
- •Lightmatter's roles reflect a deep commitment to developing proprietary technology solutions.
Aplyr’s read is generated by AI from public sources. Was it useful?
About Lightmatter
Lightmatter is a technology company focused on developing photonic computing solutions to accelerate artificial intelligence and machine learning applications.