Safety tip: Beware of fake, part-time, or illegal job ads. Verify employer details before you apply or share personal information.
- Good Environment
- Annual Bonus
- Overtime Pay
Responsibilities
- Design, develop, and optimize machine learning models for high-performance inference.
- Collaborate with cross-functional teams to maximize throughput and minimize latency across our GPU cloud platform.
- Implement fine-tuning strategies to push foundation models to their hardware limits.
- Analyze performance metrics and develop solutions to optimize cost-per-token for large-scale deployments.
- Stay up-to-date with the latest advancements in AI and machine learning technology to continuously enhance our platform.
- Mentor junior engineers and contribute to a collaborative team environment.
Requirements
- Bachelor's degree in Computer Science, Engineering, or a related field; Master’s or PhD preferred.
- Proven experience in machine learning, deep learning, and AI technologies.
- Strong programming skills in Python and experience with ML frameworks such as TensorFlow or PyTorch.
- Familiarity with GPU architectures and distributed computing.
- Excellent problem-solving skills and ability to work in a fast-paced environment.
- Strong communication skills and the ability to work collaboratively with a diverse team.
Benefits
- Competitive salary and equity options.
- Comprehensive health, dental, and vision insurance.
- Flexible working hours and the option to work remotely.
- Generous vacation and paid time off policies.
- Access to cutting-edge technology and resources to facilitate innovation.
- Opportunities for professional development and career advancement.
General Technician5-10 years
General Technician5-10 years