We're looking for a Senior AI/ML Engineer to join a fast-paced startup that's building AI teammates to automate the work of data teams. In this role, you'll be instrumental in developing and deploying large language models and AI systems at scale. You'll work on cutting-edge projects that involve designing intelligent agent architectures, hands-on with vector databases and model-fine tuning techniques that will power products used by thousands of data professionals worldwide.
Client offers a base salary and equity of 0.05% - 0.4%
location: Sunnyvale, California
job type: Permanent
salary: $180,000 - 210,000 per year
work hours: 8am to 5pm
education: No Degree Required
responsibilities:
- Advanced AI Agent Architecture: Design and implement sophisticated multi-agent systems that can understand context, make decisions, and enable seamless collaboration between AI and human data professionals.
- Large-Scale RAG Systems: Build and optimize RAG pipelines that can efficiently process and utilize enterprise-scale knowledge bases.
- Model Optimization & Deployment: Implement advanced techniques (LoRA, Prompt Tuning) to adapt off-the-shelf models for specific enterprise data tasks, ensuring faster, more accurate performance.
- Scalable AI Infrastructure: Develop and optimize cloud-native architectures (AWS, Kubernetes) for large-scale training, inference, and multi-agent orchestration.
- Enterprise Security & Compliance: Implement robust security measures for LLM applications, including data anonymization, prompt injection prevention, and compliance with enterprise security standards.
qualifications:
- 7+ years of hands-on experience in AI/ML or 6 years of direct experience working in a startup with PyTorch or TensorFlow.
- 2+ years of hands-on experience in LLM and GenAI projects.
- Proven track record of successfully deploying ML/AI systems in production environments.
- Experience working with enterprise clients and understanding their specific needs and challenges.
- Deep expertise in building and deploying LLMs and AI systems at scale.
skills:
- Programming: Strong Python programming skills and deep familiarity with the ML stack, including NumPy, Pandas, and scikit-learn.
- Deep Learning: Production experience with deep learning frameworks PyTorch & Tensorflow
- LLMs: Proven experience with prompt engineering best practices and LLM evaluation metrics.
- Infrastructure: Experience with cloud-native architectures (AWS, Kubernetes) is a plus.
Equal Opportunity Employer: Race, Color, Religion, Sex, Sexual Orientation, Gender Identity, National Origin, Age, Genetic Information, Disability, Protected Veteran Status, or any other legally protected group status.
At Randstad Digital, we welcome people of all abilities and want to ensure that our hiring and interview process meets the needs of all applicants. If you require a reasonable accommodation to make your application or interview experience a great one, please contact HRsupport@randstadusa.com.
Pay offered to a successful candidate will be based on several factors including the candidate's education, work experience, work location, specific job duties, certifications, etc. In addition, Randstad Digital offers a comprehensive benefits package, including: medical, prescription, dental, vision, AD&D, and life insurance offerings, short-term disability, and a 401K plan (all benefits are based on eligibility).
This posting is open for thirty (30) days.
Qualified applicants in San Francisco with criminal histories will be considered for employment in accordance with the San Francisco Fair Chance Ordinance.
Qualified applicants with arrest or conviction records will be considered for employment in accordance with the Los Angeles County Fair Chance Ordinance for Employers and the California Fair Chance Act.
We will consider for employment all qualified Applicants, including those with criminal histories, in a manner consistent with the requirements of applicable state and local laws, including the City of Los Angeles' Fair Chance Initiative for Hiring Ordinance.