Location: Burlingame, CA, USA
*Note: A minimum of 3 years of work experience is required.
About Blueberry AI
Qualifications
Required Skills:
• Proven Experience: At least 3 years in AI/ML engineering, with a focus on NLP, LLMs, and retrieval systems.
• Technical Proficiency: Strong programming skills in Python, with experience in frameworks like TensorFlow, PyTorch, Hugging Face, or LangChain.
• LLM Expertise: Hands-on experience with large language models, including fine-tuning and prompt engineering.
• RAG Knowledge: Familiarity with retrieval-augmented generation methods and tools, such as vector databases (e.g., Pinecone, Weaviate, FAISS) and search technologies (e.g., Elasticsearch, OpenSearch).
• Data Pipelines: Experience building and optimizing data pipelines for large-scale applications.
• Cloud Computing: Proficiency with cloud platforms (AWS, GCP, Azure) for model training and deployment.
Key Responsibilities
• Develop and Optimize LLMs: Fine-tune, optimize, and deploy large language models (e.g., GPT, LLaMA) to meet specific application needs.
• Design RAG Pipelines: Build retrieval-augmented generation architectures, combining LLMs with vector databases, search engines, and custom retrievers.
• Data Engineering: Curate, preprocess, and manage large-scale datasets for model training and evaluation, ensuring high-quality data for fine-tuning and RAG workflows.
• Model Evaluation: Design and execute testing frameworks to assess performance, scalability, and accuracy of LLMs and RAG systems.
• Integration and Deployment: Collaborate with software engineers to integrate AI solutions into production systems, ensuring seamless deployment and real-time performance.
• Scalability and Optimization: Develop strategies to optimize computational efficiency, reduce latency, and enhance system reliability.
• Collaborative Problem Solving: Work closely with cross-functional teams, including product managers, data scientists, and software engineers, to align AI capabilities with business goals.
• Stay Current: Monitor and experiment with emerging research in NLP, RAG, and related fields to keep solutions cutting-edge.
Preferred Skills:
• Knowledge of distributed systems, parallel processing, and MLOps frameworks.
• Understanding of security best practices for handling sensitive data and models in production.
• Contributions to open-source projects in the LLM or RAG domains.
• Strong communication skills and the ability to explain complex concepts to non-technical stakeholders.
Why Join Us?
• Opportunity to work on cutting-edge AI technologies that redefine industry standards.
• Collaborative and innovative team culture.
• Competitive salary and benefits package.
• Flexible work environment with a focus on work-life balance.
How to Apply:
Please submit your resume, a cover letter, and links to any relevant projects or repositories to ...@Sharecreators.com