Location: Dallas,TX, USA
We are seeking a highly skilled and motivated Natural Language Processing (NLP) Data Scientist with a focus on fine-tuning Large Language Models (LLMs) to join our innovative team. The ideal candidate will have a solid background in NLP, deep learning, and machine learning, with expertise in fine-tuning LLMs to solve complex language understanding and generation tasks
Mandatory Skills: Predictive AI with Gen AI Knowledge
Experience Range: 2-3 Yrs Relevant Experience
Primary Duties & Responsibilities:
Research, design, and implement pioneering NLP algorithms with a focus on fine-tuning LLMs, including BERT, GPT, and their variants
Develop custom fine-tuning strategies and techniques to optimize performance on specific language tasks and domains. Collaborate with cross-functional teams to integrate fine-tuned LLM solutions into our products and services
Stay up-to-date with the latest advancements in NLP, LLMs, and AI technologies, particularly in fine-tuning methodologies
Analyze and interpret experimental results to guide decision-making and improve model performance
Apply creative problem solving and inter-departmental networking to address data and information needs across the enterprise.
Share proven techniques with peers through coaching and mentoring around core methodologies, patterns, standards and processes.
Drive innovation by exploring new experimentation methods and statistical techniques that could sharpen or speed up our product decision-making processes
Through extensive analysis, discover technology improvement opportunities and will work with other teams to address these opportunities.
Knowledge, Skills, Abilities
Bachelors or Masters degree in Computer Science, Data Science, Statistics, Mathematics, Operations Research, Computer Science, Econometrics or related field
6-8 years of professional experience
At least 6 years of professional data science or analytics experience
Good background in NLP, deep learning, and machine learning, with experience in fine-tuning LLMs
Proficiency in programming languages such as Python, TensorFlow, and PyTorch
Experience with popular NLP frameworks and libraries (e.g., Hugging Face Transformers, AllenNLP)
Solid understanding of linguistics, semantics, and syntactic structures
Extensive experience working with data and developing, designing and presenting analytics, including some programming experience and work with multiple databases
Knowledge of software engineering leading practices including version control and testing methodologies
Guide level knowledge of sophisticated analytical model development and construction
Comfort with questioning assumptions, demonstrable relational analysis and integration of all components needed to solve problems
Identifying data requirements for analytical needs
Familiarity with cloud computing platforms and distributed computing environments (i.e. GDP, Azure, AWS) preferred