AIToolboard - Discover 1,300+ Best AI Tools Directory 2026

About The Role The role bridges state-of-the-art language model research and the constraints of real production systems - latency, cost, interpretability, and compliance. You will work directly with research teams to fine-tune, evaluate, and deploy NLP models, and with engineering teams to make those models work reliably in client environments. Key Responsibilities • Fine-tune and evaluate transformer-based models (BERT, T5, LLaMA, Mistral) for named entity recognition, relation extraction, classification, and text generation tasks • Build systematic evaluation frameworks: benchmark datasets, human-evaluation pipelines, and automated regression suites for NLP model quality • Develop preprocessing pipelines for unstructured text - legal documents, clinical notes, financial filings - including OCR post-processing and entity normalization • Collaborate with product engineers to deploy NLP models within latency and cost constraints; implement distillation, quantization, and caching strategies as needed • Stay current on NLP/LLM research literature and evaluate its relevance to active client problems; bring relevant advances into production-ready implementations • Document model architecture, evaluation results, and known limitations clearly for both technical teammates and client-facing stakeholders • Participate in literature reviews, internal research discussions, and occasional external publications or conference presentations What We Are Looking For • 1–4 years of applied NLP or ML engineering experience, with hands-on model development beyond prompt engineering • Deep familiarity with Hugging Face Transformers, tokenizers, and the broader transformers ecosystem • Strong Python; experience processing and cleaning real-world unstructured text data at scale • Understanding of core NLP concepts: tokenization, embeddings, attention mechanisms, sequence labeling, span extraction • MS or PhD in Computer Science, Computational Linguistics, Statistics, or a related field strongly preferred • Ability to read and implement published NLP research papers independently • Bonus: experience with spaCy, Prodigy annotation tools, LLM-based NER, or RLHF methodologies Location Pittsburgh, PA (Carnegie Mellon corridor) • New York City • Boston • San Francisco • Seattle • Remote strongly considered

NLP Research Engineer

About the Role

Apply for this Position

About the Company

Job Details