Overview
Black Canyon Consulting (BCC) is actively looking for an experienced Natural Language Processing (NLP) Scientist to support the National Library of Medicine (NLM) at the National Institutes of Health (NIH). This individual will create a genotype–phenotype database focused on influenza viruses by developing an NLP pipeline that includes named entity recognition, entity linking, and relationship extraction. The resulting database will be used by the influenza research community for risk assessment.
If you enjoy being part of a high-performing, professional services and technology-focused organization, please apply today!
Duties & Responsibilities:
- Design and implement an NLP pipeline that includes named entity recognition, entity linking, and relationship extraction.
- Extract genotype–phenotype relations for influenza from literature and populate the database.
- Construct and maintain the genotype–phenotype database.
- Document the approach, the pipeline, and the database structure.
Core Expertise:
- Formal education with major study in an academic field related to the medical field, health sciences, or allied sciences appropriate to the work of the position.
- Experience with NLP methods, including named entity recognition, entity linking, and relationship extraction.
- Ability to develop an NLP pipeline and produce structured outputs from literature sources.
Preferred Qualifications:
- Domain knowledge about influenza or related areas.
Contract Period
This position is currently expected to last 9 to 12 months.