Generative AI Data Scientist
Company: Society of Exploration Geophysicists
Location: Baltimore
Posted on: October 17, 2024
Job Description:
!!!! You must be located within 2 hours of the Baltimore area
for this position or willing to relocate within 30
days!!!!!Description:We are seeking a Python GenAI Data Scientist
with expertise in developing, fine-tuning, and integrating AI
models, particularly in natural language processing (NLP). This
role focuses on building generative AI solutions for summarization
tasks and other NLP applications, while incorporating prompt
engineering and human-in-the-loop feedback to optimize AI outputs.
The ideal candidate will possess demonstrated prior experience
analyzing unstructured medical records, developing AI models for
extracting insights, and incorporating human-in-the-loop feedback
to improve model performance. You will collaborate closely with
data scientists, software engineers, and other stakeholders to
integrate AI models into production environments within cloud
infrastructure.Required Qualifications & Experience:
- 5+ years of experience in AI/ML development with a strong focus
on NLP and generative models, using frameworks such as TensorFlow,
PyTorch, and Hugging Face
- Expertise in Python, with experience in libraries like
Transformers, NLTK, SpaCy, Gensim, and data manipulation tools such
as Pandas and NumPy
- Implement dynamic prompt engineering strategies to optimize
model outputs (1-2 years preferred)
- Familiarity with generative AI models such as OpenAI's GPT,
Llama, and supporting libraries like VLLM
- Strong analytical skills and experience with statistical
modeling and data analysis
- Ability to effectively articulate technical challenges and
solutions
- Strong communicator with excellent written and verbal
communication skills
- Identify and analyze user requirements to generate stories and
tasks for team backlog
- Prioritize and execute tasks throughout the software
development life cycle
- Create custom NLP algorithms and annotators to evaluate medical
record data
- Create custom tools to enable analysts to perform data
research
- Solid understanding of statistical modeling, data analysis, and
performance evaluation metrics.
- Demonstrated experience analyzing and processing unstructured
clinical data (e.g., electronic health records, physician notes,
imaging reports), using techniques such as tokenization,
lemmatization, and word embeddings (e.g., TF-IDF, BERT)
- Familiarity with healthcare data formats and standards such as
HL7, FHIR, ICD codes, and SNOMED
- Experience with cloud platforms (AWS, Azure), containerization
(Docker), and using CI/CD pipelines for machine learning model
deployment
- Knowledge of SQL (PostgreSQL, MySQL) and NoSQL (MongoDB,
Elasticsearch) databases, and how to structure data pipelines for
efficient data processing
- Develop and fine-tune AI models for natural language processing
(NLP) tasks, including Named Entity Recognition (NER), text
classification, summarization, and sentiment analysis, particularly
with unstructured clinical records
- Conduct experiments to evaluate model performance, utilizing
metrics such as precision, recall, and F1-score to iteratively
improve models through hyperparameter tuning and training
optimizations
- Experience integrating AI models into production environments,
collaborating with software engineers and using cloud platforms
like AWS to ensure scalability and performance
- Analyze and preprocess large datasets, particularly
unstructured medical records (e.g., physician notes, discharge
summaries), using tools like Pandas, NLTK, and SpaCy
- Stay updated with the latest research and advancements in AI
and NLP, applying state-of-the-art techniques such as transfer
learning, attention mechanisms, and fine-tuning pre-trained models
to healthcare-specific challenges
- Master's degree (Data Science, AI, Computer Science, or a
related field) + 10 years experience; or PhD + 4 yearsPreferred
Qualifications:
- Experience in healthcare, particularly working with
unstructured medical records in clinical settings, leveraging NLP
models for insight extraction.
- Experience working with human-in-the-loop systems,
incorporating clinician/end-user feedback and leveraging tools like
SciPy and NumPy to improve AI model accuracy
- Educational background or practical training in a clinical
setting, with exposure to clinical workflows and medical
terminologies
- Familiarity with deep learning techniques, attention
mechanisms, and transformers applied to healthcare dataEducation:BS
RequiredWork Authorization: US Citizen, Green Card, H1-B
VisaBackground Check/Public Trust Clearance:Active clearance is NOT
required but candidate must be able to obtain and maintain a US
Public Trust clearanceHours: You must be located in the US and be
able to work US East Coast HoursAbout Us:Interactive Consulting
Services, Inc.'s (ICS) mission is to match elite talent with
cutting-edge technologies and projects.Interactive Consulting
Services (ICS) is a small woman-owned business headquartered in
Jarrettsville, MD, and has been providing IT and talent services
for over 20 years. ICS has a proven track record in both the public
and private sectors, providing talent and solutions in fields
ranging from healthcare, education, and finance, to social
services, defense, and aviation.
#J-18808-Ljbffr
Keywords: Society of Exploration Geophysicists, Olney , Generative AI Data Scientist, Other , Baltimore, Maryland
Didn't find what you're looking for? Search again!
Loading more jobs...