N4 Solutions, LLC offers public and private sector clients the collective knowledge, expertise, and experience of our company and associates to help them evaluate, collaborate, innovate, and create business, technology, and scientific solutions that enable them to accomplish their mission and achieve organizational objectives. As the company grows, we are seeking a highly motivated, hard-working data analysis professional to join our team. The successful candidate will work in a fast-paced and rewarding environment.
JOB DESCRIPTION/DUTIES:
The Data Analyst will provide analytical and technical support services to the National Institute of General Medical Sciences (NIGMS), Division of Data Integration, Modeling, and Analytics (DIMA). The ideal candidate will design, develop, and implement machine learning, natural language processing, artificial intelligence, large language model, and generative AI solutions to improve internal grant and portfolio management processes. This role requires strong analytical, technical, communication, and problem-solving skills, as well as experience developing reproducible data pipelines, models, visualizations, reports, and decision-support products.
· Design, develop, and implement machine learning (ML), natural language processing (NLP), and artificial intelligence (AI) solutions to improve internal grant and portfolio management processes.
· Develop and maintain algorithms using NLP and ML methods, including transformer models, topic modeling, clustering, document embeddings, and predictive analytics techniques.
· Build and optimize supervised and unsupervised machine learning models to characterize scientific portfolios and assist with grant categorization and decision-making efforts.
· Develop and maintain code and data pipelines using Python, R, SQL, and related tools to collect, process, clean, and analyze structured and unstructured data.
· Extract and integrate information from multiple data sources, including internal databases, APIs, and publicly available datasets.
· Develop large language model (LLM) and generative AI applications to support operational initiatives and improve analytical workflows.
· Create dashboards, visualizations, reports, presentations, and other communication products to support scientific and administrative decision-making.
· Support reviewer matching and grant portfolio analyses through advanced data science methodologies and predictive modeling techniques.
· Document methodologies, workflows, code repositories, and analytical processes to ensure reproducibility and facilitate knowledge transfer among staff members.
· Present findings and recommendations to scientific staff, leadership, and s