Skip to main content
Huajin Wang -

Huajin Wang

Senior Librarian

Huajin Wang leads innovative initiatives that help to create a culture change towards a more open and reproducible research landscape.


Expertise

Topics:  Data Collaboration, Open Science, Biomedical Data, AI-Readiness of Research Data

Industries: Library and Information Management, Education/Learning

Huajin Wang is a Senior Librarian and co-director for the Open Science & Data Collaborations program at Carnegie Mellon University Libraries. As a director for the Open Science & Data Collaborations program, she leads innovative initiatives that help to create a culture change towards a more open and reproducible research landscape through tools, training, community building and collaboration across disciplinary boundaries. As a researcher, she has led many successful research projects and collaborated with biologists, clinicians, information professionals and data scientists on interdisciplinary research topics. Her current research interest is on open science methodology and assessments, AI-readiness of research data and secondary reuse of biomedical data. She is the chair and co-PI for the NSF-funded Artificial Intelligence for Data Discovery and Reuse (AIDR) conference and co-chairs the annual Open Science Symposium.

Media Experience

Libraries Convene Community of Scholars to Tackle Data Challenges  — Carnegie Mellon University
"With the recent advances in machine learning and AI, it is possible to train computers to find optimal solutions to a problem, such as integrating different datasets and extracting metadata," said Huajin Wang, a CMU librarian and conference chair. "We created AIDR 2019 because it's about time that people working in a variety of disciplines come together to benefit from diverse expertise, and address these mutual challenges together, using the power of AI."

Education

(non-degree program), Machine Learning, Carnegie Mellon University
B.S., Microbiology, Shandong University
Ph.D., Cell Biology, University of Alberta

Spotlights

Links

Event Appearances

An End-to-end Open Science and Data Collaborations Program
(2021) Coalition for Networked Information (CNI), Virtual
January 1, 2025

The rising importance of open science and open data
(2021) Mid-Atlatic Chapter of the Medical Library Association Annual Meeting, Virtual
January 1, 2025

Data Discovery and Reuse: AI Solutions & the Human Factor
(2020) National Information Standards Organization (NISO) Plus Conference, Baltimore, MD
January 1, 2025

Building Community and Support for Open Science at Carnegie Mellon University
(2018) Coalition for Networked Information (CNI) Fall 2018 Membership Meeting, Washington, DC
January 1, 2025

AI for Data Reuse - Tools, Challenges, and Opportunities
(2019) Reproducibility and Data Reuse in Life Science, SciLifeLab Data Centre, Uppsala, Sweden
January 1, 2025

Articles

Implementation and assessment of an end-to-end Open Science & Data Collaborations program  —  F1000Research

Partitioning of MLX-family transcription factors to lipid droplets regulates metabolic gene expression  —  Molecular Cell

The Evolution of Information Literacy Outcomes in Interdisciplinary Undergraduate Science Courses  —  Issues in Science and Technology Librarianship

Functional contribution of the spastic paraplegia-related triglyceride hydrolase DDHD2 to the formation and content of lipid droplets  —  Biochemistry

Seipin is required for converting nascent to mature lipid droplets  —  Elife

Videos