Reimagined NLM Dataset Catalog Accelerates and Builds Trust in Biomedical Discovery. NLM Tech Bull. 2025 Jul-Aug;(465):e5
The National Library of Medicine (NLM) announces the relaunch of the
NLM Dataset Catalog, a centralized, searchable platform designed to connect researchers with a vast array of biomedical datasets from multiple repositories, thus accelerating scientific research. Powered by NLM's
DATaset Metadata Model (DATMM), researchers can now effortlessly explore over 2 million biomedical datasets in one centralized location and uncover data interconnections within the broader biomedical ecosystem.
Initially launched as a "beta" product in early 2024, feedback from NIH staff, medical librarians, and biomedical researchers informed several product enhancements now available:
- Dataset Expansion: From 80,000 to 2 million datasets, driven by the inclusion of repositories like Figshare and the Johns Hopkins and Dartmouth Dataverses, unlocking 25 times more data for discovery and reuse.
- Enhanced Scalability: AI automation has reduced dataset ingestion time from two weeks to one day.
- MeSH Enrichment: Enabling more relevant and semantically rich search results, including enriching all datasets with Medical Subject Heading (MeSH) terms using AI to further enrich the descriptive metadata for search.
- Redesigned Interface: Expanded filtering options and an updated user guide for easy navigation and search.
- Enhanced Search Functionality: Targeted index queries by title, author, funding information, or contributor names.
- Login Feature: Similar to MyNCBI, allowing users to save search history, repositories, and download datasets of interest.
- Improved Citation Features: Storing and exporting dataset citations in formats like APA.
- SPARQL Query Builder: Helping users explore connections visually through an interactive semantic graph.
The relaunch of the NLM Dataset Catalog aligns with national goals to
restore public trust in science. It lays the groundwork for the next generation of innovation that will drive biomedical discovery and improve health outcomes.
Please visit the
NLM Dataset Catalog for more information or
reach out to us if you have questions.