Contact Us   
  Home > Background
We provide the following resources for each of the baselines for research purposes. Please note that background information on some of these resources is available from our MBR Reference Material page.

Resource Restrictions Where to Find
MBR Query Tool Database: Baseline databases 2002 forward available for searching. Includes tables with MH, SH, MH/SH combination, Chemicals, and PMID data; also can limit or filter by Date Created, Date Completed, Date Last Revised, Publication Year, and Status. License Required MBR_Query_Tool
XML Formatted Citations: XML version of baseline citations. This is the format used to export the Medline/PubMed Baseline citations. License Required MBR_Query_Tool
MEDLINE ASCII Display Formatted Citations: Each XML citation translated to MEDLINE ASCII display format used in PubMed. License Required MBR_Query_Tool
DTD Files: We save a copy of the relevant DTD (Document Type Definition) files each year for working with the Baseline XML files. No Restrictions MBR_Files
Frequency Count Files: Basic frequency counts for the entire MEDLINE/PubMed Baseline sorted into alphabetical and numerical order for the following MEDLINE fields. For all fields but the NM field, we also provide a sort and count of their occurrences as starred (Index Medicus) items.
     a. MH (MeSH Headings)
     b. SH (MeSH Subheadings)
     c. MH/SH combinations
     d. NM (Chemicals)
No Restrictions MBR_Files
Raw Data Files: Files containing the raw data similar to what was used to create our MBR Query Tool Database for this Baseline year. There is a README file describing the various files available and their layouts. No Restrictions MBR_Files
Histogram/Summary Files: File showing the number of MH terms assigned to each of the various MeSH Tree top-level and top-level + 1 categories during the latest year to see how assignment of terms vary from year to year.

File showing the number of MH terms assigned to each of the UMLS Semantic Type Groupings categories during the latest year to see how assignment of terms vary from year to year from a different perspective.
No Restrictions MBR_Files
Related MeSH Files: We save a copy of the MeSH Vocabulary data files for each year and a copy of their associated DTD (Document Type Definition) files for working with the Baseline XML files. Memorandum of Understanding required MBR_Files
UMLS Semantic Groups File: We have saved a copy of the Semantic Groups file. The Semantic Groups are a coarse-grained set of semantic type groupings designed to reduce the complexity in the UMLS Metathesaurus. The 15 semantic groups provide a partition of the UMLS Metathesaurus for 99.5% of the concepts. No Restrictions MBR_Files
Unique Words from Medline Baseline: We use a very simplified idea of a word -- we throw away anything with all numbers, throw away anything with non-ascii characters, and break at anything that is not alphanumeric. The "words" files contains single words and bigram words. The bigram words are made up of a sliding window using the last "valid" word and the current word - so you get something like "last current" where we simply added a space. We also ignore a short (313) list of stop words, so they are not included in the various lists. Each of the "words" files also contains a frequency count for each item. Also, please note that we only look at the Title and Abstract fields to generate our list of words - we have ignored the MeSH Heading fields. No Restrictions MBR_Files

Copyright, Privacy, Accessibility, Viewers and Players,
Freedom of Information Act, Contact Us
Last Modified: December 31, 2015   
link to https://www.usa.gov/ - image is USA.gov logo link to https://www.hhs.gov - image is HHS.gov logo link to https://www.nih.gov - image is NIH.gov logo link to https://www.nlm.nih.gov - image spells out U.S. National Library of Medicine