Charlie Ornstein, ProPublica, charles.ornstein@propublica.org, @charlesornstein
Updated March 2016
Each year, the release of new datasets makes it more exciting to cover health care. No longer are we limited to comparing states to one another to look for differences. No longer are doctors’ practice patterns protected by outdated privacy rules. We have entered an era in which we can compare one doctor to another. And what we’re learning is that there are huge, seemingly unexplainable, differences among them.
This tip sheet offers both very broad data sources, as well as more granular ones. None of the data sets cover individual claims-level data, which require special permissions and often cost a lot of money.
Sign up for data updates from CMS:
CMS is the Centers for Medicare and Medicaid Services (yes, the acronym doesn’t quite work). https://dnav.cms.gov/. Click on receive email updates in the lower right. CMS’s Data Navigator is also a resource to look up data sources, but I find it overly complex to use. Socrata powers its own tool that allows you to look up all datasets available through its network.
CMS Fast Facts: This stat sheet is at the highest level, but it is the most current one the agency produces. (Updated December 2015)
Medicare and Medicaid Statistics (top level, super useful): Last released: January 2016. Released annually.
Additional summary data and charts can be found: https://www.ccwdata.org/web/guest/medicare-data and http://www.cms.gov/Research-Statistics-Data-and-Systems/Statistics-Trends-and-Reports/MedicareMedicaidStatSupp/index.html. In all cases, there is a data lag of at least a year, sometimes longer.
CMS Program Statistics: Includes summary statistics on national health care, Medicare populations, utilization, and expenditures, as well as counts for Medicare-certified institutional and non-institutional providers. CMS Program Statistics is organized into sections which can be downloaded and viewed separately. The Medicare Enrollment Dashboard is an interactive online tool presenting monthly enrollment figures and yearly trends at several geographical areas, including national, state/territory, and county. https://www.cms.gov/Research-Statistics-Data-and-Systems/Statistics-Trends-and-Reports/CMSProgramStatistics/index.html
Geographic variation: How Medicare utilization differs from state to state and from county to county. Data is available from 2007 to 2014. http://www.cms.gov/Research-Statistics-Data-and-Systems/Statistics-Trends-and-Reports/Medicare-Geographic-Variation/GV_PUF.html. This will help put other data sets in context. Last released: January 2016. Released annually.
National health expenditures: Released at least annually. Last update December 2015. https://www.cms.gov/Research-Statistics-Data-and-Systems/Statistics-Trends-and-Reports/NationalHealthExpendData/index.html
How much doctors are paid by Medicare and what they do for that money: https://www.cms.gov/Research-Statistics-Data-and-Systems/Statistics-Trends-and-Reports/Medicare-Provider-Charge-Data/Physician-and-Other-Supplier.html. ProPublica has created a nice visualization for each provider: www.propublica.org/treatment and so does the Wall Street Journal: http://graphics.wsj.com/medicare-billing/. Last released: June 2015; covers the year 2013. The site includes historical data for 2012 as well. CMS has pledged to release this data annually. (Important caveat with this data: It only contains claims for services rendered in Medicare fee-for-service, not for those in Medicare Advantage. As a result, it will leave out lots of services delivered in regions of the country where Medicare HMO enrollment is high, and thus may not be representative.)
Another wonderful thing about this dataset is that it now includes demographic data on each doctor’s patients, including breakdowns by gender; age ranges; race; dual eligible status; prevalence of common chronic conditions; and an aggregate risk score that allows you to compare how “sick” a doctor’s patients are relative to peers.
Payments to hospitals for inpatient and outpatient visits: http://www.cms.gov/Research-Statistics-Data-and-Systems/Statistics-Trends-and-Reports/Medicare-Provider-Charge-Data/index.html.
What drugs doctors prescribe most: In 2015, Medicare began publicly releasing prescribing data in its Part D program. https://www.cms.gov/Research-Statistics-Data-and-Systems/Statistics-Trends-and-Reports/Medicare-Provider-Charge-Data/Part-D-Prescriber.html The data covers 2013 and is spliced and diced in a variety of ways. ProPublica filed Freedom of Information Act requests for earlier data and has put the data online for free: https://projects.propublica.org/data-store/. You can see our app at www.propublica.org/checkup. This accounts for all prescriptions covered in Medicare Part D, but it’s important to note that not every Medicare recipient is enrolled in Part D. Also, data is redacted when a doctor prescribed fewer than 11 prescriptions for a particular drug. You can look up Medicaid utilization data as well (but this is summarized by state, not by doctor): http://www.medicaid.gov/medicaid-chip-program-information/by-topics/benefits/prescription-drugs/medicaid-drug-programs-data-and-resources.html.
Medicare drug spending dashboard: CMS released an online dashboard to look at Medicare prescription drugs for both Part B and Part D. These categories include drugs with high spending on a per user basis, high spending for the program overall, and those with high unit cost increases in recent years. (December 2015) https://www.cms.gov/Research-Statistics-Data-and-Systems/Statistics-Trends-and-Reports/Information-on-Prescription-Drugs/
Medicare opioid drug mapping tool: The tool shows geographic comparisons, at the state, county, and ZIP code levels, of opioid prescription claims in Medicare Part D. It allows you to see both the number and percentage of opioid claims at the local level. Obvious drawback: The data is from 2013 and, well, we’re now in 2016. For an evolving epidemic, this data sadly lags far behind. https://www.cms.gov/Research-Statistics-Data-and-Systems/Statistics-Trends-and-Reports/Medicare-Provider-Charge-Data/OpioidMap.html
Who doctors refer patients to: Health care data guru Fred Trotter of DocGraph has requested so-called referral data from CMS for a few years now under FOIA. The agency has posted it online in a hidden corner: https://questions.cms.gov/faq.php?faqId=7977. Again, data is redacted when the cell size is less than 11. This includes data through 2015.
Which insurance networks do doctors participate in? Under the Affordable Care Act, insurance companies participating in the Obamacare marketplaces are required to provide up-to-date information about which doctors and facilities are in their networks. Links to insurers’ directories can be found here: http://download.cms.gov/marketplace-puf/2016/machine-readable-url-puf.zip
What durable medical equipment doctors order: In October 2015, Medicare began releasing data on durable medical equipment and supplies ordered and dispensed in Medicare Part B, organized by the ordering physician. There is an aggregate files (high level) and a detail file (aggregating by type of DME ordered by each doctor). https://www.cms.gov/Research-Statistics-Data-and-Systems/Statistics-Trends-and-Reports/Medicare-Provider-Charge-Data/DME.html
Utilization of home health services: In December 2015, Medicare began releasing data on services provided to Medicare beneficiaries by home health agencies. The Home Health Agency Utilization and Payment Public Use File (Home Health Agency PUF) contains information on utilization, payments, and submitted charges organized by provider, state and home health resource group. It is organized by home health provider, not by doctor who ordered the services. https://www.cms.gov/Research-Statistics-Data-and-Systems/Statistics-Trends-and-Reports/Medicare-Provider-Charge-Data/HHA.html (Released December 2015)
Therapy services in nursing homes: The Skilled Nursing Facility Utilization and Payment Public Use File (SNF PUF) contains information on utilization, payments, and submitted charges organized by provider, state, and resource utilization group (RUG). The data include information on 15,055 skilled nursing facilities, over 2.5 million stays, and almost $27 billion in Medicare payments for 2013. https://www.cms.gov/Research-Statistics-Data-and-Systems/Statistics-Trends-and-Reports/Medicare-Provider-Charge-Data/SNF.html (Released March 2016)
Chronic conditions among Medicare beneficiaries: Tools to help you gain a better understanding of the burden of chronic conditions among Medicare fee-for-service beneficiaries, including prevalence, utilization of health services, and Medicare spending for specific chronic conditions and multiple chronic conditions. https://www.cms.gov/Research-Statistics-Data-and-Systems/Statistics-Trends-and-Reports/Chronic-Conditions/index.html (Updated January 2016)
What incentive payments doctors have received for using electronic medical records: http://www.cms.gov/Regulations-and-Guidance/Legislation/EHRIncentivePrograms/DataAndReports.html. Updated monthly.
Doctor addresses and specialties: These are part of the National Provider Identifier system, updated weekly/monthly, based on health professionals’ self-reported information. Download the file here: http://www.cms.gov/Regulations-and-Guidance/HIPAA-Administrative-Simplification/NationalProvIdentStand/DataDissemination.html (link to download is at the bottom of the page.) An open-source API for the data is available here: http://www.bloomapi.com/ (Here’s the GitHub link: https://github.com/untoldone/bloomapi). Physician Compare has additional information on doctors who participate in Medicare: https://data.medicare.gov/data/physician-compare. NPI data updated monthly. Physician Compare data last updated in December 2015.
Pharma/device company payments to doctors: Data for the last five months of 2013 and all of 2014 is available at http://www.cms.gov/openpayments/index.html. Every June, CMS must release data from the prior calendar year. ProPublica has an easy-to-use website to search the data at www.propublica.org/docdollars. ProPublica’s site also includes data from companies that were required to report this data earlier (because of corporate integrity agreements).
Quality measures: https://data.medicare.gov/ includes information about hospitals, nursing homes, doctors, dialysis facilities and home health care companies. Data includes demographic information (names, addresses, phone numbers), as well as quality and satisfaction measures. ProPublica used the emergency room quality measures to power its ER Wait Watcher app (https://projects.propublica.org/emergency/).
Hospital and nursing home inspection reports: The federal government has also started making available online its deficiency reports for health facilities. It currently does so for nursing homes (http://www.cms.gov/Medicare/Provider-Enrollment-and-Certification/CertificationandComplianc/FSQRS.html, scroll down under “related links”) and for hospitals (http://www.cms.gov/Medicare/Provider-Enrollment-and-Certification/CertificationandComplianc/Hospitals.html, scroll down to downloads). The Association of Health Care Journalists makes hospital inspection reports available in an easy-to-use interface: www.hospitalinspections.org and ProPublica does the same for nursing home reports: www.propublica.org/nursing-homes. Nursing home inspection reports are updated monthly; hospitals quarterly.
Custom calculations: Use this link to estimate Medicare cohorts for diseases, etc.: https://www.ccwdata.org/web/guest/pricing/estimate-study-size.
Accountable Care Organizations: Data from the shared savings program. https://www.cms.gov/Research-Statistics-Data-and-Systems/Downloadable-Public-Use-Files/SSPACO/Overview.html
Other health agencies
Drug side effects: Now available on the OpenFDA website, https://open.fda.gov/, along with other public use files from the agency. Check out http://www.researchae.com/ to look at how this data can be visualized.
Doctor discipline: The Health Resources and Services Administration has summary data by state and profession and year on doctor discipline, as well as malpractice payouts. http://www.npdb.hrsa.gov/resources/npdbstats/npdbStatistics.jsp.
Centers for Disease Control and Prevention: https://data.cdc.gov/ and wwwn.cdc.gov/communityhealth and http://wonder.cdc.gov/.
Agency for Healthcare Research and Quality maintains a free, on-line query system based on data from the Healthcare Cost and Utilization Project (HCUP). It provides access to health statistics and information on hospital inpatient and emergency department utilization. http://hcupnet.ahrq.gov/





