Skip to main content

Is De Identified Data Considered PHI?

by
Last updated on 5 min read

De-identified data is not considered protected health information (PHI) under HIPAA once it meets the Privacy Rule’s removal standards.

Is de-identified data covered by HIPAA?

De-identified data is not covered by HIPAA because it no longer contains individually identifiable information.

HIPAA’s Privacy Rule is pretty clear on this: when you strip out the 18 specific identifiers, what’s left isn’t PHI anymore. You can use or share it freely. Think of a dataset where ages are grouped into ranges and ZIP codes are gone entirely—that’s outside HIPAA’s reach. Still, always double-check with your privacy officer to make sure you’ve dotted all the i’s.

Is de-identified data confidential?

De-identified data is not confidential under HIPAA because it cannot be linked back to an individual.

Confidentiality rules usually apply when there’s even a chance someone could be identified—like in coded datasets where re-identification is possible. De-identified data, by definition, can’t be traced back to a person, so HIPAA doesn’t treat it as confidential health info. That said, you might still have ethical or contractual reasons to protect it.

What is considered de-identified data?

De-identified data is health information stripped of 18 specific identifiers, including names, geographic subdivisions, dates, and contact information.

The HIPAA Privacy Rule (45 CFR § 164.514) spells out exactly what counts. For example, swapping out a full birth date for just the birth year and replacing a full address with a three-digit ZIP code would qualify. Just remember to document the method you used—it’s always good practice.

What is de-identified data HIPAA?

Under HIPAA, de-identified data is health information from which all 18 identifiers have been removed, including relatives, employers, and biometric data.

HIPAA’s “safe harbor” method is pretty strict—it requires ditching things like phone numbers, email addresses, Social Security numbers, and full-face photos. The result? Data that can’t be re-identified without serious effort. That’s why organizations love using it for research or public health reporting without HIPAA getting in the way.

Is patient name alone considered PHI?

Yes, a patient name alone is considered PHI if it is associated with a healthcare context or record.

HIPAA (45 CFR § 160.103) makes this crystal clear: any individually identifiable health info—even just a name—is PHI when it’s tied to health details. So if someone’s first name pops up next to a diagnosis in a public setting, that’s PHI in HIPAA’s eyes.

Do you need a BAA for de-identified data?

No, you do not need a Business Associate Agreement (BAA) for de-identified data because it is not covered by HIPAA.

A BAA is only for when you’re sharing PHI with a third party that’ll handle it on your behalf. Since de-identified data isn’t PHI, you can share it freely—just make sure the de-identification process was done right and documented.

Is Data masking the same as Anonymization?

No, data masking is not the same as anonymization—masking hides data while anonymization removes identifiers completely.

Data masking (like turning a credit card number into **** **** **** 1234) keeps the original data hidden from prying eyes but doesn’t destroy it. Anonymization, on the other hand, wipes out identifying info for good so the data can’t be linked back to anyone. Both protect privacy, but anonymization is what HIPAA calls “de-identified.”

What is the difference between a limited data set and de-identified data?

Both are types of PHI with reduced identifiers, but a limited data set may still contain some geographic and date elements.

CharacteristicDe-identified DataLimited Data Set
Geographic identifiersRemoved entirelyMay include city, state, and 3-digit ZIP code
DatesRemoved entirelyMay include year and date ranges (e.g., "2023–2025")
HIPAA requirementsNo restrictionsRequires a data use agreement (DUA)
Re-identification riskNegligibleLow, but not zero

Is coded data de-identified?

Coded data may or may not be de-identified—it depends on whether the code can be linked back to an individual.

Coded data replaces identifying info with a key or algorithm—like turning patient names into unique ID numbers. If that key is destroyed or locked away, the data becomes de-identified. But if the key still exists and could be used to re-identify people, then it’s still PHI and needs protection.

What is de-identified data used for?

De-identified data is used for research, public health reporting, quality improvement, and machine learning without HIPAA restrictions.

Hospitals use it to study disease trends, insurers analyze treatment outcomes, and AI developers train models—all without privacy headaches. Just make sure the de-identification method follows HIPAA’s safe harbor or expert determination rules.

How do I identify de-identified data?

De-identified data cannot be linked to an individual using any reasonable means, including combining datasets.

Check that all 18 HIPAA identifiers are gone or generalized—like replacing exact birth dates with age ranges. You can also use statistical methods like k-anonymity to test re-identification risk. For example, grouping ages over 89 into “89+” makes it much harder to pinpoint someone.

What are some examples of PHI?

Examples of PHI include names, addresses, dates, telephone numbers, email addresses, and biometric identifiers.

PHI also covers Social Security numbers, medical record numbers, health plan beneficiary numbers, account numbers, certificate/license numbers, vehicle identifiers, device identifiers, web URLs, IP addresses, fingerprints, and full-face photos. Even details like a patient’s job or employer can count as PHI if they’re tied to health info.

When a patient wants a copy of their PHI?

You must provide access to PHI within 30 days of receiving a patient’s request under HIPAA.

The request can be written, oral, or electronic. You can ask for a 30-day extension for “unusual circumstances,” but you’ve got to tell the patient why and when they’ll get it within that first month. Copy fees are limited to reasonable costs for labor and supplies—not the time it takes to dig up the records.

Can I share de-identified data?

Yes, you can share de-identified data freely as it is not subject to HIPAA restrictions.

That means researchers, public health agencies, or even commercial partners can use it for drug development or analytics. Just be careful—poor de-identification can lead to accidental re-identification. For example, releasing ZIP codes for areas with fewer than 20,000 people is risky because they can be combined with other data to uncover identities.

Is name and address considered PHI?

Yes, a name combined with any address smaller than a state is considered PHI under HIPAA.

So a patient’s name with a street address, city, or ZIP code is PHI. Even a name paired with a state can count if it’s linked to health details. The only exception? A state or territory name (like “California”) is too broad to identify someone. And watch out for ages over 89—HIPAA treats those as identifiable because they’re so unique.

Edited and fact-checked by the TechFactsHub editorial team.
David Okonkwo

David Okonkwo holds a PhD in Computer Science and has been reviewing tech products and research tools for over 8 years. He's the person his entire department calls when their software breaks, and he's surprisingly okay with that.