Question

Problem 2. Figure 1 shows medical records from a fictitious hospital. ID QI SV Zip Code...

Problem 2.

Figure 1 shows medical records from a fictitious hospital.

ID

QI

SV

Zip Code

Age

Nationality

Condition

1

13053

28

Russian

Heart Disease

2

13053

36

Japanese

Flu

3

13068

35

American

Cancer

4

13068

29

American

Heart Disease

5

13068

21

Japanese

Viral Infection

6

14850

46

Indian

Flu

7

13053

31

American

Cancer

8

13053

23

American

Viral Infection

9

14853

50

Indian

Cancer

10

14853

55

Russian

Heart Disease

11

14850

47

American

Viral Infection

12

14850

49

American

Viral Infection

13

13053

37

Indian

Cancer

14

13068

36

Japanese

Cancer

15

13068

38

Russian

Flu

Figure 1: Inpatient microdata

Now you are asked to anonymize the above table according to the following requirements. Note that when you do the anonymization, you should try to generalize the QI with minimum changes (i.e., the changed values should be as accurate as possible).

  1. GeneralizetheQIvaluesofthetablefor2-anonymity.
  2. GeneralizetheQIvaluesofthetablefor4-anonymity.
  3. Generalize the QI values of the table for3-diversity.
  4. Instead of generalize QI values, now you are asked to use the anatomy method to develop the 3-diversity.
0 0
Add a comment Improve this question Transcribed image text
Answer #1

SV zip code 130 ** 130 ** ☆ ☆ odlo of A - Medical Record from a flotitions hospital. QI Age Nationality condition <30 Heart D Explanation:-Figure 2 shows a 4-anonymous table derived from the table in Figure 1 (here “*” denotes a suppressed value so, for example, “zip code = 1485*” means that the zip code is in the range [14850−14859] and “age=3*” means the age is in the range [30 − 39]). Note that in the 4-anonymous table, each tuple has the same values for the quasi-identifier as at least three other tuples in the table. Because of its conceptual simplicity, k-anonymity has been widely discussed as a viable definition of privacy in data publishing, and due to algorithmic advances in creating k-anonymous versions of a data set [3, 6, 16, 18, 21, 24, 25], k-anonymity has grown in popularity. However, does k-anonymity really guarantee privacy? In the next section, we will show that the answer to this question is interestingly no.ô * * Medical records from a fietitions hospital Sv zip code Nationality condition. 130 * Heart Disease 130 ** fly. 130&* can

consider the inpatient records shown in Figure 1. We present a 3-diverse version of the table in Figure 3. Comparing it with the 4-anonymous table in Figure 2 we see that the attacks against the 4-anonymous table are prevented by the 3-diverse table. For example, Alice cannot infer from the 3-diverse table that Bob (a 31 year old American from zip code 13053) has cancer. Even though Umeko (a 21 year old Japanese from zip code 13068) is extremely unlikely to have heart disease, Alice is still unsure whether Umeko has a viral infection or cancer. The ℓ-diversity principle advocates ensuring ℓ “well represented” values for the sensitive attribute in every q ⋆ -block, but does not clearly state what “well represented” means. Note that we called it a “principle” instead of a theorem — we will use it to give two concrete instantiations of the ℓ-diversity principle and discuss their relative trade-offs

Add a comment
Know the answer?
Add Answer to:
Problem 2. Figure 1 shows medical records from a fictitious hospital. ID QI SV Zip Code...
Your Answer:

Post as a guest

Your Name:

What's your source?

Earn Coins

Coins can be redeemed for fabulous gifts.

Not the answer you're looking for? Ask your own homework help question. Our experts will answer your question WITHIN MINUTES for Free.
Similar Homework Help Questions
ADVERTISEMENT
Free Homework Help App
Download From Google Play
Scan Your Homework
to Get Instant Free Answers
Need Online Homework Help?
Ask a Question
Get Answers For Free
Most questions answered within 3 hours.
ADVERTISEMENT
ADVERTISEMENT