Consider the following table (Table 1): Marital status Sex Hours divorced divorced divorced married married single M M F
Posted: Fri May 20, 2022 12:39 pm
status, the sex, the working hours of
individuals, and whether they suffer from hypertension. Assume
attributes Marital status, Sex, and
Hours, these three attributes jointly constitute the
quasi-identifier. Note that the representation has
been simplified by collapsing tuples with the same
quasi-identifying values into a single tuple. The
numbers at the right hand side of the table report, for each tuple,
the number of actual occurrences,
also specifying how many of these occurrences have values Y and N,
respectively, for attribute
Hypertension.
For example, if we query with “Single, M, 40”, we get 26 results,
where 6 of them with Y for the
attribute Hypertension, and 20 with N.
Question a: What is the k value if we say this table enforces
k-anonymity? Why?
Question b: Suppose we perform generalization of the attributes on
Marital status and Sex
according to the scheme below:
Consider the following table (Table 1): Marital status Sex Hours divorced divorced divorced married married single M M F M F M 35 40 35 35 50 40 #tuples (Hyp. values) 2 (OY, 2N) 17 (16Y, IN) 2 (ΟΥ, 2N) 10 (8Y, 2N) 9 (2Y, 7N) 26 (6Y, 20N)
any.marital status any sex been married never_married F M married divorced single