search | contact us


CSES Logo

CSES Module 3 Data Set Errata
Posted: March 31, 2011

Mexico (2006) and Mexico (2009) - three cases that are potentially duplicates

The Mexico (2006) election study includes 1,591 cases. The CSES Secretariat has identified that other than the respondent identification variable C1005, the following two cases are identical:

484020062004134107 and 484020062004134110

The Mexico (2009) election study includes 2,400 cases. The CSES Secretariat has identified that other than the respondent identification variable C1004, the following two sets of cases are identical:

484020090009195107 and 484020090009195109
484020090009195108 and 484020090009195110

The CSES Secretariat does not yet know whether these potentially duplicate cases are the same respondents or different respondents, and will be investigating the issue and addressing it as necessary in a future data release. In the meantime, analysts who would be interested in deleting the one potentially duplicate case from Mexico (2006) and the two potentially duplicate cases from Mexico (2009) could use the following code to do so:

SPSS code:

SELECT IF C1005 <> "484020062004134110".
SELECT IF C1005 <> "484020090009195109".
SELECT IF C1005 <> "484020090009195110".

STATA code:

drop if C1005=="484020062004134110"
drop if C1005=="484020090009195109" | C1005=="484020090009195110"

Should users decide to drop these three potentially duplicate cases, the Mexico (2006) election study should have 1,590 cases remaining and the Mexico (2009) election study should have 2,398 cases remaining.