This GGS1_Kazakh_Belarus_readme.txt file was generated on 23/1/2021 by Tom Gaut

===================
GENERAL INFORMATION
===================

Title: Combining the Kazakhstan data with the GGS1 and Belarus datasets


Contacts: ggp@nidi.nl

==========================
SHARING/ACCESS INFORMATION
==========================

Citation for the data: 

This paper uses data from the GGS Wave 1 (DOI: 10.17026/dans-z5z-xn8g), 
see Gauthier, A. H. et al. (2018) or visit the GGP website (https://www.ggp-i.org/) 
for methodological details.



NOTE: This syntax builds on the existing syntax for combining the GGS Wave 1 and
	GGS2020 datasets, created by Vytenis Juozas Deimantas (December 2019). Please
	make sure that this work is also cited if using this syntax.

Citation for Vytenis Juozas Deimantas syntax:

Deimantas, V. J. (2019). User-written Stata syntax to combine GGS Wave 1 and 
GGS2020 Datasets. Retrieved from: https://www.ggp-i.org/data/methodology/


Citation for this syntax:

Gaut, T.J. (2020). User-written Stata syntax to add the GGS2020 dataset for Kazakhstan
to Belarus and the GGS Wave 1. Retrieved from: https://www.ggp-i.org/data/methodology/



Links to other publicly accessible locations of the data:

https://www.ggp-i.org/form/accounts/login/?next=/form/


=============
FILE OVERVIEW
=============

File list:

i. GGS1_Kazakh_Belarus_master.do


This file uses the existing syntax created by Vytenis Juozas Deimantas (December 
2019) to combine the GGS Wave 1 and GGS2020 datasets. Before starting with the 
preparation for this file, please ensure that the following files have been
downloaded and placed in the relevant directory alongside the .dta files:

	i.   GGS1-GGS2020_BY_master.do
	ii.  1_GGS1_cleaning.do
	iii. 2_GGS2020_BY_preparation_appending_to_GGS1.do

Please see the "GGS1-GGS2020_readme.txt" file for notes on the variable labels and
on which variables are suitable for comparison across both datasets.



How to use:

	1. In the GGS1-GGS2020_BY_master.do file, edit the command
		"global data C:\AlignmentWave1Belarus" at line 6 to point to the directory 
		where your .dta and .do files are stored (e.g. "global data D:\GGP_datasets").

	2. In the file 2_GGS2020_BY_preparation_appending_to_GGS1.do, change line 15 from
		'use "$data\\GGP2020_WAVE1_BLR_V_1_0.dta"' to 'use "$data\\data_kazak_belarus.dta"'.
		This matters because the do-file will then clean the combined Kazakhstan and
		Belarus dataset, instead of just the Belarus dataset.

	3. Run the do-file "GGS1_Kazakh_Belarus_master.do".
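
The three steps above amount to the following edits, shown here as a minimal sketch.
The lines belong to different do-files and are not meant to be run together as one
script. "D:\GGP_datasets" is only the example directory from step 1, and the sketch
assumes GGS1_Kazakh_Belarus_master.do is stored in that same directory.

```stata
* Step 1: in GGS1-GGS2020_BY_master.do (line 6), point the global at your own directory.
* "D:\GGP_datasets" is the example path from the instructions; replace it with yours.
global data "D:\GGP_datasets"

* Step 2: in 2_GGS2020_BY_preparation_appending_to_GGS1.do (line 15), load the
* combined Kazakhstan and Belarus file instead of the Belarus-only file.
use "$data\\data_kazak_belarus.dta"

* Step 3: run the master do-file for this syntax (assumed to be in the same directory).
do "$data\\GGS1_Kazakh_Belarus_master.do"
```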



Belarus and Kazakhstan dataset differences:
	
	Fertility intentions:
	- The variable a615 is not in Belarus, but the same question is in the GGS1 dataset. The syntax 
	  recodes the variable in Kazakhstan to match that of the GGS Wave 1. Please note that the variable
	  in Kazakhstan also includes the answer "unsure".

	Economic wellbeing:
	- The variables a537_* and a538_* in the Belarus dataset are reversed in the Kazakhstan 
	  dataset (e.g. a538_1 in Belarus is the same question as a537_1 in Kazakhstan). This syntax
	  has accounted for this difference in the combined dataset "data_kazak_belarus.dta", although
	  these variables do not appear in the GGS Wave 1 dataset.

	Values:
	- The variables a1104a, a1104c, a1104e, a1104h, a1104g, a1104f and a1104i from the Kazakhstan
	  dataset do not appear in the Belarus dataset, and have been renamed to merge with their 
	  counterparts in the GGS Wave 1.
	- The variables a1104k and a1104l on Values in the Kazakhstan dataset do not appear 
	  in either the GGS Wave 1 or Belarus datasets and so have been removed.
	- The questions a1104b, a1104d and a1104j appear in Kazakhstan and correspond to questions in 
	  the Australian survey. These have also been removed.
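
As an illustration of the renaming and removal described above, the sketch below swaps
one a537_*/a538_* pair and drops the Values items that were removed. It runs on a small
made-up dataset rather than on the GGS files, and it is not the code used in
GGS1_Kazakh_Belarus_master.do: the values 1 and 2 are placeholders, and only the
suffix _1 mentioned above is shown.

```stata
* Toy data standing in for the Kazakhstan file (all values are placeholders).
clear
set obs 3
generate a537_1 = 1
generate a538_1 = 2
foreach v in a1104b a1104d a1104j a1104k a1104l {
    generate `v' = 0
}

* Swap the two names so that a537_1 and a538_1 trade places.
rename (a537_1 a538_1) (a538_1 a537_1)

* Drop the Values items that are removed from the combined dataset.
drop a1104b a1104d a1104j a1104k a1104l

* Check the result: the contents have traded names and the dropped items are gone.
assert a537_1 == 2
assert a538_1 == 1
capture confirm variable a1104k
assert _rc != 0
```

The group form of rename is used here because renaming the two variables one after
the other would need a temporary name to avoid a clash.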


NOTE: The syntax does not harmonize the Life Histories. Users interested in analyzing fertility and partnership
histories are encouraged to use the Harmonized Histories dataset.

==========================
METHODOLOGICAL INFORMATION
==========================

Software: Stata SE 14 (64-bit)

