Stratified sampling on a modal variable.
stratified sampling on a modal variable.
Here is an example: Let’s assume that:
•…we want to randomly select 10% of the input rows.
•…we want to “stratify” on the column “education”: i.e. we must ensure that the distribution of the variable “education” stays exactly the same inside our sample than inside our original population.