This repository contains notebooks for the processing, analysis, and machine learning work applied to the speed-dating dataset.
This investigation was conducted as the final research project of the Data Intelligence in Management Diploma Program, taught at Facultad de Ciencias Económicas Uncuyo. All data used in this project was gathered from a speed-dating experiment led by researchers at Columbia University. The dataset and its description can be found at the following link: Speed Dating Dataset.
Two papers based on the collected data were originally published: "Gender Differences in Mate Selection: Evidence from a Speed Dating Experiment" and "Racial Preferences in Dating". Our analysis takes a different approach, focused more on the data mining side, and compares our results with theirs.
Within the repository, you will find sections covering the main steps of the study:
- Outlier Analysis
- Hypothesis Testing
- Machine Learning Applications
Although our research has concluded, this repository is still a work in progress, and we hope to update it shortly. For more information, feel free to contact the collaborators.