A study of data from the Medical Cost Personal dataset from Kaggle, looking for patterns, behaviors, and information that may be useful. The final objective is creating a Linear Regression model.

JosePadillaMtnz/MedicalCost_KaggleDataset


Kaggle dataset on medical costs

A study of the Kaggle Medical Cost Personal dataset (https://www.kaggle.com/datasets/mirichoi0218/insurance), looking for patterns, behaviors, and other information that may be useful. The document is organized into sections, the main ones being the Exploratory Data Analysis (EDA) and the creation and evaluation of models.

The Exploratory Data Analysis part looks for patterns in how the variables vary and in the relationships between them, and performs feature engineering. The modeling part builds several candidate models, both from the initial dataset and from the cleaned dataset with feature engineering applied, then computes metrics and runs a brief test.
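As a rough illustration of the workflow described above (encode the variables, add an engineered feature, fit a linear regression, compute metrics), here is a minimal sketch. It is not the code from this repository. The column names (`age`, `sex`, `bmi`, `children`, `smoker`, `charges`) are those of the Kaggle `insurance.csv` file. The `smoker * bmi` interaction is a hypothetical engineered feature chosen for illustration, `region` is left out for brevity, and every function name is an assumption. The fit uses plain-Python least squares so the example has no dependencies; in practice a library such as scikit-learn would normally be used.

```python
import csv
import math


def encode(row):
    """Turn one CSV row (a dict of strings) into a numeric feature vector."""
    smoker = 1.0 if row["smoker"] == "yes" else 0.0
    bmi = float(row["bmi"])
    return [
        1.0,                                   # intercept term
        float(row["age"]),
        bmi,
        float(row["children"]),
        1.0 if row["sex"] == "male" else 0.0,  # binary encoding of sex
        smoker,                                # binary encoding of smoker
        smoker * bmi,                          # hypothetical interaction feature
    ]


def load(path):
    """Read the CSV and return (features, targets). Assumes the Kaggle columns."""
    with open(path, newline="") as f:
        rows = list(csv.DictReader(f))
    return [encode(r) for r in rows], [float(r["charges"]) for r in rows]


def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):  # back substitution
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def fit(X, y):
    """Ordinary least squares via the normal equations (X^T X) w = X^T y."""
    k = len(X[0])
    xtx = [[sum(r[i] * r[j] for r in X) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * t for r, t in zip(X, y)) for i in range(k)]
    return solve(xtx, xty)


def predict(w, X):
    """Apply the fitted weights to each feature vector."""
    return [sum(wi * xi for wi, xi in zip(w, r)) for r in X]


def metrics(y, pred):
    """Return RMSE and R^2 for a set of predictions."""
    mean = sum(y) / len(y)
    ss_res = sum((a - p) ** 2 for a, p in zip(y, pred))
    ss_tot = sum((a - mean) ** 2 for a in y)
    return {"rmse": math.sqrt(ss_res / len(y)), "r2": 1.0 - ss_res / ss_tot}
```

With a local copy of the dataset, a call such as `X, y = load("insurance.csv")` followed by `w = fit(X, y)` and `metrics(y, predict(w, X))` gives in-sample metrics. Holding out part of the rows before fitting gives a fairer estimate of how the model generalizes.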
