Preparing Data for Machine Learning (ETL)

End-to-End Machine Learning: Titanic Survival Prediction

1 min read

Published Nov 18 2025

Keras, Machine Learning, Matplotlib, NumPy, Pandas, Python, scikit-learn, SciPy, Seaborn, TensorFlow

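Before training, most machine learning models need the data to meet a few requirements:

No missing values
All numeric inputs
Encoded categorical variables
A train/test split, so the model is evaluated on rows it has not seen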
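As a rough sketch of how these requirements might be checked, the snippet below counts missing values and lists the non-numeric columns. The `toy` frame is a hypothetical stand-in with a few Titanic-like columns, not the real dataset.

```python
import numpy as np
import pandas as pd

# Hypothetical toy frame standing in for the Titanic data (not the real dataset).
toy = pd.DataFrame({
    "age": [22.0, np.nan, 35.0, 54.0],
    "fare": [7.25, 71.28, 8.05, np.nan],
    "sex": ["male", "female", "female", "male"],
    "embarked": ["S", "C", None, "S"],
    "survived": [0, 1, 1, 0],
})

# Count missing values per column; anything above zero needs imputation.
missing_counts = toy.isna().sum()

# Columns that are not numeric still need to be encoded.
non_numeric = toy.select_dtypes(exclude="number").columns.tolist()

print(missing_counts[missing_counts > 0])
print(non_numeric)
```

Running the same two checks on the real data before and after preprocessing is a quick way to confirm nothing was missed.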

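We define the feature list and the target, then fill the missing values in "age", "fare", and "embarked":

features = [
    "pclass", "sex", "age", "sibsp", "parch",
    "fare", "embarked", "class", "who", "alone"
]
target = "survived"

# Assumes `titanic` is an already-loaded DataFrame containing these columns.
data = titanic[features + [target]].copy()

# Numeric columns: fill gaps with the column median.
data["age"] = data["age"].fillna(data["age"].median())
data["fare"] = data["fare"].fillna(data["fare"].median())

# Categorical column: fill gaps with the most frequent value.
data["embarked"] = data["embarked"].fillna(data["embarked"].mode()[0])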
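To make the fill logic concrete, here is a small worked example on a hypothetical two-column frame (the values are made up for illustration). It shows that `median()` skips missing entries and that `mode()` returns a Series, which is why the first element is taken.

```python
import numpy as np
import pandas as pd

# Hypothetical toy data with one gap in each column.
toy = pd.DataFrame({
    "age": [20.0, np.nan, 30.0, 40.0],
    "embarked": ["S", "C", None, "S"],
})

# median() ignores NaN, so this is the median of 20, 30, 40.
age_median = toy["age"].median()

# mode() returns a Series (ties are possible); [0] takes the first entry.
embarked_mode = toy["embarked"].mode()[0]

toy["age"] = toy["age"].fillna(age_median)
toy["embarked"] = toy["embarked"].fillna(embarked_mode)

print(age_median, embarked_mode)   # 30.0 S
print(toy.isna().sum().sum())      # 0
```

The median is often preferred over the mean for columns like fare because it is less affected by a few very large values.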

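Split the data into training and test sets:

from sklearn.model_selection import train_test_split

# Hold out 20% of the rows for testing; stratify keeps the survival
# ratio the same in both sets, and random_state makes the split repeatable.
X_train, X_test, y_train, y_test = train_test_split(
    data[features], data[target],
    test_size=0.2, random_state=42, stratify=data[target]
)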
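The effect of `stratify` can be checked with a short sketch. The frame below is hypothetical (100 rows, 40% positive) and only illustrates that both sets end up with the same class ratio.

```python
import pandas as pd
from sklearn.model_selection import train_test_split

# Hypothetical toy data: 100 rows, 40 positives and 60 negatives.
toy = pd.DataFrame({
    "x": range(100),
    "survived": [1] * 40 + [0] * 60,
})

X_train, X_test, y_train, y_test = train_test_split(
    toy[["x"]], toy["survived"],
    test_size=0.2, random_state=42, stratify=toy["survived"],
)

# Sizes of the two sets and the share of positives in each.
print(len(X_train), len(X_test))
print(y_train.mean(), y_test.mean())
```

Without `stratify`, a small test set can by chance contain noticeably more or fewer survivors than the training set, which makes test scores harder to interpret.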

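Define the numeric and categorical transformers:

from sklearn.compose import ColumnTransformer
from sklearn.preprocessing import OneHotEncoder, StandardScaler

numeric_features = ["age", "sibsp", "parch", "fare", "pclass"]
categorical_features = ["sex", "embarked", "class", "who", "alone"]

# Scale numeric columns to zero mean and unit variance; one-hot encode
# categorical columns, ignoring categories not seen during fitting.
preprocessor = ColumnTransformer(
    transformers=[
        ("num", StandardScaler(), numeric_features),
        ("cat", OneHotEncoder(handle_unknown="ignore"), categorical_features)
    ]
)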
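A minimal sketch of how such a preprocessor might be used: fit it on the training rows only, then apply the same fitted transformation to the test rows. The `train` and `test` frames are hypothetical, and `sparse_threshold=0` is added here only so the output is a dense array that is easy to inspect.

```python
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.preprocessing import OneHotEncoder, StandardScaler

# Hypothetical training rows.
train = pd.DataFrame({
    "age": [20.0, 30.0, 40.0, 50.0],
    "fare": [10.0, 20.0, 30.0, 40.0],
    "sex": ["male", "female", "female", "male"],
    "embarked": ["S", "C", "S", "S"],
})

# Hypothetical test row with a category ("Q") not present in training.
test = pd.DataFrame({
    "age": [35.0],
    "fare": [25.0],
    "sex": ["female"],
    "embarked": ["Q"],
})

preprocessor = ColumnTransformer(
    transformers=[
        ("num", StandardScaler(), ["age", "fare"]),
        ("cat", OneHotEncoder(handle_unknown="ignore"), ["sex", "embarked"]),
    ],
    sparse_threshold=0,  # assumption for this sketch: always return a dense array
)

# Learn means, standard deviations, and categories from the training rows only.
train_matrix = preprocessor.fit_transform(train)

# Reuse those learned statistics on the test rows.
test_matrix = preprocessor.transform(test)

print(train_matrix.shape)  # (4, 6): 2 scaled columns + 2 sex + 2 embarked
print(test_matrix)
```

Because of `handle_unknown="ignore"`, the unseen "Q" value becomes all zeros in the embarked columns instead of raising an error.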
