Data Classifying by Random Forest
A projects that use random forest to classify data.
Description
This is a projects that used Python to implement random forest, which could be used for classification and prediction. It combines the output of multiple decision trees to reach a single result. I used it on two datasets about survivor on the Titanic and Amazon reviews.
(Image from https://medium.com/@roiyeho/random-forests-98892261dc49)
Features
Analysis .pkl data and output result.
Used Python to implement random forest algorithm.
Trained random forest to predict if passengers on the Titanic could survive based on Titanic dataset.
Trained random forest to predict the author of a review on Amazon based on a high dimension dataset about Amazon reviews.
Conducted bootstrap aggregating technique to improve the stability and accuracy of model.