Run-d1/Employee-Attrition-Analysis

This repository contains a comprehensive study on employee attrition analysis using data mining techniques. It includes data preprocessing, visualization, and predictive modeling (with algorithms such as Decision Tree, Random Forest, and Logistic Regression) to identify key factors influencing attrition, using the IBM HR dataset.

Employee Attrition Analysis

Overview

This project leverages data mining techniques to analyze employee attrition using the IBM HR Analytics dataset. The goal is to identify key factors influencing attrition and build predictive models to aid human resource departments in improving employee retention strategies.

The repository includes:

HR-Employee-Attrition.csv: Original dataset before preprocessing.
HR-Employee-Attrition-Updated.csv: Dataset after preprocessing.
HR-employee-attrition.ipynb: Jupyter notebook (Python) containing the data preprocessing code.
Paper/EmployeeAttrition_Paper_Group1.pdf: The final project paper detailing the methodology and findings.

Dataset

The dataset is a fictional IBM HR Analytics dataset designed to simulate employee attrition scenarios.

Source: Kaggle IBM HR Analytics Dataset

Methodology

Data Preprocessing: Cleaning, encoding, and feature selection.
Modeling: Decision Tree, Random Forest, and Logistic Regression.
Evaluation: Logistic Regression was selected as the best model based on recall.
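
The steps above can be sketched roughly as follows. This is a minimal illustration with pandas and scikit-learn, not the notebook's actual code: the function names `prepare` and `compare_by_recall`, the `Attrition` target column with "Yes"/"No" labels, the 75/25 split, and all hyperparameters are assumptions made for the example.

```python
import pandas as pd
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import recall_score
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.tree import DecisionTreeClassifier


def prepare(df, target="Attrition"):
    """Split a raw frame into encoded features and a 0/1 target.

    Assumes the target column holds "Yes"/"No" strings (hypothetical here).
    """
    y = (df[target] == "Yes").astype(int)
    # One-hot encode categorical columns; numeric columns pass through.
    X = pd.get_dummies(df.drop(columns=[target]), drop_first=True)
    return X, y


def compare_by_recall(X, y, seed=42):
    """Fit the three classifiers and return {model name: test-set recall}."""
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.25, stratify=y, random_state=seed
    )
    models = {
        "Decision Tree": DecisionTreeClassifier(random_state=seed),
        "Random Forest": RandomForestClassifier(random_state=seed),
        # Scaling helps the logistic regression solver converge.
        "Logistic Regression": make_pipeline(
            StandardScaler(), LogisticRegression(max_iter=1000)
        ),
    }
    scores = {}
    for name, model in models.items():
        model.fit(X_train, y_train)
        scores[name] = recall_score(y_test, model.predict(X_test))
    return scores
```

Under these assumptions, calling `prepare(pd.read_csv("HR-Employee-Attrition.csv"))` and passing the result to `compare_by_recall` would give one recall value per model, and the model with the highest value would be the one selected. The numbers produced this way need not match those reported in the paper.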

Key Findings

Monthly income and overtime work significantly impact attrition.
Logistic Regression achieved the highest recall of the three models, at ~59%.
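
For readers unfamiliar with the metric, recall is the share of employees who actually left that the model correctly flagged. The short sketch below shows the calculation. The counts are hypothetical, chosen only to produce a value near the reported figure, and are not taken from the paper.

```python
def recall(true_positives, false_negatives):
    """Recall = TP / (TP + FN): the share of actual leavers the model flags."""
    actual_positives = true_positives + false_negatives
    if actual_positives == 0:
        raise ValueError("recall is undefined with no positive cases")
    return true_positives / actual_positives


# Hypothetical counts: 100 employees left, and the model flagged 59 of them.
print(recall(true_positives=59, false_negatives=41))  # 0.59
```

A recall of about 59% therefore means roughly four in ten departing employees would go unflagged by the model.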
