Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up

This repository contains a comprehensive study on employee attrition analysis using data mining techniques. It includes data preprocessing, visualization, and predictive modeling (with algorithms such as Decision Tree, Random Forest, and Logistic Regression) to identify key factors influencing attrition, using the IBM HR dataset.

NotificationsYou must be signed in to change notification settings

Run-d1/Employee-Attrition-Analysis

Repository files navigation

Overview

This project leverages data mining techniques to analyze employee attrition using the IBM HR Analytics dataset. The goal is to identify key factors influencing attrition and build predictive models to aid human resource departments in improving employee retention strategies.

The repository includes:

  • HR-Employee-Attrition.csv: Original dataset before preprocessing.
  • HR-Employee-Attrition-Updated.csv: Dataset after preprocessing.
  • HR-employee-attrition.ipynb: The Python script for data preprocessing.
  • paper/EmployeeAttrition_Paper_Group1.pdf: The final project paper detailing the methodology and findings.

Dataset

The dataset is a fictional IBM HR Analytics dataset designed to simulate employee attrition scenarios.

Methodology

  • Data Preprocessing: Cleaning, encoding, and feature selection.
  • Modeling: Decision Tree, Random Forest, and Logistic Regression.
  • Evaluation: Logistic Regression was selected as the best model based on recall.

Key Findings

  • Monthly income and overtime work significantly impact attrition.
  • Logistic Regression demonstrated the highest performance with a recall of ~59%.

About

This repository contains a comprehensive study on employee attrition analysis using data mining techniques. It includes data preprocessing, visualization, and predictive modeling (with algorithms such as Decision Tree, Random Forest, and Logistic Regression) to identify key factors influencing attrition, using the IBM HR dataset.

Topics

Resources

Stars

Watchers

Forks


[8]ページ先頭

©2009-2025 Movatter.jp