Uh oh!
There was an error while loading.Please reload this page.
- Notifications
You must be signed in to change notification settings - Fork332
Feature engineering package with sklearn like functionality
License
feature-engine/feature_engine
Folders and files
| Name | Name | Last commit message | Last commit date | |
|---|---|---|---|---|
Repository files navigation
| Open Source | |
| Tutorials | |
| Code | |
| Downloads | |
| Meta | |
| Documentation | |
| Citation | |
| Testing |
Feature-engine is a Python library with multiple transformers to engineer and select features for use in machine learning models.Feature-engine's transformers follow Scikit-learn's functionality with fit() and transform() methods to learn thetransforming parameters from the data and then transform it.
Feature-engine: A new open-source Python package for feature engineering
Practical Code Implementations of Feature Engineering for Machine Learning with Python
We want to share Feature-engine with more people. It'd help us loads if you tell ushow you discovered us.
Then we'd know what we are doing right and which channels to use to share the love.
Please share your story by answering 1 quick questionat this link. 😃
- Missing Data Imputation
- Categorical Encoding
- Discretisation
- Outlier Capping or Removal
- Variable Transformation
- Variable Creation
- Variable Selection
- Datetime Features
- Time Series
- Preprocessing
- Scaling
- Scikit-learn Wrappers
- MeanMedianImputer
- ArbitraryNumberImputer
- RandomSampleImputer
- EndTailImputer
- CategoricalImputer
- AddMissingIndicator
- DropMissingData
- OneHotEncoder
- OrdinalEncoder
- CountFrequencyEncoder
- MeanEncoder
- WoEEncoder
- RareLabelEncoder
- DecisionTreeEncoder
- StringSimilarityEncoder
- EqualFrequencyDiscretiser
- EqualWidthDiscretiser
- GeometricWidthDiscretiser
- DecisionTreeDiscretiser
- ArbitraryDiscreriser
- Winsorizer
- ArbitraryOutlierCapper
- OutlierTrimmer
- LogTransformer
- LogCpTransformer
- ReciprocalTransformer
- ArcsinTransformer
- PowerTransformer
- BoxCoxTransformer
- YeoJohnsonTransformer
- MeanNormalizationScaler
- MathFeatures
- RelativeFeatures
- CyclicalFeatures
- DecisionTreeFeatures()
- DropFeatures
- DropConstantFeatures
- DropDuplicateFeatures
- DropCorrelatedFeatures
- SmartCorrelationSelection
- ShuffleFeaturesSelector
- SelectBySingleFeaturePerformance
- SelectByTargetMeanPerformance
- RecursiveFeatureElimination
- RecursiveFeatureAddition
- DropHighPSIFeatures
- SelectByInformationValue
- ProbeFeatureSelection
- MRMR
- DatetimeFeatures
- DatetimeSubtraction
- DatetimeOrdinal
- LagFeatures
- WindowFeatures
- ExpandingWindowFeatures
- Pipeline
- make_pipeline
- MatchCategories
- MatchVariables
- SklearnTransformerWrapper
From PyPI using pip:
pip install feature_engineFrom Anaconda:
conda install -c conda-forge feature_engineOr simply clone it:
git clone https://github.com/feature-engine/feature_engine.git>>>importpandasaspd>>>fromfeature_engine.encodingimportRareLabelEncoder>>>data= {'var_A': ['A']*10+ ['B']*10+ ['C']*2+ ['D']*1}>>>data=pd.DataFrame(data)>>>data['var_A'].value_counts()
Out[1]:A 10B 10C 2D 1Name: var_A, dtype: int64>>>rare_encoder=RareLabelEncoder(tol=0.10,n_categories=3)>>>data_encoded=rare_encoder.fit_transform(data)>>>data_encoded['var_A'].value_counts()
Out[2]:A 10B 10Rare 3Name: var_A, dtype: int64Find more examples in ourJupyter Notebook Galleryor in thedocumentation.
Details about how to contribute can be found in theContribute Page
Briefly:
- Fork the repo
- Clone your fork into your local computer:
git clone https://github.com/<YOURUSERNAME>/feature_engine.git- navigate into the repo folder
cd feature_engine- Install Feature-engine as a developer:
pip install -e .- Optional: Create and activate a virtual environment with any tool of choice
- Install Feature-engine developer dependencies:
pip install -e ".[tests]"- Create a feature branch with a meaningful name for your feature:
git checkout -b myfeaturebranch- Develop your feature, tests and documentation
- Make sure the tests pass
- Make a PR
Thank you!!
Feature-engine documentation is built usingSphinx and is hosted onRead the Docs.
To build the documentation make sure you have the dependencies installed: from the root directory:
pip install -r docs/requirements.txtNow you can build the docs using:
sphinx-build -b html docs buildThe content of this repository is licensed under aBSD 3-Clause license.
Sponsor us and support further ourmission to democratize machine learning and programming tools through open-sourcesoftware.
About
Feature engineering package with sklearn like functionality
Topics
Resources
License
Code of conduct
Contributing
Uh oh!
There was an error while loading.Please reload this page.
Stars
Watchers
Forks
Sponsor this project
Uh oh!
There was an error while loading.Please reload this page.
